rwkv.cpp/rwkv
Alex 3587ff9e58
Sync ggml with upstream (#38)
* Sync ggml with upstream

* Remove file filters from Actions triggers

* Update ggml

* Add Q4_2 and Q4_3 support

* Improve output of perplexity measuring script

* Add tests for new formats

* Add token limit argument to perplexity measuring script

* Update README

* Update README

* Update ggml

* Use master branch of ggml
2023-04-22 20:25:29 +05:00
File                        Last commit                                                              Date
20B_tokenizer.json          Add text generation and chat scripts                                     2023-04-02 15:03:31 +04:00
chat_with_bot.py            Improve the prompt & fix chinese display issue & support commands (#34)  2023-04-22 12:48:44 +05:00
convert_pytorch_to_ggml.py  Sync ggml with upstream (#38)                                            2023-04-22 20:25:29 +05:00
generate_completions.py     suggestions                                                              2023-04-03 08:25:54 +02:00
measure_pexplexity.py       Sync ggml with upstream (#38)                                            2023-04-22 20:25:29 +05:00
merge_lora_into_ggml.py     Add LoRA loading support                                                 2023-04-15 20:46:30 +04:00
quantize.py                 Sync ggml with upstream (#38)                                            2023-04-22 20:25:29 +05:00
requirements.txt            Add text generation and chat scripts                                     2023-04-02 15:03:31 +04:00
rwkv_cpp_model.py           Add Q4_1_O format                                                        2023-04-07 09:55:39 +04:00
rwkv_cpp_shared_library.py  Remove reference impl comparison test                                    2023-04-08 10:01:29 +04:00
sampling.py                 Add text generation and chat scripts                                     2023-04-02 15:03:31 +04:00