rwkv.cpp/rwkv
Alex 1198892888
Add support for Q5_0, Q5_1 and Q8_0 formats; remove Q4_1_O format (#44)
* Remove Q4_3 support

* Add Q5_0, Q5_1, Q8_0 support

* Add more clear message when loading Q4_3 model

* Remove Q4_1_O format

* Fix indentation in .gitmodules

* Simplify sanitizer matrix
2023-04-29 17:39:11 +05:00
..
20B_tokenizer.json Add text generation and chat scripts 2023-04-02 15:03:31 +04:00
chat_with_bot.py Improve chat_with_bot.py script (#39) 2023-04-22 20:33:58 +05:00
convert_pytorch_to_ggml.py Add support for Q5_0, Q5_1 and Q8_0 formats; remove Q4_1_O format (#44) 2023-04-29 17:39:11 +05:00
generate_completions.py suggestions 2023-04-03 08:25:54 +02:00
measure_pexplexity.py Sync ggml with upstream (#38) 2023-04-22 20:25:29 +05:00
merge_lora_into_ggml.py Add LoRA loading support 2023-04-15 20:46:30 +04:00
quantize.py Add support for Q5_0, Q5_1 and Q8_0 formats; remove Q4_1_O format (#44) 2023-04-29 17:39:11 +05:00
requirements.txt Add text generation and chat scripts 2023-04-02 15:03:31 +04:00
rwkv_cpp_model.py Add Q4_1_O format 2023-04-07 09:55:39 +04:00
rwkv_cpp_shared_library.py Add support for Q5_0, Q5_1 and Q8_0 formats; remove Q4_1_O format (#44) 2023-04-29 17:39:11 +05:00
sampling.py Add text generation and chat scripts 2023-04-02 15:03:31 +04:00