rwkv.cpp

History

Alex 1198892888 Add support for Q5_0, Q5_1 and Q8_0 formats; remove Q4_1_O format (#44 ) * Remove Q4_3 support * Add Q5_0, Q5_1, Q8_0 support * Add more clear message when loading Q4_3 model * Remove Q4_1_O format * Fix indentation in .gitmodules * Simplify sanitizer matrix		2023-04-29 17:39:11 +05:00
..
20B_tokenizer.json	Add text generation and chat scripts	2023-04-02 15:03:31 +04:00
chat_with_bot.py	Improve chat_with_bot.py script (#39 )	2023-04-22 20:33:58 +05:00
convert_pytorch_to_ggml.py	Add support for Q5_0, Q5_1 and Q8_0 formats; remove Q4_1_O format (#44 )	2023-04-29 17:39:11 +05:00
generate_completions.py	suggestions	2023-04-03 08:25:54 +02:00
measure_pexplexity.py	Sync ggml with upstream (#38 )	2023-04-22 20:25:29 +05:00
merge_lora_into_ggml.py	Add LoRA loading support	2023-04-15 20:46:30 +04:00
quantize.py	Add support for Q5_0, Q5_1 and Q8_0 formats; remove Q4_1_O format (#44 )	2023-04-29 17:39:11 +05:00
requirements.txt	Add text generation and chat scripts	2023-04-02 15:03:31 +04:00
rwkv_cpp_model.py	Add Q4_1_O format	2023-04-07 09:55:39 +04:00
rwkv_cpp_shared_library.py	Add support for Q5_0, Q5_1 and Q8_0 formats; remove Q4_1_O format (#44 )	2023-04-29 17:39:11 +05:00
sampling.py	Add text generation and chat scripts	2023-04-02 15:03:31 +04:00