rwkv.cpp/rwkv
Alex dea929f8ca
Various improvements & upgrade ggml (#75)
* Use types from typing for better compatibility with older Python versions

* Split last double end of line token as per BlinkDL's suggestion

* Fix MSVC warnings

* Drop Q4_2 support

* Update ggml

* Bump file format version for quantization changes

* Apply suggestions
2023-05-27 16:02:24 +05:00
..
prompt punish repetitions & break if END_OF_TEXT & decouple prompts from chat script (#37) 2023-04-30 18:50:05 +05:00
20B_tokenizer.json Add text generation and chat scripts 2023-04-02 15:03:31 +04:00
chat_with_bot.py Various improvements & upgrade ggml (#75) 2023-05-27 16:02:24 +05:00
convert_pytorch_to_ggml.py Various improvements & upgrade ggml (#75) 2023-05-27 16:02:24 +05:00
convert_pytorch_to_ggml.test.py Various improvements & upgrade ggml (#75) 2023-05-27 16:02:24 +05:00
generate_completions.py Flush output every token in generate_completions.py (#73) 2023-05-26 17:23:58 +05:00
measure_pexplexity.py Sync ggml with upstream (#38) 2023-04-22 20:25:29 +05:00
merge_lora_into_ggml.py Various improvements & upgrade ggml (#75) 2023-05-27 16:02:24 +05:00
quantize.py Various improvements & upgrade ggml (#75) 2023-05-27 16:02:24 +05:00
requirements.txt Add text generation and chat scripts 2023-04-02 15:03:31 +04:00
rwkv_cpp_model.py Silence PyTorch warnings by using untyped storage (#72) 2023-05-26 17:21:18 +05:00
rwkv_cpp_shared_library.py Various improvements & upgrade ggml (#75) 2023-05-27 16:02:24 +05:00
sampling.py Add text generation and chat scripts 2023-04-02 15:03:31 +04:00