rwkv.cpp

History

Alex dea929f8ca Various improvements & upgrade ggml (#75 ) * Use types from typing for better compatibility with older Python versions * Split last double end of line token as per BlinkDL's suggestion * Fix MSVC warnings * Drop Q4_2 support * Update ggml * Bump file format version for quantization changes * Apply suggestions		2023-05-27 16:02:24 +05:00
..
prompt	punish repetitions & break if END_OF_TEXT & decouple prompts from chat script (#37 )	2023-04-30 18:50:05 +05:00
20B_tokenizer.json	Add text generation and chat scripts	2023-04-02 15:03:31 +04:00
chat_with_bot.py	Various improvements & upgrade ggml (#75 )	2023-05-27 16:02:24 +05:00
convert_pytorch_to_ggml.py	Various improvements & upgrade ggml (#75 )	2023-05-27 16:02:24 +05:00
convert_pytorch_to_ggml.test.py	Various improvements & upgrade ggml (#75 )	2023-05-27 16:02:24 +05:00
generate_completions.py	Flush output every token in generate_completions.py (#73 )	2023-05-26 17:23:58 +05:00
measure_pexplexity.py	Sync ggml with upstream (#38 )	2023-04-22 20:25:29 +05:00
merge_lora_into_ggml.py	Various improvements & upgrade ggml (#75 )	2023-05-27 16:02:24 +05:00
quantize.py	Various improvements & upgrade ggml (#75 )	2023-05-27 16:02:24 +05:00
requirements.txt	Add text generation and chat scripts	2023-04-02 15:03:31 +04:00
rwkv_cpp_model.py	Silence PyTorch warnings by using untyped storage (#72 )	2023-05-26 17:21:18 +05:00
rwkv_cpp_shared_library.py	Various improvements & upgrade ggml (#75 )	2023-05-27 16:02:24 +05:00
sampling.py	Add text generation and chat scripts	2023-04-02 15:03:31 +04:00