* Use types from typing for better compatibility with older Python versions * Split last double end of line token as per BlinkDL's suggestion * Fix MSVC warnings * Drop Q4_2 support * Update ggml * Bump file format version for quantization changes * Apply suggestions |
||
|---|---|---|
| .. | ||
| prompt | ||
| 20B_tokenizer.json | ||
| chat_with_bot.py | ||
| convert_pytorch_to_ggml.py | ||
| convert_pytorch_to_ggml.test.py | ||
| generate_completions.py | ||
| measure_pexplexity.py | ||
| merge_lora_into_ggml.py | ||
| quantize.py | ||
| requirements.txt | ||
| rwkv_cpp_model.py | ||
| rwkv_cpp_shared_library.py | ||
| sampling.py | ||