rwkv.cpp

History

LoganDark 3f8bb2c080 Allow creating multiple contexts per model (#83 ) * Allow creating multiple contexts per model This allows for parallel inference and I am preparing to support sequence mode using a method similar to this * Fix cuBLAS * Update rwkv.h Co-authored-by: Alex <saharNooby@users.noreply.github.com> * Update rwkv.cpp Co-authored-by: Alex <saharNooby@users.noreply.github.com> * Inherit print_errors from parent ctx when cloning * Add context cloning test * Free * Free ggml context when last rwkv_context is freed * Free before exit * int main * add explanation of ffn_key_size * Update rwkv_instance and rwkv_context comments * Thread safety notes --------- Co-authored-by: Alex <saharNooby@users.noreply.github.com>		2023-06-03 15:06:24 +05:00
..
CMakeLists.txt	Allow creating multiple contexts per model (#83 )	2023-06-03 15:06:24 +05:00
expected_logits.bin	Add robust automatic testing (#33 )	2023-04-20 11:00:35 +05:00
test_context_cloning.c	Allow creating multiple contexts per model (#83 )	2023-06-03 15:06:24 +05:00
test_ggml_basics.c	Various improvements (#52 )	2023-05-08 14:28:54 +05:00
test_tiny_rwkv.c	File parsing and memory usage optimization (#74 )	2023-05-31 16:31:19 +05:00
tiny-rwkv-660K-FP16.bin	Add robust automatic testing (#33 )	2023-04-20 11:00:35 +05:00
tiny-rwkv-660K-FP32.bin	Add robust automatic testing (#33 )	2023-04-20 11:00:35 +05:00