rwkv.cpp

Author	SHA1	Message	Date
Luciano	8d4a855c24	Add embedding mode with arg flag. Currently working (#282 ) * working but ugly * add arg flag, not working on embedding mode * typo * Working! Thanks to @nullhook * make params argument instead of hardcoded boolean. remove useless time check * start doing the instructions but not finished. This probably doesnt compile * Embeddings extraction support --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2023-03-24 17:05:13 +02:00
Georgi Gerganov	3cd8dde0d1	Revert "Fix memory allocation issues and seg faults" This reverts commit `4870e455b3`. Will provide the correct fix later	2023-03-24 06:22:28 +02:00
Georgi Gerganov	4870e455b3	Fix memory allocation issues and seg faults	2023-03-24 00:11:53 +02:00
Georgi Gerganov	483bab2e3d	Avoid the transposed X branch in the Z = X * Y matrix multiplication (#439 ) Should make results reproducible for different number of threads and batch sizes	2023-03-23 23:22:01 +02:00
Yusuf Kağan Hanoğlu	d5850c53ca	Add missing header for memcpy (#386 ) fixed: memcpy is not defined	2023-03-22 10:55:45 +02:00
Georgi Gerganov	928480ef5b	Init llama_context_params properly from CLI (#370 )	2023-03-22 07:45:14 +02:00
Georgi Gerganov	f5a77a629b	Introduce C-style API (#370 ) * Major refactoring - introduce C-style API * Clean up * Add <cassert> * Add <iterator> * Add <algorithm> .... * Fix timing reporting and accumulation * Measure eval time only for single-token calls * Change llama_tokenize return meaning	2023-03-22 07:32:36 +02:00