rwkv.cpp

Commit Graph

Author	SHA1	Message	Date
Mathmagician8191	82c4ac78f4	Add support for the world tokenizer (#86 ) * Add support for the world tokenizer * Move tokenizer logic to rwkv_tokenizer.py * Added test for the tokenizer	2023-06-08 16:37:18 +05:00
YorkZero	241350fde6	Feature add cublas support (#65 ) * chore: add ggml import in the head of rwkv.h * chore: add ggml import in the head of rwkv.h * feat: add cublas support * feat: update rwkv.cpp * feat: remove unused change * chore: fix linux build issue * chore: sync ggml and offload tensor to gpu * chore: comment out tensors which occurs error on GPU * chore: update comment and readme * chore: update ggml to recent * chore: add more performance test results * chore: add more performance test results * chore: fix problem of reading file more than 2 gb * chore: merge master * chore: remove unused comment * chore: fix for comments * Update README.md * Update rwkv.cpp --------- Co-authored-by: Alex <saharNooby@users.noreply.github.com>	2023-05-29 17:10:19 +05:00
Alex	dea929f8ca	Various improvements & upgrade ggml (#75 ) * Use types from typing for better compatibility with older Python versions * Split last double end of line token as per BlinkDL's suggestion * Fix MSVC warnings * Drop Q4_2 support * Update ggml * Bump file format version for quantization changes * Apply suggestions	2023-05-27 16:02:24 +05:00
柏园猫	1c363e6d5f	Fix encoding issue when loading prompt data (#58 ) * Fix encoding issue when loading prompt data * Update chat_with_bot.py Fix code style --------- Co-authored-by: Alex <saharNooby@users.noreply.github.com>	2023-05-13 21:53:54 +05:00
Alex	5eb8f09c14	Various improvements (#47 ) * Update ggml * Pack only rwkv.dll for Windows releases Test executables would not be packed anymore. * Move test code into a separate file * Remove redundant zeroing * Refactor chat script	2023-04-30 20:27:14 +05:00
Jarrett Ye	3621172428	punish repetitions & break if END_OF_TEXT & decouple prompts from chat script (#37 ) * punish repetitions & break if END_OF_TEXT * decouple prompts from chat_with_bot.py * improve code style * Update rwkv/chat_with_bot.py Co-authored-by: Alex <saharNooby@users.noreply.github.com> * Update rwkv/chat_with_bot.py Co-authored-by: Alex <saharNooby@users.noreply.github.com> * add types * JSON prompt --------- Co-authored-by: Alex <saharNooby@users.noreply.github.com>	2023-04-30 18:50:05 +05:00
Alex	c736ef5411	Improve chat_with_bot.py script (#39 )	2023-04-22 20:33:58 +05:00
Jarrett Ye	ac663631e1	Improve the prompt & fix chinese display issue & support commands (#34 ) * update the prompt * Fix/chinese display issue * remove debug code * support commands (#1) +reset +gen +i +qq +qa +++ ++ + * run_rnn before decode * remove debug code * deep copy logits * remove extra print() * print newline if reach max_tokens_per_generation * fix typo in init prompt * Update rwkv/chat_with_bot.py Co-authored-by: Alex <saharNooby@users.noreply.github.com> * Update rwkv/chat_with_bot.py Co-authored-by: Alex <saharNooby@users.noreply.github.com> * Update rwkv/chat_with_bot.py Co-authored-by: Alex <saharNooby@users.noreply.github.com> * Update rwkv/chat_with_bot.py Co-authored-by: Alex <saharNooby@users.noreply.github.com> * refine code & type annotation * add comments for commands * support change temp & top_p during chat. * set default language & prompt --------- Co-authored-by: Alex <saharNooby@users.noreply.github.com>	2023-04-22 12:48:44 +05:00
hypnopump	a9cb9adfd6	streaming output	2023-04-04 18:27:04 +02:00
hypnopump	6f3fb01913	suggestions	2023-04-03 08:25:54 +02:00
hypnopump	a64aaa81ec	initial addition	2023-04-03 00:52:26 +02:00
saharNooby	e0684e8104	Add text generation and chat scripts	2023-04-02 15:03:31 +04:00

12 Commits