|  saharNooby | 678f5233a5 | Add LoRA loading support | 2023-04-15 20:46:30 +04:00 | 
				
					
						|  saharNooby | e4268a36c8 | Update file format documentation | 2023-04-14 18:59:16 +04:00 | 
				
					
						|  saharNooby | 85db23c7de | Add script that measures perplexity | 2023-04-08 10:41:16 +04:00 | 
				
					
						|  saharNooby | e04baa032c | Remove reference impl comparison test | 2023-04-08 10:01:29 +04:00 | 
				
					
						|  saharNooby | c40941d9d0 | Add Q4_1_O format | 2023-04-07 09:55:39 +04:00 | 
				
					
						|  saharNooby | fa9ad13a39 | Free ggml context when model is garbage collected | 2023-04-06 20:27:33 +04:00 | 
				
					
						|  hypnopump | a9cb9adfd6 | streaming output | 2023-04-04 18:27:04 +02:00 | 
				
					
						|  PXLKSR | 977efba905 | we actually build a dylib on macos | 2023-04-04 10:19:06 +02:00 | 
				
					
						|  hypnopump | 0a0cabc4c7 | for consistency | 2023-04-03 08:27:00 +02:00 | 
				
					
						|  hypnopump | 6f3fb01913 | suggestions | 2023-04-03 08:25:54 +02:00 | 
				
					
						|  hypnopump | a64aaa81ec | initial addition | 2023-04-03 00:52:26 +02:00 | 
				
					
						|  saharNooby | e0684e8104 | Add text generation and chat scripts | 2023-04-02 15:03:31 +04:00 | 
				
					
						|  saharNooby | 935d16f5db | Move library wrapper to separate file, refactor code | 2023-04-02 12:24:40 +04:00 | 
				
					
						|  saharNooby | 972e28d48d | Implement INT4 conversion and inference | 2023-04-01 19:22:01 +04:00 | 
				
					
						|  saharNooby | a1e1d34c93 | Add Python wrapper for C library | 2023-04-01 16:02:22 +04:00 | 
				
					
						|  saharNooby | 7130a89d1f | [FILE FORMAT CHANGED] Reverse dimensions in ggml file (makes it more similar to llama.cpp format) | 2023-04-01 14:41:30 +04:00 | 
				
					
						|  saharNooby | f6d45baec0 | Support FP16 inference | 2023-04-01 11:53:49 +04:00 | 
				
					
						|  saharNooby | fe98c94a63 | [FILE FORMAT CHANGED] Use ggml_get_rows to get embedding | 2023-04-01 11:28:32 +04:00 | 
				
					
						|  saharNooby | 16ec7a5c18 | Add fail-fast version of the test | 2023-04-01 11:15:15 +04:00 | 
				
					
						|  saharNooby | 0fcb7c64c6 | Remove reference implementation code and test against pre-created logits | 2023-04-01 11:09:24 +04:00 | 
				
					
						|  saharNooby | 6fe9486cee | Finally, FP32 inference | 2023-04-01 10:06:39 +04:00 | 
				
					
						|  saharNooby | 61c6b1a4e0 | Add comparison against reference implementation script, implement state & logits saving | 2023-03-31 20:23:42 +04:00 | 
				
					
						|  saharNooby | d00f28581a | Add reference implementation of RWKV RNN | 2023-03-31 19:57:16 +04:00 | 
				
					
						|  saharNooby | fe272dc3d3 | Minor changes | 2023-03-31 10:24:12 +04:00 | 
				
					
						|  saharNooby | 873cb954d0 | Make ln0 work correctly | 2023-03-30 20:01:26 +04:00 | 
				
					
						|  saharNooby | 2f51451561 | Initial commit | 2023-03-30 17:55:30 +04:00 |