saharNooby
|
f6d45baec0
|
Support FP16 inference
|
2023-04-01 11:53:49 +04:00 |
saharNooby
|
fe98c94a63
|
[FILE FORMAT CHANGED] Use ggml_get_rows to get embedding
|
2023-04-01 11:28:32 +04:00 |
saharNooby
|
16ec7a5c18
|
Add fail-fast version of the test
|
2023-04-01 11:15:15 +04:00 |
saharNooby
|
0fcb7c64c6
|
Remove reference implementation code and test against pre-created logits
|
2023-04-01 11:09:24 +04:00 |
saharNooby
|
6fe9486cee
|
Finally, FP32 inference
|
2023-04-01 10:06:39 +04:00 |
saharNooby
|
61c6b1a4e0
|
Add comparison against reference implementation script, implement state & logits saving
|
2023-03-31 20:23:42 +04:00 |