saharNooby | f6d45baec0 | Support FP16 inference | 2023-04-01 11:53:49 +04:00
saharNooby | fe98c94a63 | [FILE FORMAT CHANGED] Use ggml_get_rows to get embedding | 2023-04-01 11:28:32 +04:00
saharNooby | 6fe9486cee | Finally, FP32 inference | 2023-04-01 10:06:39 +04:00
saharNooby | 61c6b1a4e0 | Add comparison against reference implementation script, implement state & logits saving | 2023-03-31 20:23:42 +04:00
saharNooby | 01d667f066 | Implement exp, max, 1_minus_x, sigmoid operators in ggml | 2023-03-31 19:04:35 +04:00
saharNooby | fe272dc3d3 | Minor changes | 2023-03-31 10:24:12 +04:00
saharNooby | 56bf4fc856 | Implement time mixing, fix matrix shape mismatch | 2023-03-30 20:29:41 +04:00
saharNooby | 873cb954d0 | Make ln0 work correctly | 2023-03-30 20:01:26 +04:00
saharNooby | 2f51451561 | Initial commit | 2023-03-30 17:55:30 +04:00