94a43faaca
Use the hub models for llama2.c ( #284 )
2023-07-31 12:51:14 +01:00
4bf2ebf836
Use u8 tensors for masks. ( #273 )
2023-07-29 11:32:58 +01:00
3eb2bc6d07
Softmax numerical stability. ( #267 )
...
* Softmax numerical stability.
* Fix the flash-attn test.
2023-07-28 13:13:01 +01:00
550a13a547
Use the binary decoder for llama2.c. ( #230 )
...
* Use the binary decoder for llama2.c.
* Add the temperature.
* Formatting tweak.
* Fix the rotary embeddings.
2023-07-24 10:56:08 +01:00
35b65fed88
Add llama2.c as an example. ( #229 )
...
* Start adding llama2.c.
* Model loading.
* Add the llama-v2 model.
* Start converting the weights.
* Rotary embedding tweaks.
* Get the model to generate some tokens.
2023-07-24 09:13:50 +01:00