cfab6e7616
Mention phi-v3 in the readmes. ( #2122 )
2024-04-24 20:54:24 +02:00
c4cfcf1539
Tweak the readme for phi and the default sample length. ( #1450 )
2023-12-16 18:11:36 -06:00
79eab519fd
Fix phi example ( #1436 )
...
* Fix phi example
* Remove the cuda mention.
---------
Co-authored-by: Laurent <laurent.mazare@gmail.com >
2023-12-15 07:01:10 -06:00
5e33c85c8f
Quantized version for phi-v2. ( #1430 )
...
* Quantized version for phi-v2.
* More quantized support.
2023-12-13 21:16:34 -06:00
00948eb656
Formatting tweak. ( #1111 )
2023-10-16 21:02:53 +01:00
af67672207
Add support for Puffin-Phi-v2. ( #1110 )
...
* Add support for Puffin-Phi-v2.
* Tweak the file name.
* Support the config for puffin-phi-v2.
* Update the readme.
2023-10-16 20:54:21 +01:00
1ce7fe2543
Add more examples to the phi readme. ( #956 )
2023-09-24 18:19:05 +01:00
912a3d63b0
Use the proper block size for quantizing models. ( #933 )
...
* Use the proper block size for quantizing models.
* Use the proper dimension.
2023-09-22 21:36:56 +01:00
df6f5240ba
Complete the mixformer implementation. ( #930 )
...
* Complete the mixformers implementation.
* Tweak the attention.
* Add the phi-1.5 example.
* Improve the phi example.
* Bugfix.
* Get the phi example to work.
2023-09-22 20:03:16 +01:00