|
902d0b9166
|
More model cloning. (#1126)
* More model cloning.
* More cloning on quantized models.
|
2023-10-18 21:55:46 +01:00 |
|
|
392fe02fba
|
Move the common quantized-nn code to a shared module. (#1063)
|
2023-10-09 06:22:22 +01:00 |
|
|
783735cf22
|
Use softmax-last-dim where possible. (#1057)
|
2023-10-08 13:16:42 +01:00 |
|
|
1fcac4afed
|
Expose a function to clear the KV cache on mixformers. (#964)
|
2023-09-26 05:41:07 +01:00 |
|
|
0007ae9c11
|
Add the quantized mixformer model. (#953)
* Add the quantized mixformer model.
* Add the quantized option in the phi example.
|
2023-09-24 15:03:48 +01:00 |
|