candle/candle-metal-kernels/src
Laurent Mazare b13a82a438 Separate quantized phi-3 implementation. (#2157)
* Separate quantized phi-3 implementation.

* Integrate the quantized phi3 model.

* Small fixes, get the generation to work properly.

* Keep the old llama implementation around.

* Change the default.
2024-05-04 10:14:57 +02:00