Files
candle/candle-transformers
Laurent Mazare d54e02d73d Avoid a contiguous call in the quantized phi 3 model. (#2209)
* Simplify the KvCache api.

* Avoid a contiguous call in the quantized phi3 model.
2024-05-23 21:24:55 +02:00
..
2024-03-23 15:26:09 +01:00
2024-03-02 18:50:01 +01:00

candle-transformers