Files
candle/candle-transformers
Laurent Mazare 3ad4770eb6 Use cat for faster MQA computation. (#2043)
* Use cat for faster MQA computation.

* Move the function to utils + use it in mistral.

* Use the shared repeat-kv in a few more models.

* Fix.
2024-04-12 09:15:10 +02:00
..
2024-03-23 15:26:09 +01:00
2024-03-02 18:50:01 +01:00

candle-transformers