candle/candle-examples
Laurent Mazare f9ecc84477 GQA support in the quantized model. (#555)
* GQA support in the quantized model.

* Fix the reshaping.

* Fix the main llama model.

* Infer the proper gqa from the model kind.
2023-08-22 19:41:10 +01:00
2023-07-26 07:48:10 +01:00

candle-examples