candle/candle-examples/examples
Laurent Mazare f9ecc84477 GQA support in the quantized model. (#555)
* GQA support in the quantized model.

* Fix the reshaping.

* Fix the main llama model.

* Infer the proper gqa from the model kind.
2023-08-22 19:41:10 +01:00
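
For readers unfamiliar with the term, grouped-query attention (GQA) lets several query heads share one key/value head, so the key/value heads have to be expanded (or the queries grouped) before the attention product. The sketch below illustrates that expansion on plain vectors. It is not the code from this commit: the function name `repeat_kv`, the `Vec<Vec<f32>>` layout (one flat vector per head), and the choice to repeat each head consecutively are all assumptions made for illustration.

```rust
/// Illustrative sketch (not candle's implementation): expand `kv.len()`
/// key/value heads to `n_head` heads for grouped-query attention.
///
/// Each key/value head is repeated `n_head / kv.len()` times in a row, so
/// query head `i` ends up paired with original key/value head `i / n_rep`.
fn repeat_kv(kv: &[Vec<f32>], n_head: usize) -> Vec<Vec<f32>> {
    let n_kv_head = kv.len();
    assert!(
        n_kv_head > 0 && n_head % n_kv_head == 0,
        "n_head must be a positive multiple of the number of key/value heads"
    );
    // The group size: how many query heads share one key/value head.
    let n_rep = n_head / n_kv_head;
    kv.iter()
        .flat_map(|head| std::iter::repeat(head.clone()).take(n_rep))
        .collect()
}
```

With two key/value heads and `n_head = 4`, the result has four heads in which entries 0 and 1 are copies of the first input head and entries 2 and 3 are copies of the second. A group size of 1 leaves the heads unchanged, which corresponds to ordinary multi-head attention.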