candle/candle-core
Laurent Mazare deee7612da Quantized version of mistral. (#1009)
* Quantized version of mistral.

* Integrate the quantized mistral variant.

* Use the quantized weight files.

* Tweak the quantization command.

* Fix the dtype when computing the rotary embeddings.

* Update the readme with the quantized version.

* Fix the decoding of the remaining tokens.
2023-09-30 18:25:47 +01:00

candle

Minimalist ML framework for Rust