candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-16 02:38:10 +00:00

Laurent Mazare deee7612da Quantized version of mistral. (#1009)

* Quantized version of mistral.

* Integrate the quantized mistral variant.

* Use the quantized weight files.

* Tweak the quantization command.

* Fix the dtype when computing the rotary embeddings.

* Update the readme with the quantized version.

* Fix the decoding of the remaining tokens.

2023-09-30 18:25:47 +01:00

coco_classes.rs

Move the yolo shared bits to a common place. (#548)

2023-08-22 13:03:07 +01:00

imagenet.rs

Move the imagenet specific bits to a separate file. (#571)

2023-08-23 16:42:09 +01:00

lib.rs

Streaming mode for reporting the generated tokens. (#1007)

2023-09-30 15:04:11 +01:00

token_output_stream.rs

Quantized version of mistral. (#1009)

2023-09-30 18:25:47 +01:00