mirror of https://github.com/huggingface/candle.git synced 2025-06-16 02:38:10 +00:00

Files

Laurent Mazare 10d47183c0 Quantized version of flux. (#2500 )

* Quantized version of flux.

* More generic sampling.

* Hook the quantized model.

* Use the newly minted gguf file.

* Fix for the quantized model.

* Default to avoid the faster cuda kernels.

2024-09-26 10:23:43 +02:00

src

Quantized version of flux. (#2500 )

2024-09-26 10:23:43 +02:00

tests

Soft Non-Maximum Suppression (#2400 )

2024-08-10 07:57:52 +02:00

Cargo.toml

Metavoice - first cut (#1717 )

2024-03-02 18:50:01 +01:00

README.md

Add some missing readme files. (#304 )

2023-08-02 10:57:12 +01:00

README.md

candle-transformers