mirror of https://github.com/huggingface/candle.git synced 2025-06-16 02:38:10 +00:00

Files

Laurent Mazare 30a958e5dd Quantized mixtral model (#1442 )

* Add the Mixtral model.

* Add more of the mixtral layers.

* Add the final layers for mixtral.

* Sketch the expert selection.

* Add some expert routing logic.

* Hopefully finish the routing logic for mixtral.

* Add the mixtral example.

* Fix the weight filenames.

* Bugfix.

* Another fix.

* Yet another fix + remove the unused pragma.

* Shape fix.

* Support for quantized mixtral.

* Support mixtral in the quantized example.

* Mlp or moe type.

* Fix the expert field namings.

* Refactor the mlp bit.

* More MoE logic.

* Add the MoE quantized logic.

* Fix the experts length.

2023-12-15 19:16:06 -06:00

examples

Quantized mixtral model (#1442 )

2023-12-15 19:16:06 -06:00

src

Metal part 1 - Scaffolding for metal. (#1308 )

2023-11-10 08:35:48 +01:00

build.rs

Add nvcc ccbin support to examples (#1401 )

2023-12-03 16:01:16 +00:00

Cargo.toml

Update for 0.3.1. (#1324 )

2023-11-11 18:48:52 +00:00

README.md

Add some missing readme files. (#304 )

2023-08-02 10:57:12 +01:00

README.md

candle-examples