Add the Mixtral model. (#1437)

* Add the Mixtral model.

* Add more of the mixtral layers.

* Add the final layers for mixtral.

* Sketch the expert selection.

* Add some expert routing logic.

* Hopefully finish the routing logic for mixtral.

* Add the mixtral example.

* Fix the weight filenames.

* Bugfix.

* Another fix.

* Yet another fix + remove the unused pragma.

* Shape fix.

* Add a readme.
This commit is contained in:
Laurent Mazare
2023-12-15 14:19:56 -06:00
committed by GitHub
parent 79eab519fd
commit 614842b311
4 changed files with 788 additions and 0 deletions

View File

@ -14,6 +14,7 @@ pub mod llama2_c_weights;
pub mod marian;
pub mod mistral;
pub mod mixformer;
pub mod mixtral;
pub mod mpt;
pub mod persimmon;
pub mod quantized_blip;