candle/candle-examples/examples
Laurent Mazare, commit 30a958e5dd: Quantized mixtral model (#1442)
* Add the Mixtral model.
* Add more of the mixtral layers.
* Add the final layers for mixtral.
* Sketch the expert selection.
* Add some expert routing logic.
* Hopefully finish the routing logic for mixtral (see the routing sketch after this list).
* Add the mixtral example.
* Fix the weight filenames.
* Bugfix.
* Another fix.
* Yet another fix + remove the unused pragma.
* Shape fix.
* Support for quantized mixtral.
* Support mixtral in the quantized example.
* Mlp or moe type.
* Fix the expert field namings.
* Refactor the mlp bit.
* More MoE logic.
* Add the MoE quantized logic (see the combination sketch after this list).
* Fix the experts length.
2023-12-15 19:16:06 -06:00
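
The expert routing added in this change follows Mixtral's top-2 gating: compute a softmax over the per-expert router logits, keep the two highest-scoring experts, and renormalize their weights. The sketch below illustrates that idea for a single token in plain, dependency-free Rust rather than the tensor-based code in the example; the function name `route_top2` and the logits in `main` are made up for the demo.

```rust
// Minimal sketch of Mixtral-style top-2 expert routing for one token,
// assuming the router has already produced one gate logit per expert.
fn route_top2(gate_logits: &[f32]) -> (Vec<usize>, Vec<f32>) {
    // Softmax over the gate logits to get routing probabilities.
    let max = gate_logits.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    let exps: Vec<f32> = gate_logits.iter().map(|&l| (l - max).exp()).collect();
    let sum: f32 = exps.iter().sum();
    let probs: Vec<f32> = exps.iter().map(|&e| e / sum).collect();

    // Pick the two experts with the highest probabilities.
    let mut idx: Vec<usize> = (0..probs.len()).collect();
    idx.sort_by(|&a, &b| probs[b].partial_cmp(&probs[a]).unwrap());
    let top2 = vec![idx[0], idx[1]];

    // Renormalize the two selected weights so they sum to 1.
    let w_sum = probs[top2[0]] + probs[top2[1]];
    let weights = vec![probs[top2[0]] / w_sum, probs[top2[1]] / w_sum];
    (top2, weights)
}

fn main() {
    // Eight experts, as in Mixtral 8x7B; the logits here are invented.
    let gate_logits = [0.3, 2.1, -0.7, 0.9, 1.8, -1.2, 0.0, 0.4];
    let (experts, weights) = route_top2(&gate_logits);
    println!("selected experts {:?} with weights {:?}", experts, weights);
}
```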
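After routing, each token only passes through its two selected experts, and their MLP outputs are summed using the renormalized routing weights. The sketch below shows that combination step; `Expert`, its single `scale` parameter, and `moe_forward` are placeholders for illustration, not the layers used in the actual example.

```rust
// Stand-in for a per-expert MLP; the real experts are gated MLPs over
// tensors (quantized or not), not a single scale over a Vec<f32>.
struct Expert {
    scale: f32, // placeholder parameter
}

impl Expert {
    fn forward(&self, x: &[f32]) -> Vec<f32> {
        x.iter().map(|&v| v * self.scale).collect()
    }
}

// output = sum_i weight_i * expert_i(x), over the routed experts only.
fn moe_forward(experts: &[Expert], routed: &[(usize, f32)], x: &[f32]) -> Vec<f32> {
    let mut out = vec![0.0f32; x.len()];
    for &(expert_idx, weight) in routed {
        let y = experts[expert_idx].forward(x);
        for (o, v) in out.iter_mut().zip(y) {
            *o += weight * v;
        }
    }
    out
}

fn main() {
    let experts: Vec<Expert> = (1..=8).map(|i| Expert { scale: i as f32 }).collect();
    // One token routed to experts 1 and 4 with renormalized weights.
    let routed = [(1usize, 0.6f32), (4, 0.4)];
    let x = [1.0f32, -2.0, 0.5];
    println!("{:?}", moe_forward(&experts, &routed, &x));
}
```

The quantized variant keeps the same routing and combination structure; the difference is that the expert weights are stored and evaluated in a quantized format.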