Commit Graph

510 Commits

614842b311 Add the Mixtral model. (#1437)
* Add the Mixtral model.

* Add more of the mixtral layers.

* Add the final layers for mixtral.

* Sketch the expert selection.

* Add some expert routing logic.

* Hopefully finish the routing logic for mixtral.

* Add the mixtral example.

* Fix the weight filenames.

* Bugfix.

* Another fix.

* Yet another fix + remove the unused pragma.

* Shape fix.

* Add a readme.
2023-12-15 14:19:56 -06:00
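The expert selection sketched in this commit boils down to a softmax over per-token gate logits followed by a top-k pick (Mixtral routes each token to 2 of its 8 experts). Below is a plain-Rust illustration of that idea, not the candle implementation; the `top_k_experts` name and the fixed `k` are assumptions for the example.

```rust
/// Pick the top-k experts for one token from its gate logits and return
/// (expert index, normalized weight) pairs. Plain-Rust sketch of the routing
/// idea, not the candle Mixtral code.
fn top_k_experts(gate_logits: &[f32], k: usize) -> Vec<(usize, f32)> {
    // Softmax over the gate logits (max-subtracted for numerical stability).
    let max = gate_logits.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    let exps: Vec<f32> = gate_logits.iter().map(|&x| (x - max).exp()).collect();
    let sum: f32 = exps.iter().sum();
    let probs: Vec<f32> = exps.iter().map(|&e| e / sum).collect();

    // Sort expert indices by probability and keep the k largest.
    let mut idx: Vec<usize> = (0..probs.len()).collect();
    idx.sort_by(|&a, &b| probs[b].partial_cmp(&probs[a]).unwrap());
    let selected = &idx[..k.min(idx.len())];

    // Renormalize the selected weights so they sum to one.
    let norm: f32 = selected.iter().map(|&i| probs[i]).sum();
    selected.iter().map(|&i| (i, probs[i] / norm)).collect()
}

fn main() {
    // Eight experts, route the token to its two best ones (Mixtral uses k = 2).
    let gate_logits = [0.1, 2.3, -0.5, 1.7, 0.0, -1.2, 0.4, 0.9];
    for (expert, weight) in top_k_experts(&gate_logits, 2) {
        println!("expert {expert} weight {weight:.3}");
    }
}
```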
79eab519fd Fix phi example (#1436)
* Fix phi example

* Remove the cuda mention.

---------

Co-authored-by: Laurent <laurent.mazare@gmail.com>
2023-12-15 07:01:10 -06:00
5e33c85c8f Quantized version for phi-v2. (#1430)
* Quantized version for phi-v2.

* More quantized support.
2023-12-13 21:16:34 -06:00
2b3a018be7 Support for phi-2. (#1429)
* Support for phi-2.

* Use the v2 naming scheme.
2023-12-13 20:59:29 -06:00
9bd94c1ffa Speed up bert with approx gelu (#1410) 2023-12-06 17:46:37 +01:00
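The speedup comes from swapping the exact erf-based GELU for the usual tanh approximation. A standalone sketch of the approximate formula (plain Rust, not the candle kernel):

```rust
/// Tanh approximation of GELU: 0.5 * x * (1 + tanh(sqrt(2/pi) * (x + 0.044715 * x^3))).
/// Standalone sketch of the formula only.
fn gelu_approx(x: f32) -> f32 {
    const SQRT_2_OVER_PI: f32 = 0.797_884_56; // sqrt(2.0 / pi)
    0.5 * x * (1.0 + (SQRT_2_OVER_PI * (x + 0.044715 * x * x * x)).tanh())
}

fn main() {
    for x in [-2.0f32, -1.0, 0.0, 1.0, 2.0] {
        println!("gelu_approx({x}) = {:.4}", gelu_approx(x));
    }
}
```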
8418154ee0 Add nvcc ccbin support to examples (#1401) 2023-12-03 16:01:16 +00:00
99b7273b03 Add compute cap env support to examples (#1400) 2023-12-03 16:00:24 +00:00
16161145ae Add the leo models to the quantized examples. (#1398) 2023-12-03 12:30:41 +00:00
0738df5290 Add more mentions to SDXL Turbo in the readme. (#1397) 2023-12-03 10:41:21 +00:00
37bf1ed012 Stable Diffusion Turbo Support (#1395)
* Add support for SD Turbo

* Set Leading as default in euler_ancestral discrete

* Use the appropriate default values for n_steps and guidance_scale.

---------

Co-authored-by: Laurent <laurent.mazare@gmail.com>
2023-12-03 08:37:10 +01:00
5aa1a65dab Add quantized Starling, fix open-chat prompt (#1393)
* Add quantized Starling, fix open-chat prompt

* Fix open-chat and starling prompts
2023-12-02 16:47:19 +00:00
7c3cfd1086 Use the llama weight names for the Yi example. (#1381) 2023-11-27 20:42:52 +00:00
762e996ce6 DistilBERT (#1366)
* add bce with logit loss

* add bce with logit loss

* remove imports

* fix tiny bug

* add test documentation and refactor function

* fix test cases and formatting

* add distilbert files

* Apply various cleanups.

* More cleanups.

* More polish.

---------

Co-authored-by: laurent <laurent.mazare@gmail.com>
2023-11-24 15:09:14 +00:00
ca19a9af62 Fix linspace implementation (#1358)
* Fix linspace implementation

`steps` should be strictly greater than 1 to make it consistent with the context.

* Handle steps == 0 and steps == 1.

* Fix rustfmt.

---------

Co-authored-by: laurent <laurent.mazare@gmail.com>
2023-11-23 07:35:13 +00:00
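A plain-Rust sketch of the edge-case handling this fix describes: with `steps > 1` the spacing is `(stop - start) / (steps - 1)`, `steps == 1` yields just `start`, and `steps == 0` yields an empty range. The signature below is illustrative, not the candle one.

```rust
/// Evenly spaced values from `start` to `stop` inclusive, handling the
/// steps == 0 and steps == 1 cases explicitly. Sketch only.
fn linspace(start: f64, stop: f64, steps: usize) -> Vec<f64> {
    match steps {
        0 => Vec::new(),
        1 => vec![start],
        _ => {
            let delta = (stop - start) / (steps - 1) as f64;
            (0..steps).map(|i| start + i as f64 * delta).collect()
        }
    }
}

fn main() {
    assert_eq!(linspace(0.0, 1.0, 0), Vec::<f64>::new());
    assert_eq!(linspace(0.0, 1.0, 1), vec![0.0]);
    assert_eq!(linspace(0.0, 1.0, 5), vec![0.0, 0.25, 0.5, 0.75, 1.0]);
    println!("linspace edge cases ok");
}
```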
ec23427d60 Ensure to copy data to cpu before iterating. (#1360) 2023-11-23 07:24:25 +00:00
f49bf6a81d Fix OpenChat 3.5 tokenizer (#1347) 2023-11-19 18:48:04 +00:00
992a788da1 Add OpenChat 3.5 to quantized examples (#1346)
* Add OpenChat to quantized examples

* Add chat prompt

* Make the openchat example more in line with the other models.

* Fix a typo.

---------

Co-authored-by: laurent <laurent.mazare@gmail.com>
2023-11-19 18:28:52 +00:00
9ab3f9729f Use the whisper-v3 tokenizer now that it has been added. (#1337)
* Use the whisper-v3 tokenizer now that it has been added.

* Use the appropriate nospeech token.
2023-11-16 22:10:31 +00:00
92a05b51cf fix: address clippy 0.1.74 issues (#1336)
- clippy::needless-borrows-for-generic-args
- clippy::reserve-after-initialization
2023-11-16 21:15:22 +00:00
f4fcf60900 Update readme.md (#1322)
Update the readme to be consistent with the other examples. If you try to run it as previously written, you will get a "cannot find the path specified" error.
2023-11-12 09:46:19 +00:00
12561b31d3 Fix pose estimation image path (#1326) 2023-11-12 09:45:26 +00:00
a209ce8ceb Update for 0.3.1. (#1324) 2023-11-11 18:48:52 +00:00
a007f8fdb4 Add the Yi-6b and Yi-34b models. (#1320)
* Add the Yi-6b model.

* Add the 34b model.

* Add the yi example.

* Fix the weight file names.
2023-11-11 12:00:48 +01:00
2341aa079e Fix quantized zephyr chat prompt (#1314) (#1317)
* Fix quantized zephyr chat prompt (#1314)

* Avoid using a mutable variable.

---------

Co-authored-by: Laurent <laurent.mazare@gmail.com>
2023-11-11 09:14:12 +01:00
26c4e5bf1d Metal part 1 - Scaffolding for metal. (#1308)
* Metal part 1 - Scaffolding for metal.

* Remove tracing.
2023-11-10 08:35:48 +01:00
18d30005c5 Add support to UL2 model family (#1300)
* Add support to UL2 model family

* Update docs with UL2

* Create ActivationWithOptionalGating to avoid polluting activations

* Also refactor quantized t5

* Remove useless conversion

* Revert Activation::NewGelu name change

* Remove useless return

* Apply rustfmt and clippy recommendations

* Reuse t5::ActivationWithOptionalGating in quantized version

* (cosmetic change) use a match rather than ifs + avoid early returns.

---------

Co-authored-by: Laurent <laurent.mazare@gmail.com>
2023-11-09 18:55:09 +01:00
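For context, the gating that `ActivationWithOptionalGating` captures is the gated feed-forward used by UL2 / T5-v1.1 style checkpoints: the hidden state goes through two input projections, one is passed through the activation and multiplied element-wise with the other. A minimal sketch on already-projected vectors; the `h0`/`h1` names are illustrative, not the candle code.

```rust
fn relu(x: f32) -> f32 {
    x.max(0.0)
}

/// Combine the two feed-forward input projections depending on whether the
/// activation is gated: plain -> act(h0); gated (UL2 / T5-v1.1 style) -> act(h0) * h1.
/// `h0` and `h1` stand for the outputs of the two input projections.
fn apply_activation(act: fn(f32) -> f32, gated: bool, h0: &[f32], h1: &[f32]) -> Vec<f32> {
    if gated {
        h0.iter().zip(h1).map(|(&a, &b)| act(a) * b).collect()
    } else {
        h0.iter().map(|&a| act(a)).collect()
    }
}

fn main() {
    let h0 = [0.5f32, -1.0, 2.0];
    let h1 = [1.0f32, 2.0, 0.5];
    println!("gated:   {:?}", apply_activation(relu, true, &h0, &h1));
    println!("ungated: {:?}", apply_activation(relu, false, &h0, &h1));
}
```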
6958384327 Add support for TrOCR Model (#1303)
* add bce with logit loss

* add bce with logit loss

* remove imports

* fix tiny bug

* add test documentation and refactor function

* fix test cases and formatting

* add trocr model

* fix formatting

* commit the actual model lol

* more formatting

* remove tokenizer config
2023-11-09 18:49:17 +01:00
2feb0b054f Add the mel filters for 128 bins. (#1295) 2023-11-08 08:23:53 +01:00
2d28497197 Preliminary support for whisper v3. (#1294)
* Preliminary support for whisper v3.

* Add the missing files.
2023-11-08 06:42:52 +01:00
7920b45c8a Support for timegroupnorm in encodec. (#1291) 2023-11-07 22:39:59 +01:00
d4a45c936a Quantized model small tweaks (#1290)
* Support the shape op in ONNX.

* Share the axis normalization bits.

* Add some limited support for gather.

* Unsqueeze.

* Comparison with broadcasting.

* Add Not + handle i32.

* Tweaks for the quantized model.
2023-11-07 21:21:37 +01:00
d5c2a7b64b Add info about MADLAD-400 in readme files (#1287) 2023-11-07 15:21:59 +01:00
508f811b93 Add support for MADLAD400 (#1285)
* Add support for madlad

* Add support for quantized MADLAD
2023-11-07 05:35:37 +01:00
a773a4b22b [ONNX] Support a couple more ops. (#1284)
* Support the shape op in ONNX.

* Share the axis normalization bits.

* Add some limited support for gather.

* Unsqueeze.

* Comparison with broadcasting.

* Add Not + handle i32.
2023-11-06 22:44:58 +01:00
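Of the ops added here, `Gather` is the one whose semantics benefit most from a tiny worked example: for axis 0 it simply selects rows of the input by index. A plain-Rust sketch of that axis-0 case only, not the candle ONNX implementation:

```rust
/// Axis-0 gather sketch: pick rows of `data` according to `indices`.
/// Mirrors the ONNX Gather semantics for axis = 0 with 1-D indices only.
fn gather_axis0<T: Clone>(data: &[Vec<T>], indices: &[usize]) -> Vec<Vec<T>> {
    indices.iter().map(|&i| data[i].clone()).collect()
}

fn main() {
    let data = vec![vec![1, 2], vec![3, 4], vec![5, 6]];
    // Gather rows 2 and 0 -> [[5, 6], [1, 2]].
    println!("{:?}", gather_axis0(&data, &[2, 0]));
}
```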
5a363dbc26 Adds check for 7b-zephyr and uses correct template (#1283)
* Adds check for 7b-zephyr and uses correct template

* Handle zephyr as mistral.

* Disable the protoc bits of the CI.

---------

Co-authored-by: Laurent <laurent.mazare@gmail.com>
2023-11-06 21:05:39 +01:00
2a45bcf943 Put the onnx example behind a feature flag. (#1276)
* Put the onnx example behind a feature flag.

* Exclude the onnx bits from the workspace.

* README tweaks.
2023-11-06 07:45:07 +01:00
f365a075e5 Add more models to the onnx example. (#1273)
* Add more models to the onnx example.

* Input validation.

* Input validation.

* Bugfix.

* Implement clip.

* BatchNorm support.

* Get the efficientnet onnx to work.
2023-11-05 16:57:26 +01:00
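Among the ops listed above, inference-time `BatchNorm` is just normalization with the stored running statistics followed by a scale and shift, y = gamma * (x - mean) / sqrt(var + eps) + beta. A per-channel sketch (illustrative, not the candle kernel):

```rust
/// Inference-time batch norm on a single channel: normalize with the running
/// statistics, then scale and shift. Sketch of the formula only.
fn batch_norm(x: &[f32], mean: f32, var: f32, gamma: f32, beta: f32, eps: f32) -> Vec<f32> {
    let inv_std = 1.0 / (var + eps).sqrt();
    x.iter().map(|&v| gamma * (v - mean) * inv_std + beta).collect()
}

fn main() {
    let x = [0.0f32, 1.0, 2.0, 3.0];
    println!("{:?}", batch_norm(&x, 1.5, 1.25, 1.0, 0.0, 1e-5));
}
```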
928a9d906e [ONNX] Do not generate values for constants. (#1272)
* Do not generate values for constants.

* Add an onnx based example using squeezenet.
2023-11-05 11:23:14 +01:00
e08fbb6543 Add support for distil whisper (#1245)
* Add support for distil-whisper.

* Add distil-large.

* Rename the large model.
2023-11-02 19:32:35 +01:00
c12ad45562 Add a KV cache to marian decoding. (#1226) 2023-10-31 08:47:44 +00:00
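A KV cache keeps the key/value tensors produced at earlier decoding steps so that each new step only has to compute them for the newest token. A bare-bones sketch of the caching pattern with illustrative types, not the candle/marian API:

```rust
/// Bare-bones KV cache sketch: each decoding step appends the new token's key
/// and value vectors instead of recomputing them for the whole prefix.
#[derive(Default)]
struct KvCache {
    keys: Vec<Vec<f32>>,
    values: Vec<Vec<f32>>,
}

impl KvCache {
    /// Append the key/value for the token processed at this step and return
    /// everything cached so far, ready for the attention computation.
    fn append(&mut self, k: Vec<f32>, v: Vec<f32>) -> (&[Vec<f32>], &[Vec<f32>]) {
        self.keys.push(k);
        self.values.push(v);
        (&self.keys, &self.values)
    }
}

fn main() {
    let mut cache = KvCache::default();
    for step in 0..3 {
        // In a real decoder these would come from the K/V projections of the new token.
        let (ks, vs) = cache.append(vec![step as f32; 4], vec![step as f32; 4]);
        println!("step {step}: {} cached keys, {} cached values", ks.len(), vs.len());
    }
}
```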
7d0202710b Instructions for generating the tokenizer configs for marian-mt. (#1225) 2023-10-31 07:56:26 +01:00
392a00a147 Add support for the marian base model. (#1221) 2023-10-30 19:20:36 +00:00
4c967b9184 Use the hub files for the marian example. (#1220)
* Use the hub files for the marian example.

* Use the secondary decoder.

* Add a readme.

* More readme.
2023-10-30 17:29:36 +00:00
969960847a Bugfixes for marian-mt. (#1219)
* Bugfixes for marian-mt.

* Apply the final decoding head.

* More fixes.
2023-10-30 11:44:19 +00:00
174b208052 PyO3: Better shape handling (#1143)
* Negative and `*args` shape handling

* Rename to `PyShapeWithHole` + validate that only one hole exists

* Regenerate stubs

---------

Co-authored-by: Laurent Mazare <laurent.mazare@gmail.com>
2023-10-29 15:41:44 +00:00
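Resolving a shape that contains a single `-1` "hole" amounts to dividing the total element count by the product of the known dimensions, after validating that there is at most one hole. A small sketch with illustrative names, not the actual PyO3 bindings:

```rust
/// Resolve a shape that may contain a single -1 "hole" given the total number
/// of elements. Errors if more than one hole is present or the known
/// dimensions do not divide the element count. Sketch only.
fn resolve_shape(dims: &[i64], elem_count: usize) -> Result<Vec<usize>, String> {
    let holes = dims.iter().filter(|&&d| d == -1).count();
    if holes > 1 {
        return Err("at most one -1 dimension is allowed".to_string());
    }
    let known: usize = dims.iter().filter(|&&d| d != -1).map(|&d| d as usize).product();
    dims.iter()
        .map(|&d| {
            if d == -1 {
                if known == 0 || elem_count % known != 0 {
                    Err(format!("cannot infer the -1 dimension for {elem_count} elements"))
                } else {
                    Ok(elem_count / known)
                }
            } else {
                Ok(d as usize)
            }
        })
        .collect()
}

fn main() {
    // 12 elements reshaped to (3, -1) -> (3, 4).
    println!("{:?}", resolve_shape(&[3, -1], 12));
    println!("{:?}", resolve_shape(&[-1, -1], 12)); // rejected: two holes
}
```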
7bbde55c61 Marian MT model (#1210)
* Skeleton files for the marian MT model.

* Marian initialization.

* Implement the attention forward method.

* Forward pass for the encoder side.

* Expose the encoder and decoder.

* Start plugging the decoder.

* Forward pass for the decoder layer.

* Set up the marian example.

* Add some missing backtraces.

* Bugfix.
2023-10-29 15:12:22 +00:00
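The attention forward method mentioned above is, at its core, scaled dot-product attention: softmax(q · k / sqrt(d)) used to weight the value vectors. A single-query, plain-Rust sketch of that core, not the Marian/candle implementation:

```rust
/// Single-query scaled dot-product attention: weight each value vector by
/// softmax(q . k / sqrt(d)). Sketch of the core computation only.
fn attention(q: &[f32], keys: &[Vec<f32>], values: &[Vec<f32>]) -> Vec<f32> {
    let scale = (q.len() as f32).sqrt();
    // Dot product of the query with every key, scaled by sqrt(d).
    let scores: Vec<f32> = keys
        .iter()
        .map(|k| q.iter().zip(k).map(|(a, b)| a * b).sum::<f32>() / scale)
        .collect();
    // Softmax over the scores (max-subtracted for numerical stability).
    let max = scores.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    let exps: Vec<f32> = scores.iter().map(|&s| (s - max).exp()).collect();
    let sum: f32 = exps.iter().sum();
    // Weighted sum of the value vectors.
    let mut out = vec![0.0f32; values[0].len()];
    for (w, v) in exps.iter().zip(values) {
        for (o, x) in out.iter_mut().zip(v) {
            *o += (w / sum) * x;
        }
    }
    out
}

fn main() {
    let q = vec![1.0f32, 0.0];
    let keys = vec![vec![1.0f32, 0.0], vec![0.0, 1.0]];
    let values = vec![vec![1.0f32, 2.0], vec![3.0, 4.0]];
    println!("{:?}", attention(&q, &keys, &values));
}
```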
55bc3382cf Allow for different behavior between training and eval (#1213)
* Forward with training.

* Do not use dropout on vgg evaluation.
2023-10-29 07:53:09 +01:00
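The training/eval split shown here is just a flag threaded through the forward pass so that dropout fires during training and becomes the identity in eval mode. A minimal self-contained sketch; the `forward_t` name and the tiny fixed-seed LCG are assumptions for the example, not the candle API.

```rust
/// Minimal dropout sketch: zero activations with probability `p` and rescale
/// the rest while training; act as the identity in eval mode.
struct Dropout {
    p: f32,
    state: u64,
}

impl Dropout {
    fn new(p: f32) -> Self {
        Self { p, state: 0x9E3779B97F4A7C15 }
    }

    /// Tiny fixed-seed LCG standing in for a proper RNG, good enough for a sketch.
    fn next_unit(&mut self) -> f32 {
        self.state = self.state.wrapping_mul(6364136223846793005).wrapping_add(1);
        (self.state >> 40) as f32 / (1u64 << 24) as f32
    }

    /// Forward pass with an explicit training flag, mirroring the idea of
    /// "different behavior between training and eval".
    fn forward_t(&mut self, xs: &[f32], train: bool) -> Vec<f32> {
        if !train || self.p == 0.0 {
            return xs.to_vec(); // eval: identity, no dropout
        }
        let scale = 1.0 / (1.0 - self.p);
        xs.iter()
            .map(|&x| if self.next_unit() < self.p { 0.0 } else { x * scale })
            .collect()
    }
}

fn main() {
    let mut dropout = Dropout::new(0.5);
    let xs = [1.0f32, 2.0, 3.0, 4.0];
    println!("train: {:?}", dropout.forward_t(&xs, true));
    println!("eval:  {:?}", dropout.forward_t(&xs, false));
}
```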
dece37c6f4 feat: implement VGG13, VGG16 and VGG19 (#1211)
* feat: implement VGG13, VGG16 and VGG19

* Cosmetic fixes.

* More cosmetic tweaks + avoid re-loading the weights on each final layer.

---------

Co-authored-by: Laurent <laurent.mazare@gmail.com>
2023-10-29 06:10:23 +00:00
498c50348c Add DDPG and fix Gym wrapper (#1207)
* Fix Gym wrapper
- It was returning things in the wrong order
- Gym now differentiates between terminated and truncated

* Add DDPG

* Apply fixes

* Remove Result annotations

* Also remove Vec annotation

* rustfmt

* Various small improvements (avoid cloning, mutability, get clippy to pass, ...)

---------

Co-authored-by: Travis Hammond <travis.hammond@alexanderthamm.com>
Co-authored-by: Laurent <laurent.mazare@gmail.com>
2023-10-28 19:53:34 +01:00
012ae0090e Infer the config for llama2-c. (#1208) 2023-10-28 19:00:39 +01:00