candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-16 10:38:54 +00:00

Author	SHA1	Message	Date
Laurent Mazare	d60eba1408	Streamline the glm4 example. (#2694 )	2024-12-31 09:21:41 +01:00
jetsung	460616fc84	Update README.org (#2670 ) The command line error in the CPU section of the documentation.	2024-12-30 11:32:02 +01:00
Akshay Ballal	91f1f019b1	Added XLMRobertaModel for Reranking (#2686 ) * add xlm-roberta-base * Add task enum for fill-mask and reranker in xlm-roberta example; update README and fix attention mask dimensions - Introduced a new `Task` enum to replace string task identifiers in the xlm-roberta example. - Updated the logic in `main.rs` to handle tasks using the new enum. - Enhanced README with example output for fill-mask task. - Fixed dimension retrieval in `prepare_4d_attention_mask` function for better clarity and safety. * Clippy fix. --------- Co-authored-by: laurent <laurent.mazare@gmail.com>	2024-12-30 11:16:57 +01:00
Edgar Riba	5c2f893e5a	make DepthAnythingV2 more reusable (#2675 ) * make DepthAnythingV2 more reusable * Fix clippy lints. --------- Co-authored-by: laurent <laurent.mazare@gmail.com>	2024-12-21 12:06:03 +01:00
cdoko	145aa7193c	Add Nvembed v2 model (#2649 ) * Update mod.rs * Create mod.rs * Create decoder.rs * Create model.rs * Create main.rs * Create README.md * Update README.md * Update main.rs * Update and rename decoder.rs to embedding.rs * Update mod.rs * Update model.rs	2024-12-03 10:56:01 +01:00
iskng	4f59ed38b0	Adds support for stella_en_v5 embedding model -400M variant (#2608 ) * Adds support for stella_en_v5 embedding model -400M variant * Unified stella * WIP: Unified Stella * Combined stella for both 1.5B and 400M variants * Cargo fmt for the CI * removed redundant stella-400m model and example after merge into stella-en-v5 * cargo fmt --all --------- Co-authored-by: Anubhab Bandyopadhyay <4890833+AnubhabB@users.noreply.github.com> Co-authored-by: laurent <laurent.mazare@gmail.com>	2024-11-29 09:01:08 +01:00
Anubhab Bandyopadhyay	54e7fc3c97	Lint fixes introduced with Rust 1.83 (#2646 ) * Fixes for lint errors introduced with Rust 1.83 * rustfmt * Fix more lints. --------- Co-authored-by: Laurent <laurent.mazare@gmail.com>	2024-11-28 23:00:21 +01:00
Adam Nelson	23ed8a9ded	Fix for whisper-microphone example failure if audio isn't chunk aligned (#2645 ) At least on my macOS Sequoia system (MBP 14" 2021, M1 Pro), when I run the `whisper-microphone` example after it has gathered 10 seconds of audio, it fails before the transcription: ``` Error: Insufficient buffer size 384 for input channel 0, expected 1024 ``` At least for the audio device I'm using (Airpods Pro Max), there is no guarantee that each audio buffer is a multiple of 1024 samples. Thus at the end of the 10 seconds, `buffered_pcm` can have some samples at the end that do not form a complete 1024 sample chunk. This fixes that by tracking when there is a partial chunk at the end of the buffer, and leaving it in `buffered_pcm` to be processed on the next loop iteration. Note that, in the interest of keeping this PR as small as possible, I didn't make any other changes to this example.	2024-11-27 22:35:11 +01:00
Laurent Mazare	00d8a0c178	Remove some unused macros. (#2618 ) * Remove some unused macros. * More unused fixes.	2024-11-15 16:46:55 +01:00
Laurent Mazare	3fba2b5fc4	Add the SmolLM2 models. (#2595 ) * Add the SmolLM2 models. * More SmolLM2 support.	2024-11-03 17:11:12 +01:00
Czxck001	530ab96036	Support Skip Layer Guidance (SLG) for Stable Diffusion 3.5 Medium (#2590 ) * support skip layer guidance (slg) for stable diffusion 3.5 medium * Tweak the comments formatting. * Proper error message. * Cosmetic tweaks. --------- Co-authored-by: Laurent <laurent.mazare@gmail.com>	2024-11-01 18:10:40 +01:00
Laurent Mazare	7ac0de15a9	Lazy upcasting for t5. (#2589 )	2024-10-30 18:08:51 +01:00
Czxck001	d232e132f6	Support sd3.5 medium and MMDiT-X (#2587 ) * extract attn out of joint_attn * further adjust attn and joint_attn * add mmdit-x support * support sd3.5-medium in the example * update README.md	2024-10-30 06:19:07 +01:00
Laurent Mazare	139ff56aeb	Reduce memory usage for sd 3.5. (#2582 )	2024-10-28 22:45:02 +01:00
Laurent Mazare	498bc2cdc9	Release the mmdit model earlier to reduce memory usage. (#2581 ) * Stable diffusion 3.5 support. * Clippy fixes. * CFG fix. * Remove some unnecessary clones. * Avoid duplicating some of the code. * Release the mmdit model earlier to reduce memory usage.	2024-10-28 16:06:53 +01:00
Laurent Mazare	37e0ab8c64	Stable diffusion 3.5 support. (#2578 ) * Stable diffusion 3.5 support. * Clippy fixes. * CFG fix. * Remove some unnecessary clones. * Avoid duplicating some of the code.	2024-10-27 10:01:04 +01:00
Laurent Mazare	3699c1a053	Fix the repo name for llama 3.1. (#2576 ) * Fix the repo name for llama 3.1. * Fix the book.	2024-10-26 11:25:04 +02:00
Laurent Mazare	3d1dc06cdb	Enable stable-diffusion 3 on metal. (#2560 )	2024-10-14 08:59:12 +02:00
Anubhab Bandyopadhyay	f553ab5eb4	Adds support for Stella_en_v5 embedding model - 1.5B variant (#2551 ) * Stella_en_1.5B_v5 * Separated creation. This is a critical step for numerical accuracy and would be documented in the readme * EmbedDim would require clone and copy * WIP: example * Examples added * a litte more in README	2024-10-13 23:09:12 +02:00
Czxck001	6eab6b57f5	Fix the guide to gain access to Stable Diffusion 3 Medium (#2559 )	2024-10-13 22:55:26 +02:00
Czxck001	ca7cf5cb3b	Add Stable Diffusion 3 Example (#2558 ) * Add stable diffusion 3 example Add get_qkv_linear to handle different dimensionality in linears Add stable diffusion 3 example Add use_quant_conv and use_post_quant_conv for vae in stable diffusion adapt existing AutoEncoderKLConfig to the change add forward_until_encoder_layer to ClipTextTransformer rename sd3 config to sd3_medium in mmdit; minor clean-up Enable flash-attn for mmdit impl when the feature is enabled. Add sd3 example codebase add document crediting references pass the cargo fmt test pass the clippy test * fix typos * expose cfg_scale and time_shift as options * Replace the sample image with JPG version. Change image output format accordingly. * make meaningful error messages * remove the tail-end assignment in sd3_vae_vb_rename * remove the CUDA requirement * use default_value in clap args * add use_flash_attn to turn on/off flash-attn for MMDiT at runtime * resolve clippy errors and warnings * use default_value_t * Pin the web-sys dependency. * Clippy fix. --------- Co-authored-by: Laurent <laurent.mazare@gmail.com>	2024-10-13 22:08:40 +02:00
SethWen	0d96ec31e8	feat: intergrate chinese clip and add example (#2555 ) * start to impl chinese clip * impl vision model * copy code from bert * refactor use * refactor use again * fix text model * refactor * try to fix text model * tuning * tuning chinese clip * delete useless code * revert code * Clippy fixes. * Also apply cargo fmt. --------- Co-authored-by: laurent <laurent.mazare@gmail.com>	2024-10-10 15:18:55 +02:00
Akshay Ballal	937e8eda74	Add BertForMaskedLM to support SPLADE Models (#2550 ) * add bert for masked lm * working example * add example readme * Clippy fix. * And apply rustfmt. --------- Co-authored-by: Laurent <laurent.mazare@gmail.com>	2024-10-07 23:28:21 +02:00
Laurent Mazare	f856b5c3a7	pyo3 update. (#2545 ) * pyo3 update. * Stub fix.	2024-10-06 10:09:38 +02:00
dengelt	410c89f72a	Add required feature for whisper example in Readme (#2539 )	2024-10-04 14:29:55 +02:00
Laurent Mazare	56aacb05da	Make the RNN configs accessible from the models. (#2541 )	2024-10-04 14:22:23 +02:00
Laurent Mazare	90d04ff622	Support whisper large-v3 turbo in the whisper-microphone example. (#2533 )	2024-10-02 22:09:14 +02:00
Laurent Mazare	936300678d	Add whisper large-v3 turbo to the example. (#2531 )	2024-10-02 21:07:08 +02:00
Laurent Mazare	f479840ce6	Add a seed to the flux example. (#2529 )	2024-10-02 10:52:02 +02:00
Akshay Ballal	888d886dd8	Add ColPali (#2524 ) * add colpali * cleanup * fix clippy	2024-10-01 11:48:39 +02:00
Laurent Mazare	6110ad8d4f	Refactor the whisper microphone example. (#2523 ) * Refactor the whisper microphone example. * Tweak the whisper microphone example more.	2024-10-01 00:24:17 +02:00
Laurent Mazare	dfe9a00683	Pixtral polishing. (#2522 ) * Pixtral polishing. * Clippy fix.	2024-09-30 21:23:54 +02:00
Laurent Mazare	683ab698de	Add Pixtral. (#2521 ) * Add Pixtral. * More pixtral vision encoder. * Sketch a pixtral example. * Sketch a pixtral example. * Better image loading. * Support loading images embedded in safetensor files. * Clippy fixes. * Add the llava multimodal adapter. * Add more of the llava bits. * Add the pixtral config. * More pixtral inference. * Add the text generation bits. * Get the example to work. * Bugfix. * Run some bits of the model in f32. * Blessed version :) * Better rope frequency computations. * README update.	2024-09-30 19:31:14 +02:00
Laurent Mazare	2f49e1b534	Add PaliGemma. (#2519 ) * Add PaliGemma. * PaliGemma inference loop. * Running PaliGemma example. * Tweak the prompt.	2024-09-29 19:56:56 +02:00
Laurent Mazare	261ed65f36	Add the SigLIP model. (#2515 ) * Add the SigLIP model. * Add more to the forward pass of the vision model. * Complete the forward pass. * Add the siglip example. * Fix. * Another fix. * Get everything in place. * Add a readme.	2024-09-28 23:48:00 +02:00
Laurent Mazare	62525e8352	Remove some extra whitelines. (#2513 )	2024-09-28 14:41:28 +02:00
Laurent Mazare	ad8a4c5e5a	Add some llama-3.2 examples. (#2508 ) * Add some llama-3.2 examples. * Support tie-word-embeddings for llama.	2024-09-26 21:00:18 +02:00
Laurent Mazare	10d47183c0	Quantized version of flux. (#2500 ) * Quantized version of flux. * More generic sampling. * Hook the quantized model. * Use the newly minted gguf file. * Fix for the quantized model. * Default to avoid the faster cuda kernels.	2024-09-26 10:23:43 +02:00
Laurent Mazare	d01207dbf3	Add a RotatingKVCache. (#2493 ) * Add a RotatingKVCache. * Add some KvCache tests. * Test the reset too. * More kv-cache testing. * More tests for the rotating kv-cache. * Improve the api for the rotating cache so that the whole src tensor gets returned when it's overlarge. * Handle contiguity + bugfix + use in mimi. * Add a way to test the mimi streaming mode. * Mimi streaming fixes. * More rotating kv-cache. * Fix the attn mask generation. * Handle the abs case. * Add some tests for the generated mask.	2024-09-23 13:14:32 +02:00
Juan Gomez	5fc4f17727	Adding Granite 7b Instruct model example (#2487 ) * Adding Granite 7b Instruct model example * Minor refactoring to make it a little more idiomatic * Clippy fixes. * * Adding a README with some information about supported Granite models * Changing the default prompt to accomodate better the Language modality of the Granite 7b Instruct model --------- Co-authored-by: Laurent <laurent.mazare@gmail.com>	2024-09-21 11:52:01 +02:00
Laurent Mazare	c58c5d5b01	Add the mimi audio-tokenizer. (#2488 ) * Add the mimi audio-tokenizer. * Formatting tweaks. * Add a full example. * Use the transformers names. * More renamings. * Get encoding and decoding to work. * Clippy fixes.	2024-09-20 14:31:20 -06:00
Laurent Mazare	e3261216b1	Clippy fixes for 1.81.0. (#2461 ) * Clippy fixes for 1.81.0. * Another fix.	2024-09-05 23:46:55 +02:00
Eugene Hauptmann	c02b7c3272	Fix FLUX.1 weights (#2457 ) * fix FLUX.1 weights * added flux1-dev.safetensors	2024-08-29 17:10:28 +02:00
Jani Monoses	86613c00e2	MobileCLIP models S1 and S2 (#2454 ) * Allow loading images with given std and mean * OpenCLIP text encoder component * Two MobileCLIP models * Clippy fixes. --------- Co-authored-by: Laurent <laurent.mazare@gmail.com>	2024-08-29 15:38:58 +02:00
Jani Monoses	29e25c458d	FastViT fixes. (#2452 ) * correct optional SE layer dimensions. * head_dim instead of num_heads is 32. * update test example output.	2024-08-28 11:20:09 +02:00
Jani Monoses	ccdbe87639	Add FastViT model. (#2444 )	2024-08-23 16:06:54 +02:00
Laurent Mazare	2ec8729d51	Fix for parler-tts, do not add the last slice of padding tokens. (#2442 ) * Fix for parler-tts, do not add the last slice of padding tokens. * Support for the mini model.	2024-08-22 23:22:03 +02:00
shua	e3c146ada6	silero-vad v5 example (#2321 ) * silero-vad v5 example This change adds an example of how to run silero-vad v5 * PR: rename 'vad' to 'silero-vad' * Update README.md --------- Co-authored-by: Laurent Mazare <laurent.mazare@gmail.com>	2024-08-22 22:50:42 +02:00
Laurent Mazare	b47c0bc475	Update README.md (#2435 )	2024-08-19 09:34:24 +02:00
Laurent Mazare	14fd2d97e0	Add a readme for the parler-tts example. (#2434 ) * Add a readme for the parler-tts example. * Remove the python decode script. * mp4 tweaks. * Another readme tweak.	2024-08-19 09:30:12 +02:00

1 2 3 4 5 ...

753 Commits