candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-16 02:38:10 +00:00

Author	SHA1	Message	Date
Jani Monoses	86613c00e2	MobileCLIP models S1 and S2 (#2454 ) * Allow loading images with given std and mean * OpenCLIP text encoder component * Two MobileCLIP models * Clippy fixes. --------- Co-authored-by: Laurent <laurent.mazare@gmail.com>	2024-08-29 15:38:58 +02:00
Jani Monoses	29e25c458d	FastViT fixes. (#2452 ) * correct optional SE layer dimensions. * head_dim instead of num_heads is 32. * update test example output.	2024-08-28 11:20:09 +02:00
Laurent Mazare	aafa24ed93	Update cudarc to 0.12. (#2451 ) * Update cudarc to 0.12. * Some cudnn tweaks.	2024-08-27 10:10:30 +02:00
ilookee	fdc2622686	fix: qwen2 lm_head loading #2443 (#2445 ) Co-authored-by: Yi Xu <xuyi@me.com>	2024-08-23 16:50:02 +02:00
Jani Monoses	ccdbe87639	Add FastViT model. (#2444 )	2024-08-23 16:06:54 +02:00
Laurent Mazare	2ec8729d51	Fix for parler-tts, do not add the last slice of padding tokens. (#2442 ) * Fix for parler-tts, do not add the last slice of padding tokens. * Support for the mini model.	2024-08-22 23:22:03 +02:00
shua	e3c146ada6	silero-vad v5 example (#2321 ) * silero-vad v5 example This change adds an example of how to run silero-vad v5 * PR: rename 'vad' to 'silero-vad' * Update README.md --------- Co-authored-by: Laurent Mazare <laurent.mazare@gmail.com>	2024-08-22 22:50:42 +02:00
shua	1e96b8b695	onnx: support negative index in Gather (#2440 ) index_select does not support negative indexing, but this change adds just enough workarounds in onnx to allow evaluating silero-vad models (which make use of negative indices).	2024-08-22 15:28:25 +02:00
shua	a8288b7a72	onnx: workaround pow with negative base (#2439 ) * onnx: workaround pow with negative base rather than fully defining pow in the cpu backend (as in #2318), this implements a much smaller change which is sufficient to evaluate silero-vad onnx models. Specifically, checking if pow is run with 2.0 exponent, and if so evaluate as simply `xx` instead of the cpu backend of `e^(2.0 ln(x))`. * PR: use Tensor::powf insead powf correctly handles a negative base.	2024-08-22 13:34:53 +02:00
Laurent Mazare	6070278a31	Bump the version to 0.6.1. (#2438 )	2024-08-22 09:23:52 +02:00
Laurent Mazare	b47c0bc475	Update README.md (#2435 )	2024-08-19 09:34:24 +02:00
Laurent Mazare	14fd2d97e0	Add a readme for the parler-tts example. (#2434 ) * Add a readme for the parler-tts example. * Remove the python decode script. * mp4 tweaks. * Another readme tweak.	2024-08-19 09:30:12 +02:00
shua	31a1075f4b	onnx: implement LSTM op (#2268 ) use candle-nn LSTM	2024-08-19 09:06:17 +02:00
Laurent Mazare	236b29ff15	Add the DAC model. (#2433 ) * Add the DAC model. * More quantization support. * Handle DAC decoding. * Plug the DAC decoding in parler-tts.	2024-08-19 08:59:51 +02:00
Laurent Mazare	58197e1896	parler-tts support (#2431 ) * Start sketching parler-tts support. * Implement the attention. * Add the example code. * Fix the example. * Add the description + t5 encode it. * More of the parler forward pass. * Fix the positional embeddings. * Support random sampling in generation. * Handle EOS. * Add the python decoder. * Proper causality mask.	2024-08-18 20:42:08 +02:00
Laurent Mazare	736d8eb752	Stream tensor (#2429 ) * Support Minus(u) for arbitrary values of u, e.g. Minus(3). * Forces u to be strictly positive. * Add StreamTensor.	2024-08-17 21:54:28 +02:00
Laurent Mazare	7cff5898ec	Support Minus(u) for arbitrary values of u, e.g. Minus(3). (#2428 ) * Support Minus(u) for arbitrary values of u, e.g. Minus(3). * Forces u to be strictly positive.	2024-08-17 21:29:01 +02:00
Laurent Mazare	b75ef051cf	Fix the marian tokenizer importer. (#2426 ) * Fix the marian tokenizer importer. * Ignore the python caches.	2024-08-17 20:58:40 +02:00
Laurent Mazare	c1b9e07e35	Add support for gemma-2. (#2425 ) * Add gemma-2. * Support a couple more models. * Sliding window support. * Example + readme updates. * Update the main readme.	2024-08-17 20:31:23 +02:00
Laurent Mazare	69fdcfe96a	Apply rustfmt. (#2421 )	2024-08-16 18:57:14 +02:00
Hadi	2b75dd9551	Fix build issue in EOS Token in llama-multiprocess (#2420 )	2024-08-16 18:46:31 +02:00
Laurent Mazare	53ce65f706	Clippy fixes. (#2415 ) * Clippy fixes. * Bump the web_sys required version.	2024-08-14 10:13:53 +02:00
Laurent Mazare	68aa9c7320	Fix the device for the bert attention mask. (#2414 )	2024-08-14 10:01:12 +02:00
Jani Monoses	35e5f31397	Add Based LLM from Hazy Research. (#2411 )	2024-08-12 21:21:19 +02:00
Carsten Csiky	d3fe989d08	Add documentation examples for `Tensor::i` and `Tensor::narrow` methods (#2308 ) * Add documentation examples for `Tensor` methods * Apply fmt. * Cosmetic tweaks. --------- Co-authored-by: Laurent <laurent.mazare@gmail.com>	2024-08-10 08:11:09 +02:00
Matthew O'Malley-Nichols	14db029494	Soft Non-Maximum Suppression (#2400 ) * Soft NMS with thresholds * NMS Test * Soft nms w/ boxes removed below threshold * Soft nms test * No longer removing bounding boxes to fit Soft-NMS focus * Initialize confidence * Added comments * Refactored out updating based on IOU/sigma * Score_threshold -> confidence_threshold for clarity * Remove bboxes below confidence threshold * Softnms basic functionality test * Softnms confidence decay test * Softnms confidence threshold test * Softnms no overlapping bbox test * Testing confidence after no overlap test * Single bbox and no bbox tests * Signify test completion * Handling result of test functions * Checking all pairs of bboxes instead of a forward pass * Equal confidence overlap test * Clarified tests for implementation * No longer dropping boxes, just setting to 0.0 * Formatted w/ cargo	2024-08-10 07:57:52 +02:00
Joel Nises	6e6c1c99b0	Fix issues in the encodec example README.md (#2407 ) Also squeeze the first dimension of the codes tensor in the example file to get the expected three dimensions.	2024-08-10 07:49:05 +02:00
Hamir Mahal	b7d9af00cc	fix: usage of `actions/checkout@v2` (#2403 ) * chore: changes from formatting on save * fix: usage of `actions/checkout@v2`	2024-08-06 10:59:34 +02:00
Laurent Mazare	59bbc0d287	Add the import script for the T5 tokenizer. (#2399 )	2024-08-05 21:03:31 +02:00
Czxck001	dfdce2b602	Add the MMDiT model of Stable Diffusion 3 (#2397 ) * add mmdit of stable diffusion 3 lint add comments * correct a misplaced comment * fix cargo fmt * fix clippy error * use bail! instead of assert! * use get_on_dim in splitting qkv	2024-08-05 19:26:15 +02:00
唐璜	500c9f2882	add models support and example for THUDM/glm-4 (#2362 ) * add models support and example for THUDM/glm-4 * fix the ci report * fmt * fix * Update README.org * Update README.org * fmt * Update README.org * README.md add codegeex4 * README.md add glm4 * Typo. * change expect into ? --------- Co-authored-by: Laurent Mazare <laurent.mazare@gmail.com>	2024-08-05 17:48:09 +02:00
Laurent Mazare	2be9bd211e	Support for mistral-nemo. (#2396 )	2024-08-04 19:52:40 +02:00
Laurent Mazare	89eae41efd	Support the flux-dev model too. (#2395 )	2024-08-04 12:16:24 +02:00
MilkFather	c0a559d427	optimize gradient for silu a bit (#2393 )	2024-08-04 11:24:17 +02:00
Laurent Mazare	aa7ac1832d	Simplify handling of flux modulations. (#2394 )	2024-08-04 11:09:54 +02:00
Laurent Mazare	19db6b9723	Add the flux model for image generation. (#2390 ) * Add the flux autoencoder. * Add the encoder down-blocks. * Upsampling in the decoder. * Sketch the flow matching model. * More flux model. * Add some of the positional embeddings. * Add the rope embeddings. * Add the sampling functions. * Add the flux example. * Fix the T5 bits. * Proper T5 tokenizer. * Clip encoder path fix. * Get the clip embeddings. * No configurable weights in layer norm. * More weights related fixes. * Yet another shape fix. * DType fix. * Fix a couple more shape issues. * DType fixes. * Fix the latent dims. * Fix more shape issues. * Autoencoder fixes. * Get some generations out. * Bugfix. * T5 padding. * Clippy fix. * Add the decode only mode. * Fix. * More fixes. * Finally get some generations to work. * Add readme.	2024-08-04 08:14:33 +02:00
Laurent Mazare	0fcb40b229	Revert the bf16 gemm metal changes for now. (#2386 )	2024-08-01 23:08:47 +02:00
Justin Sing	6991a37b94	update: LSTMState and GRUState fields to be public (#2384 )	2024-08-01 16:30:32 +02:00
Laurent Mazare	9ca277a9d7	Fix cargo fmt. (#2383 ) * Fix cargo fmt. * Clippy fix. * Cosmetic tweaks.	2024-08-01 14:19:41 +02:00
Joan Fontanals	2e9c010609	Jina Bert Example fix and more configuration (#2191 ) * fix: fix jina bert example logic * feat: enable jina embeddings de * feat: allow more flexibility on Jina Bert	2024-08-01 13:59:20 +02:00
Jani Monoses	ac51f477eb	Add Hiera vision model. (#2382 )	2024-08-01 11:59:22 +02:00
Laurent Mazare	d4b6f6eef6	Add a minimal test for the metal bf16 matmul. (#2381 )	2024-08-01 11:22:46 +02:00
Laurent Mazare	957d604a78	Enable BF16 on metal. (#2380 )	2024-08-01 11:05:07 +02:00
Takanori MAEHARA	ce90287f45	Add get_ids to GradStore (#2379 )	2024-08-01 10:56:13 +02:00
Laurent Mazare	1ba87a9450	Use BF16 on metal when possible. (#2378 )	2024-08-01 10:48:58 +02:00
Yun-Jhong Wu	bd80078acf	Fix log_sum_exp to handle large positive/negative inputs (#2367 )	2024-08-01 10:37:02 +02:00
ivarflakstad	fea46cb719	Metal bgemm min changes (#2364 ) * Add updated mfa metallib * Add bgemm and tests	2024-08-01 10:06:04 +02:00
Laurent Mazare	8696cf6494	Enable the affine kernel for u8/u32. (#2376 )	2024-08-01 10:03:11 +02:00
Zheng Li	4a52aeb437	bert attention mask (#1934 ) * bert attention mask * Allow for using None as a mask. * Revert part of the changes so that the proper default mask applies. * Cosmetic change. * Another cosmetic tweak. --------- Co-authored-by: Laurent <laurent.mazare@gmail.com>	2024-08-01 08:26:19 +02:00
ivarflakstad	24d54d0ff9	Bump image crate version so ImageReader is available without aliasing (#2365 )	2024-07-29 17:41:33 +02:00

... 3 4 5 6 7 ...

2345 Commits