candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-16 10:38:54 +00:00

Author	SHA1	Message	Date
Laurent Mazare	eead1dcead	Clippy fix. (#1972 )	2024-03-31 08:57:40 +02:00
Santiago Medina	92f81d2fcb	Add Moondream transformer implementation and example (#1970 ) * moondream implementation * add moondream example * change config default activation * Add assets and integrate phi mixformer with example * Make use of kv cache and fix seq_len bug; Clean up example code * Add README link to example * Remove pos_embed scaling; Remove assets; Add to README; Expand VisionConfig * Delete image * Use apply instead of forward	2024-03-31 08:54:56 +02:00
Laurent Mazare	3144150b8d	Move the tensor-tools binary in a separate crate. (#1969 )	2024-03-30 15:49:37 +01:00
Laurent Mazare	8ad12a0e81	Add some examples using the MT5 variants. (#1963 )	2024-03-29 18:09:29 +01:00
Laurent Mazare	eb1b27abcd	Readme fix. (#1961 )	2024-03-28 23:24:46 +01:00
Laurent Mazare	708e422456	Qwen MoE model. (#1960 ) * Qwen MoE model. * Add the MoE model to the example. * Fix the scaling. * Readme updates. * Readme tweaks.	2024-03-28 23:10:57 +01:00
Laurent Mazare	c5092f2c29	Add a couple t5 models. (#1958 )	2024-03-28 17:58:06 +01:00
Tigran Zhampeissov	b0340d72ec	CLIP model implementation with example (#1950 ) * CLIP model implementation with example * CLIP Implementation fixes, batch images * CLIP model remove images from git * CLIP model remove unnecessary use of batch_indices	2024-03-28 13:44:12 +01:00
Laurent Mazare	e2b4829531	Support more mistral models. (#1927 ) * Support more mistral models. * Use the appropriate rope parameter.	2024-03-24 08:04:04 +01:00
Laurent Mazare	a00e24d752	Improve the error message on overlong prompts. (#1908 )	2024-03-21 21:08:07 +01:00
Sanchit Gandhi	bb3ee48039	whisper readme (#1899 )	2024-03-21 12:54:09 +01:00
Sanchit Gandhi	0c11e055be	support distil-large-v3 (#1898 )	2024-03-21 11:46:49 +01:00
Laurent Mazare	18036c6ccb	Update the image crate + use the re-exported version. (#1893 ) * Update the image crate + use the re-exported version. * Update to using ab_glyph.	2024-03-21 10:56:41 +01:00
Laurent Mazare	455c42aa72	Avoid copying the data on squeeze and unsqueeze. (#1884 ) * Avoid copying the data on squeeze and unsqueeze. * Fix the quantized llama example. * Unrelated fix for the quantized stable-lm example on cuda. * Fix for mamba on cuda (unrelated to the PR).	2024-03-20 13:04:36 +01:00
Laurent Mazare	f115895b9e	Apply rustfmt. (#1873 )	2024-03-18 21:43:31 +01:00
Gabriel	6a966cf9e0	Add a DQN example to the reinforcement-learning section (#1872 )	2024-03-18 21:22:53 +01:00
Laurent Mazare	58605252e8	Microphone support for the encodec example. (#1866 )	2024-03-18 11:19:46 +01:00
Laurent Mazare	d365ef32d9	Improve the encodec example: handle resampling. (#1865 ) * Improve the encodec example: handle resampling. * Play the audio directly.	2024-03-18 10:09:40 +01:00
Laurent Mazare	a15f859ab4	Fix for the encodec example. (#1861 )	2024-03-17 21:15:12 +01:00
Laurent Mazare	74bf6994b1	Move the image tensor to the appropriate device. (#1856 )	2024-03-16 22:25:46 +01:00
Jani Monoses	e1f9c3776d	StableLM-2 models were updated to use GPT-2 tokenization. (#1847 )	2024-03-14 21:01:36 +01:00
Tyler Rockwood	3318fe30fb	Update gemma README (#1843 ) * Update gemma README * Fixit	2024-03-13 21:41:36 +01:00
Laurent Mazare	56c9d3ee7b	Fix the model path for rwkv. (#1825 )	2024-03-09 11:21:48 +01:00
Laurent Mazare	dd00482ea3	Quantized version of the metavoice model. (#1824 ) * Quantized version of the metavoice model. * Integrate the quantized version of metavoice.	2024-03-09 11:06:04 +01:00
Laurent Mazare	3440cec3a0	Fast CPU kernel for transposed 1d convolutions. (#1822 ) * Fast CPU kernel for transposed 1d convolutions. * Bugfix.	2024-03-08 22:43:07 +01:00
Niklas Hallqvist	0a3487a776	Add a --seed argument to the stable-diffusion example. (#1812 ) * Add a --seed argument to the stable-diffusion example. * Make the case when no seed is specified, that it will not be set, but use the engine's default. This will make the CPU engine work again when no --seed is given, and will cause a bailout when a seed is there, as the engine does not currently support it. --------- Co-authored-by: niklas <niklas@appli.se>	2024-03-08 08:17:36 +01:00
Laurent Mazare	8a99cf7dd2	Add a flag to select the dtype used in metavoice. (#1805 )	2024-03-05 12:16:00 +01:00
Jiayu Liu	924ccae30c	Add an initial Segformer implementation (#1617 ) * add segformer * Make the id2label field optional. --------- Co-authored-by: laurent <laurent.mazare@gmail.com>	2024-03-03 16:01:46 +01:00
Laurent Mazare	60dc72b96b	More metavoice tweaks. (#1796 )	2024-03-03 15:05:25 +01:00
Laurent Mazare	20abb72fec	Normalize loudness of the generated audio (#1795 ) * Normalize loudness of the generated audio. * Lints. * One more lint. * Avoid running the bs1770 tests. * Another attempt at discarding doc comments. * Also normalize the loudness in the encodec example.	2024-03-03 14:00:42 +01:00
Laurent Mazare	ca5d727ba2	Use the same padding in metavoice as in the python version. (#1794 )	2024-03-03 12:04:48 +01:00
Laurent Mazare	09e0148cce	Tweaks to run metavoice on metal (#1792 ) * Enable tanh + tweak conv-transpose. * Run the encodec decoding on cpu. * Clippy fixes.	2024-03-03 07:46:44 +01:00
Laurent Mazare	de11623752	Metavoice position fix (#1791 ) * Add the metavoice transformer. * Sketch the speaker-encoder module. * Adding to the metavoice model. * Start adding the metavoice example. * Get some logits out. * Load the second stage model. * Get the second step to run. * Tweak the example. * Add encodec tilting. * Glue the different bits together. * Fix a shape issue. * Use a constant. * BPE tokenization. * Fix the position index in metavoice.	2024-03-02 21:00:35 +01:00
Laurent Mazare	21f1d04976	Add the instruction finetuned gemma variants. (#1790 )	2024-03-02 18:56:59 +01:00
Laurent Mazare	4fff5b51f5	Metavoice - first cut (#1717 ) * Add the metavoice transformer. * Sketch the speaker-encoder module. * Adding to the metavoice model. * Start adding the metavoice example. * Get some logits out. * Load the second stage model. * Get the second step to run. * Tweak the example. * Add encodec tilting. * Glue the different bits together. * Fix a shape issue. * Use a constant. * BPE tokenization. * Add a warning.	2024-03-02 18:50:01 +01:00
Jack Shih	6980774a91	fix rwkv example eos token (#1785 )	2024-03-01 10:22:28 +01:00
Laurent Mazare	64d4038e4f	Mention rwkv v6 in the readmes. (#1784 )	2024-03-01 08:58:30 +01:00
Jani Monoses	979deaca07	EfficientVit (MSRA) model (#1783 ) * Add EfficientVit (Microsoft Research Asia) model. * Mention models in README	2024-03-01 08:53:52 +01:00
Jack Shih	b485e4b6ee	add models of rwkv v6 and quantized rwkv v6 (#1781 ) * add models of rwkv v6 and quantized rwkv v6 * fix ci clippy fail	2024-03-01 08:37:56 +01:00
Laurent Mazare	4fd00b8900	Add the StarCoder2 model. (#1779 ) * Add the StarCoder2 model. * Add the example code and get things to work. * And also tweak the readme.	2024-02-28 21:02:41 +01:00
Laurent Mazare	57267cd536	Add a flag to force running the quantized model on CPUs. (#1778 ) * Add a flag to force running the quantized model on CPUs. * Add encodec to the readme.	2024-02-28 14:58:42 +01:00
Laurent Mazare	60ee5cfd4d	Support more modes in the encodec example. (#1777 ) * Support more modes in the encodec example. * Remove the old encodec model from the musicgen bits.	2024-02-28 09:22:33 +01:00
Laurent Mazare	d0aca6c3c6	Encodec encoding demo. (#1775 )	2024-02-28 06:49:03 +01:00
Laurent Mazare	0c49e95dfb	Encodec model. (#1771 ) * Encodec model. * Fixes. * Add the padding functions. * Get the LSTM bit to work. * Get the encodec model to generate some tokens (decoder only for now). * Minor tweak. * Minor tweak.	2024-02-27 22:59:40 +01:00
Laurent Mazare	32544a2ad6	Add an option to split the prompt. (#1766 )	2024-02-27 11:24:11 +01:00
Jack Shih	918136ba46	add quantized rwkv v5 model (#1743 ) * and quantized rwkv v5 model * Integrate the quantized rwkv model in the initial example. --------- Co-authored-by: laurent <laurent.mazare@gmail.com>	2024-02-25 21:43:40 +01:00
Laurent Mazare	2f22afd80e	Cuda acceleration for quantized model. (#1754 ) * Boilerplate for the quantized cuda support. * More basic cuda support. * More cuda quantization (quantize on cpu for now). * Add the dequantization bit. * Start adding some dedicated cuda kernels from llama.cpp. * Move the kernel code. * Start interfacing with the kernel. * Tweak the kernel launch params. * Bugfix for quantized metal. * Fix some clippy lints. * Tweak the launch parameters. * Tweak cuda basics to perform a quantized matmul. * Perform the dequantization on the cpu + use cublas for matmul. * Add the dequantization kernel. * Test the qmatmul. * More kernels. * Matmul-vec kernel. * Add a couple kernels. * More dequantization kernels.	2024-02-25 18:11:47 +01:00
Laurent Mazare	8d04f70f4d	Fix the eos token for gemma. (#1753 )	2024-02-24 11:07:02 +01:00
Daniel Varga	32eb56d6b3	Fix typo in README (#1740 )	2024-02-22 12:35:26 +01:00
Laurent Mazare	28057781aa	Make the cache for the llama model explicit too. (#1745 )	2024-02-22 12:04:33 +01:00

1 2 3 4 5 ...

581 Commits