candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-16 02:38:10 +00:00

Author	SHA1	Message	Date
GeauxEric	7f2bbcf746	[segment-anything] Support multi-point as the prompt input (#945 ) * [sam] Support multi-point prompts * [segment-anything] Pass points by reference * [segment-anything] Update example code and image * Fix clippy lint. --------- Co-authored-by: Yun Ding <yunding@nvidia.com> Co-authored-by: laurent <laurent.mazare@gmail.com>	2023-09-25 12:14:10 +01:00
Laurent Mazare	dc47224ab9	Override the default cudnn heuristics. (#957 )	2023-09-25 10:31:53 +01:00
Laurent Mazare	1ce7fe2543	Add more examples to the phi readme. (#956 )	2023-09-24 18:19:05 +01:00
Laurent Mazare	402ddcfcb4	Add the missing kernel. (#955 )	2023-09-24 17:21:37 +01:00
Laurent Mazare	f5069dd354	Use the repo for the quantized phi model. (#954 )	2023-09-24 16:30:26 +01:00
Laurent Mazare	0007ae9c11	Add the quantized mixformer model. (#953 ) * Add the quantized mixformer model. * Add the quantized option in the phi example.	2023-09-24 15:03:48 +01:00
Laurent Mazare	e15862cfdb	Shared the quantized var-builder code. (#952 ) * Shared the quantized var-builder code. * Fix compilation.	2023-09-24 12:55:07 +01:00
Laurent Mazare	4aeb449017	Depreate the VarBuilder::from_safetensors function. (#951 )	2023-09-24 11:18:17 +01:00
Laurent Mazare	bcb0ed8f1c	Self-contained safetensors for the multiprocess llama example. (#950 )	2023-09-24 06:54:49 +01:00
Laurent Mazare	7edd755756	Pass directly the buffer ownership. (#949 )	2023-09-24 06:34:44 +01:00
Laurent Mazare	e32c89d90c	Add the buffered safetensor wrapper. (#948 )	2023-09-23 22:57:42 +01:00
Laurent Mazare	bb3471ea31	Adapt more examples to the updated safetensor api. (#947 ) * Simplify the safetensor usage. * Convert more examples. * Move more examples. * Adapt stable-diffusion.	2023-09-23 21:26:03 +01:00
Laurent Mazare	890d069092	Self-contained safetensor wrappers (#946 ) * Self-contained safetensor wrappers. * Use the new safetensor container in varbuilders.	2023-09-23 20:39:52 +01:00
Laurent Mazare	5dbe46b389	Add tracing. (#943 )	2023-09-23 16:55:46 +01:00
Laurent Mazare	ccf352f3d1	Use yoke to provide a self-referential container for mmaped safetenso… (#939 ) * Use yoke to provide a self-referential container for mmaped safetensor files. * Add the new self-owned type for safetensor files without removing the previous version. * Add routing. * Add an initializer for the case of multiple files.	2023-09-23 15:43:11 +01:00
Laurent Mazare	402d207f0f	VarMap setter functions (#938 ) * Add some setter helper functions for varmap. * Add more comments.	2023-09-23 10:27:51 +01:00
Laurent Mazare	7582937a32	Add the causal mask in mixformer. (#937 )	2023-09-23 09:50:26 +01:00
Laurent Mazare	b54acfa3d0	Tracing for the phi model (#936 ) * Add some tracing bits to mixformers. * Add the missing file. * Add the conv2d layer to with-tracing. * Improve the tracing usage.	2023-09-23 09:19:34 +01:00
Radamés Ajna	cda1786eed	smaller t5 models quantized (#934 )	2023-09-22 22:31:23 +01:00
Laurent Mazare	912a3d63b0	Use the proper block size for quantizing models. (#933 ) * Use the proper block size for quantizing models. * Use the proper dimension.	2023-09-22 21:36:56 +01:00
Laurent Mazare	3ef328c53d	Mention the new phi model in the readme. (#932 )	2023-09-22 21:24:51 +01:00
Radamés Ajna	0c8e983514	update link to t5 (#931 )	2023-09-22 20:30:01 +01:00
Laurent Mazare	df6f5240ba	Complete the mixformer implementation. (#930 ) * Complete the mixformers implementation. * Tweak the attention. * Add the phi-1.5 example. * Improve the phi example. * Bugfix. * Get the phi example to work.	2023-09-22 20:03:16 +01:00
Laurent Mazare	a46b1b4657	Mixformer (#929 ) * Sketch the mixformer model. * More modeling code. * More mixformers. * MixFormer creation. * More mixformers.	2023-09-22 16:17:14 +01:00
Radamés Ajna	19e52e5007	T5 Wasm (#918 ) * init t5 wasm model * split workers for each model * clean up * add some ui * readme * index * typo * remove cache param, clear_kv_cache * add max_length as param * add model tasks option to ui * add method to load quantized gguf from buffer * Add quantized wasm module * add quantized models to UI, dynamic import wasms * link to quantized * fix copy * fix ModelEncoder * fix README.md	2023-09-22 15:31:10 +01:00
Laurent Mazare	8601537e31	Add slice-scatter. (#927 ) * Add slice-scatter. * Add the op. * Make transpose be a no-op when the dimensions are identical. * Add the backprop. * And add some gradient test.	2023-09-22 12:18:16 +01:00
Gonzalo	a96878f235	cuda cast i64 (#925 )	2023-09-21 19:52:39 +01:00
Laurent Mazare	aa8ec06fd2	Add the t5-xxl version. (#924 )	2023-09-21 14:48:13 +01:00
Laurent Mazare	b43ca493f6	Add more quantized flan t5 variants (#923 ) * Add the quantized flan-t5-large variant. * Add more sizes.	2023-09-21 13:23:30 +01:00
Laurent Mazare	3b557765e8	T5 quantized example (#922 ) * Load gguf files for the quantized t5. * Add the quantized t5 example. * Allow for loading local files. * Add some support for quantizing safetensor files. * Transpose before quantizing. * Quantized t5. * Retrieve the weights from the hub.	2023-09-21 12:33:15 +01:00
Laurent Mazare	2619c4307f	Add a quantized version of the t5 model. (#921 )	2023-09-21 11:13:39 +01:00
Laurent Mazare	c89b82b2d4	Add a clear cache function to the t5 model. (#919 )	2023-09-21 09:01:06 +01:00
Laurent Mazare	7b26e513f1	Add the erf function. (#917 )	2023-09-21 06:19:10 +01:00
Laurent Mazare	ab1d40ea97	Add more t5 tracing. (#915 )	2023-09-20 20:20:54 +01:00
Laurent Mazare	3a0d3e05df	Add more t5 tracing. (#914 ) * Add more t5 tracing. * Rever the sm change.	2023-09-20 16:37:51 +01:00
Laurent Mazare	9b24d89d2d	Tracing mode for T5. (#913 ) * Tracing mode for T5. * Tracing for the linear layer.	2023-09-20 15:03:35 +01:00
Laurent Mazare	fb1c2ac535	Add flash-attn support. (#912 ) * Add flash-attn support. * Add the use-flash-attn flag. * Re-enable flash-attn.	2023-09-20 14:07:55 +01:00
Laurent Mazare	728e167334	Add details on wuerstchen. (#911 )	2023-09-20 13:09:35 +01:00
Laurent Mazare	7b1ddcff47	Add clone to various nn layers. (#910 )	2023-09-20 11:33:51 +01:00
Laurent Mazare	f685b2231c	Add some missing biases. (#908 )	2023-09-20 10:14:51 +01:00
Laurent Mazare	c0b49d5a50	Wuerstchen parameter tweaks. (#907 )	2023-09-20 09:26:24 +01:00
Mahmoud	098dd0d1e9	fix: add missing`top_p` in llama_multiprocess (#905 )	2023-09-20 08:54:56 +01:00
Juarez Bochi	05626ef492	Flan T5: Read lm_head when word embeddings are not tied (#903 ) * Read lm_head when word embeddings are not tied * Fix formatting * Address comments	2023-09-19 22:36:47 +01:00
Laurent Mazare	67a486d18d	Line-up the wuerstchen model with the python implementation. (#901 ) * Line-up the wuerstchen model with the python implementation. * Missing cos. * Fix the picture denormalization.	2023-09-19 21:59:44 +01:00
Radamés Ajna	7ad82b87e4	BERT Wasm (#902 ) * implement wasm module * add example to workspace * add UI explore semantic similiarity * change status messages * formatting * minor changes	2023-09-19 21:31:37 +01:00
Juarez Bochi	8696f64bae	Fix T5 kv cache (#899 ) * Fix T5 kv cache * Add argument for decoder prompt * Fix range	2023-09-19 20:36:15 +01:00
Laurent Mazare	d7e48234d4	Add an erf based gelu op (#900 ) * Erf based gelu. * Add the erf backed gelu. * Test the new gelu op (which is not gelu_new).	2023-09-19 19:54:28 +01:00
Laurent Mazare	34f2ecbc3b	Fix the leaky relu. (#898 )	2023-09-19 18:17:17 +01:00
Laurent Mazare	4f91c8e109	Improve the error message on shape mismatch for cat. (#897 ) * Improve the error message on shape mismatch for cat. * Cosmetic tweak.	2023-09-19 15:09:47 +01:00
Laurent Mazare	06e46d7c3b	Only use classifier free guidance for the prior. (#896 ) * Only use classifier free guidance for the prior. * Add another specific layer-norm structure. * Tweaks. * Fix the latent shape. * Print the prior shape. * More shape fixes. * Remove some debugging continue.	2023-09-19 14:13:05 +01:00

1 2 3 4 5 ...

1264 Commits