fb1c2ac535
Add flash-attn support. ( #912 )
* Add flash-attn support.
* Add the use-flash-attn flag.
* Re-enable flash-attn.
2023-09-20 14:07:55 +01:00
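A rough sketch of what the new flag toggles, assuming candle's Tensor API and the candle_flash_attn crate; the function name, layouts and wiring are illustrative, and the real code also gates the fast path behind a flash-attn cargo feature:

```rust
use candle_core::{Result, Tensor, D};

// q, k, v: (batch, num_heads, seq_len, head_dim).
fn attention(q: &Tensor, k: &Tensor, v: &Tensor, use_flash_attn: bool) -> Result<Tensor> {
    let (_b, _h, _seq, head_dim) = q.dims4()?;
    let scale = 1f64 / (head_dim as f64).sqrt();
    if use_flash_attn {
        // flash_attn expects (batch, seq_len, num_heads, head_dim) f16/bf16
        // tensors on a CUDA device, hence the transposes around the call.
        let (q, k, v) = (q.transpose(1, 2)?, k.transpose(1, 2)?, v.transpose(1, 2)?);
        let out = candle_flash_attn::flash_attn(&q, &k, &v, scale as f32, false)?;
        return out.transpose(1, 2);
    }
    // Fallback: plain softmax(q k^T / sqrt(d)) v.
    let att = (q.matmul(&k.t()?)? * scale)?;
    let att = candle_nn::ops::softmax(&att, D::Minus1)?;
    att.matmul(v)
}
```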
f685b2231c
Add some missing biases. ( #908 )
2023-09-20 10:14:51 +01:00
67a486d18d
Line-up the wuerstchen model with the python implementation. ( #901 )
* Line-up the wuerstchen model with the python implementation.
* Missing cos.
* Fix the picture denormalization.
2023-09-19 21:59:44 +01:00
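For context, the picture denormalization at the end of such a pipeline usually maps the decoder output from [-1, 1] back to 8-bit pixels; a generic sketch, the exact constants and dtype handling in the Wuerstchen example may differ:

```rust
use candle_core::{DType, Result, Tensor};

// Map decoded values from [-1, 1] back to 8-bit pixels.
fn denormalize_image(xs: &Tensor) -> Result<Tensor> {
    let xs = ((xs / 2.)? + 0.5)?;    // [-1, 1] -> [0, 1]
    let xs = xs.clamp(0f32, 1f32)?;  // guard against overshoot
    (xs * 255.)?.to_dtype(DType::U8) // [0, 1] -> [0, 255] u8
}
```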
4f91c8e109
Improve the error message on shape mismatch for cat. ( #897 )
* Improve the error message on shape mismatch for cat.
* Cosmetic tweak.
2023-09-19 15:09:47 +01:00
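The gist of the improvement: report the offending shapes instead of a generic error. A hypothetical helper showing the kind of check involved, not the actual candle-core code:

```rust
use candle_core::{bail, Result, Tensor};

// Concatenate along `dim`, naming the offending tensor and shapes when the
// non-cat dimensions do not line up.
fn checked_cat(tensors: &[Tensor], dim: usize) -> Result<Tensor> {
    let Some(first) = tensors.first() else {
        bail!("cat requires at least one tensor")
    };
    for (i, t) in tensors.iter().enumerate().skip(1) {
        for (d, (s0, s)) in first.dims().iter().zip(t.dims().iter()).enumerate() {
            if d != dim && s0 != s {
                bail!(
                    "shape mismatch in cat along dim {dim}: tensor 0 is {:?}, tensor {i} is {:?} (dim {d} differs)",
                    first.dims(),
                    t.dims()
                )
            }
        }
    }
    Tensor::cat(tensors, dim)
}
```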
06e46d7c3b
Only use classifier free guidance for the prior. ( #896 )
* Only use classifier free guidance for the prior.
* Add another specific layer-norm structure.
* Tweaks.
* Fix the latent shape.
* Print the prior shape.
* More shape fixes.
* Remove some debugging continue.
2023-09-19 14:13:05 +01:00
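Classifier-free guidance, now applied to the prior only, boils down to pushing the conditional prediction away from the unconditional one; a minimal sketch with illustrative names:

```rust
use candle_core::{Result, Tensor};

// Combine the conditional and unconditional noise predictions:
// noise = uncond + guidance_scale * (cond - uncond)
fn apply_cfg(noise_cond: &Tensor, noise_uncond: &Tensor, guidance_scale: f64) -> Result<Tensor> {
    let delta = (noise_cond - noise_uncond)?;
    noise_uncond + (delta * guidance_scale)?
}
```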
92db8cecd3
Specialized attention module for Wuerstchen. ( #890 )
* Specialized attention module for Wuerstchen.
* Reshaping ops.
* Attention processor.
* Finish the forward pass.
* Hook the new attention processor.
* Get the prior forward pass to work.
* Make it contiguous.
2023-09-18 21:16:09 +01:00
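The reshaping ops and the final contiguous() call are the usual multi-head attention plumbing; a generic sketch rather than the exact Wuerstchen attention processor:

```rust
use candle_core::{Result, Tensor, D};

// q, k, v: (batch, seq_len, num_heads * head_dim).
fn multi_head_attention(q: &Tensor, k: &Tensor, v: &Tensor, num_heads: usize) -> Result<Tensor> {
    let (b, seq, hidden) = q.dims3()?;
    let head_dim = hidden / num_heads;
    // (b, seq, hidden) -> (b, num_heads, seq, head_dim)
    let split = |t: &Tensor| -> Result<Tensor> {
        t.reshape((b, seq, num_heads, head_dim))?.transpose(1, 2)?.contiguous()
    };
    let (q, k, v) = (split(q)?, split(k)?, split(v)?);
    let scale = 1f64 / (head_dim as f64).sqrt();
    let att = (q.matmul(&k.t()?)? * scale)?;
    let att = candle_nn::ops::softmax(&att, D::Minus1)?;
    // Back to (b, seq, hidden); the transpose leaves a non-contiguous layout,
    // so make it contiguous before reshaping.
    att.matmul(&v)?
        .transpose(1, 2)?
        .contiguous()?
        .reshape((b, seq, hidden))
}
```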
82a98f6da0
Prior denoising. ( #889 )
2023-09-18 16:51:38 +01:00
5082954c52
Fix the W clip embeddings. ( #887 )
* Fix the W clip embeddings.
* Add the specialized ddpm scheduler.
2023-09-18 14:50:14 +01:00
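The scheduler added here is Wuerstchen-specific; for orientation only, a textbook DDPM ancestral step looks like this (the specialized scheduler uses its own schedule and parametrization, so treat this as the general shape of a step):

```rust
use candle_core::{Result, Tensor};

// x_{t-1} = 1/sqrt(alpha_t) * (x_t - beta_t / sqrt(1 - alpha_bar_t) * eps) + sigma_t * z
fn ddpm_step(
    x_t: &Tensor,
    noise_pred: &Tensor,
    alpha_t: f64,
    alpha_bar_t: f64,
    sigma_t: f64,
) -> Result<Tensor> {
    let beta_t = 1. - alpha_t;
    let coeff = beta_t / (1. - alpha_bar_t).sqrt();
    let mean = ((x_t - (noise_pred * coeff)?)? / alpha_t.sqrt())?;
    if sigma_t > 0. {
        // Add fresh noise except on the final step.
        let z = x_t.randn_like(0., 1.)?;
        mean + (z * sigma_t)?
    } else {
        Ok(mean)
    }
}
```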
c2b866172a
More Wuerstchen fixes. ( #882 )
* More Wuerstchen fixes.
* More shape fixes.
* Add more of the prior specific bits.
* Broadcast add.
* Fix the clip config.
* Add some masking options to the clip model.
2023-09-17 22:08:11 +01:00
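The broadcast add and the clip masking options fit together: padding positions are turned into a large negative additive term that is broadcast onto the attention scores. A hypothetical sketch:

```rust
use candle_core::{Result, Tensor};

// scores: (batch, num_heads, seq_len, seq_len),
// pad_mask: (batch, seq_len) with 1.0 on real tokens and 0.0 on padding.
fn masked_scores(scores: &Tensor, pad_mask: &Tensor) -> Result<Tensor> {
    // 0.0 on real tokens, -1e9 on padding, reshaped to (batch, 1, 1, seq_len).
    let neg = ((pad_mask.ones_like()? - pad_mask)? * -1e9)?;
    let mask = neg.unsqueeze(1)?.unsqueeze(1)?;
    // Broadcasting fills in the missing head and query dimensions.
    scores.broadcast_add(&mask)
}
```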
06cc329e71
Remove the parameters for the Wuerstchen layer-norm. ( #879 )
* Remove the parameters for the Wuerstchen layer-norm.
* Fixes.
* More fixes (including conv-transpose2d).
* More fixes.
* Again more fixes.
2023-09-17 15:59:27 +01:00
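Removing the parameters leaves a layer norm with no learnable scale or bias, i.e. pure normalization; a minimal sketch over the last dimension (the real module also moves channels last before normalizing, which is omitted here):

```rust
use candle_core::{Result, Tensor, D};

// Normalize over the last dimension without learnable weight/bias.
fn w_layer_norm(xs: &Tensor, eps: f64) -> Result<Tensor> {
    let mean = xs.mean_keepdim(D::Minus1)?;
    let centered = xs.broadcast_sub(&mean)?;
    let var = centered.sqr()?.mean_keepdim(D::Minus1)?;
    centered.broadcast_div(&(var + eps)?.sqrt()?)
}
```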
db3e9dae04
Wuerstchen main ( #876 )
* Wuerstchen main.
* More of the wuerstchen cli example.
* Paella creation.
* Build the prior model.
* Fix the weight file names.
2023-09-17 12:46:38 +01:00
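Building the prior and the other stages starts from safetensors checkpoints loaded into a VarBuilder, as candle examples typically do; the file name below is only a placeholder, not one of the names fixed in this change:

```rust
use candle_core::{DType, Device, Result};
use candle_nn::VarBuilder;

// Memory-map a safetensors checkpoint and expose it as a VarBuilder.
fn load_weights(device: &Device) -> Result<VarBuilder<'static>> {
    let files = ["prior.safetensors"]; // placeholder file name
    let vb = unsafe { VarBuilder::from_mmaped_safetensors(&files, DType::F32, device)? };
    Ok(vb)
}
```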
c2007ac88f
W fixes. ( #862 )
2023-09-15 15:11:11 +01:00
30be5b6660
Replication pad ( #861 )
* Add the embed mapper convolutions.
* Add the replication pad layer.
* Use the replication-pad op.
* Tweak a todo.
2023-09-15 14:06:21 +01:00
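Replication padding repeats edge values instead of padding with zeros; a self-contained sketch built from narrow and cat, purely illustrative next to the dedicated op added in this change:

```rust
use candle_core::{Result, Tensor};

// Replicate the first/last slice of `dim` `left`/`right` times.
fn replication_pad(xs: &Tensor, dim: usize, left: usize, right: usize) -> Result<Tensor> {
    let n = xs.dim(dim)?;
    let first = xs.narrow(dim, 0, 1)?;
    let last = xs.narrow(dim, n - 1, 1)?;
    let mut parts = Vec::with_capacity(left + right + 1);
    for _ in 0..left {
        parts.push(first.clone());
    }
    parts.push(xs.clone());
    for _ in 0..right {
        parts.push(last.clone());
    }
    Tensor::cat(&parts, dim)
}
```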
107d3d9530
Add the embed mapper convolutions. ( #860 )
2023-09-15 11:38:38 +02:00
2746f2c4be
DiffNeXt/unet ( #859 )
* DiffNeXt/unet
* Start adding the vae.
* VAE residual block.
* VAE forward pass.
* Add pixel shuffling.
* Actually use pixel shuffling.
2023-09-15 10:14:02 +01:00
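Pixel shuffling trades channel depth for spatial resolution, turning (b, c*r*r, h, w) into (b, c, h*r, w*r); a sketch using reshape and pairwise transposes to realize the usual (0, 1, 4, 2, 5, 3) permutation:

```rust
use candle_core::{Result, Tensor};

// Pixel shuffle: (batch, c * r * r, h, w) -> (batch, c, h * r, w * r).
fn pixel_shuffle(xs: &Tensor, r: usize) -> Result<Tensor> {
    let (b, c_in, h, w) = xs.dims4()?;
    let c = c_in / (r * r);
    xs.reshape(vec![b, c, r, r, h, w])?
        // Reorder (b, c, r1, r2, h, w) into (b, c, h, r1, w, r2).
        .transpose(2, 4)?
        .transpose(3, 4)?
        .transpose(4, 5)?
        .contiguous()?
        .reshape((b, c, h * r, w * r))
}
```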
130fe5a087
Add the upblocks. ( #853 )
2023-09-14 22:24:56 +01:00
91ec546feb
More DiffNeXt. ( #847 )
* More DiffNeXt.
* Down blocks.
2023-09-14 22:16:31 +02:00
a0c6d5548c
Add the attention block. ( #846 )
* Add the attention block.
* Add more to clipnext.
2023-09-14 15:40:09 +01:00
286f01db14
Start adding the Wuerstchen diffusion pipeline ( #843 )
* Wuerstchen common bits.
* Add the prior layer.
* Start adding diffnext.
2023-09-14 10:56:07 +01:00