candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-16 10:38:54 +00:00

Author	SHA1	Message	Date
Laurent Mazare	5082954c52	Fix the W clip embeddings. (#887 ) * Fix the W clip embeddings. * Add the specialized ddpm scheduler.	2023-09-18 14:50:14 +01:00
Laurent Mazare	c2b866172a	More Wuerstchen fixes. (#882 ) * More Weurstchen fixes. * More shape fixes. * Add more of the prior specific bits. * Broadcast add. * Fix the clip config. * Add some masking options to the clip model.	2023-09-17 22:08:11 +01:00
Laurent Mazare	5f83c13f17	Add the DDPM scheduler. (#877 ) * Add the DDPM scheduler. * Minor tweaks.	2023-09-17 15:03:01 +01:00
Laurent Mazare	db3e9dae04	Wuerstchen main (#876 ) * Wuerstchen main. * More of the wuerstchen cli example. * Paella creation. * Build the prior model. * Fix the weight file names.	2023-09-17 12:46:38 +01:00
Laurent Mazare	7f65af1f0d	Avoid re-encoding the input in the T5 example. (#875 )	2023-09-17 10:25:54 +01:00
Laurent Mazare	eeb54716dd	Tweaks for the T5 example. (#874 )	2023-09-17 10:05:15 +01:00
Laurent Mazare	1a276b5da7	Add a KV cache to T5. (#873 ) * Add a KV cache to T5. * Suggest using release mode. * Use the kv cache in decoding. * Add a comment.	2023-09-17 08:00:45 +01:00
Juarez Bochi	3e49f8fce5	Implement T5 decoding (#864 ) * Load t5 decoder * Run enc, dec, and lm head, but no cross attn * Cross-attention over key_value_states * New arg for decoder input ids * Add mask, don't forward position biases through decoder * Update t5 examples * Clippy + rustfmt	2023-09-15 22:05:12 +02:00
Laurent Mazare	31ab2ddaeb	Remove the padding. (#838 )	2023-09-13 13:00:59 +01:00
Laurent Mazare	3e94324012	Add some sentence similarity part to the t5 example. (#835 ) * Add some sentence similarity part to the t5 example. * Clippy fix.	2023-09-13 10:44:02 +01:00
Laurent Mazare	e6f040d6e3	Readme gallery (#834 ) * More readme tweaks. * Update README.md	2023-09-13 09:05:47 +01:00
Laurent Mazare	cbd36157ac	Add a gif to the quantized readme. (#833 ) * Add a gif to the quantized readme. * gif update.	2023-09-13 08:43:52 +01:00
Laurent Mazare	e4553fb355	T5 tweaks (#831 ) * Use default values rather than options. * Avoid exposing the device field. * More tweaks.	2023-09-13 07:37:04 +01:00
Juarez Bochi	9daa6dbe87	Extract T5 module and add main function to use it (#829 ) * Extract t5 out of musicgen * Add main for t5 module	2023-09-13 07:14:05 +01:00
Laurent Mazare	e82fcf1c59	Add more example readmes. (#828 ) * Add more readmes. * Add a readme for dinov2. * Add some skeleton files for a couple more examples. * More whisper details.	2023-09-12 17:21:24 +01:00
Juarez Bochi	805bf9ffa7	Implement top_p / nucleus sampling (#819 ) * Implement top_p / nucleus sampling * Update changelog * rustfmt * Add tests * Fix clippy warning * Fix another clippy error	2023-09-12 18:10:16 +02:00
Laurent Mazare	42da17694a	Segment Anything readme (#827 ) * Add a readme for the segment-anything model. * Add the original image. * Clean-up the segment anything cli example. * Also print the mask id in the outputs.	2023-09-12 14:35:55 +01:00
Laurent Mazare	7a62aad24a	Add a readme for yolo-v8. (#824 )	2023-09-12 11:01:06 +01:00
Laurent Mazare	bb23b90b1d	Add a small readme for the quantized example. (#823 )	2023-09-12 10:17:31 +01:00
Laurent Mazare	c5a058b169	Use the module trait in stable-diffusion. (#817 )	2023-09-11 20:40:07 +01:00
Laurent Mazare	5c35fbbb13	Stable-Diffusion readme (#814 ) * Stable Diffusion readme. * Fix the image path. * Move the assets. * Resize the sample image. * Lower resolution.	2023-09-11 13:06:29 +01:00
Laurent Mazare	d7b9fec849	Move the stable-diffusion modeling code so that it's easier to re-use. (#812 )	2023-09-11 11:45:57 +01:00
Laurent Mazare	35f72514f5	Move more models to candle-transformers (#796 ) * Move dinov2. * Move efficientnet. * Move the quantized llama model. * Move segment-anything.	2023-09-10 10:20:18 +01:00
Laurent Mazare	d3f05eae8c	Move some models to candle-transformers so that it's easier to re-use. (#794 ) * Move some models to candle-transformers so that they can be shared. * Also move falcon. * Move Llama. * Move whisper (partial).	2023-09-10 09:40:27 +01:00
Laurent Mazare	31936c08fe	ViT tracing. (#790 )	2023-09-09 17:26:39 +01:00
Laurent Mazare	74ad4deb42	Get the MobileSAM TinyViT based version to work. (#789 ) * More TinyViT support in SA. * More mobilesam work. * Add the mobile-sam weights to the hub.	2023-09-09 16:21:44 +01:00
Laurent Mazare	b7cd58473b	TinyViT backbone for segment-anything. (#787 ) * TinyViT. * More TinyViT. * Add more to the tinyvit backbone. * Proper padding. * Plus ViT. * Add the tiniest vit spec.	2023-09-09 15:10:06 +01:00
Laurent Mazare	3cd7e7b51d	Fuse the rel-pos additions via a custom-op. (#786 ) * Fuse the rel-pos additions via a custom-op. * Run with rayon. * Add more tracing.	2023-09-09 10:46:09 +01:00
Laurent Mazare	acf8f10ae1	Get the comparison operation to work on scalar values. (#780 ) * Get the comparison operation to work on scalar values. * Add some time measurement.	2023-09-08 20:13:29 +01:00
Laurent Mazare	0906acab91	Automatic mask generation (#779 ) * A few more contiguous fixes for cuda. * Mask generation. * Generic bbox. * Generate all the masks.	2023-09-08 19:11:34 +01:00
Laurent Mazare	158ff3c609	Add tracing to segment-anything (#777 ) * Tracing support for segment-anything. * More tracing. * Handle the empty slice case.	2023-09-08 15:31:29 +01:00
Laurent Mazare	e5703d2f56	Draw the mask on a merged image. (#775 ) * Draw the mask on a merged image. * Clippy fix. * Enable the target point by default. * Add to the readme.	2023-09-08 14:04:34 +01:00
Laurent Mazare	28c87f6a34	Automatic mask generator + point base mask (#773 ) * Add more to the automatic mask generator. * Add the target point. * Fix. * Remove the allow-unused. * Mask post-processing.	2023-09-08 12:26:56 +01:00
Laurent Mazare	c1453f00b1	Improve the safetensor loading in the segment-anything example. (#772 ) * Improve the safetensor loading in the segment-anything example. * Properly handle the labels when embedding the point prompts.	2023-09-08 09:39:10 +01:00
Laurent Mazare	989a4807b1	Use shape with holes. (#771 )	2023-09-08 08:50:27 +01:00
Laurent Mazare	3898e500de	Generate a mask image + the scaled input image. (#769 ) * Also round-trip the original image. * Make it possible to use a safetensors input.	2023-09-08 05:53:08 +01:00
Laurent Mazare	79c27fc489	Segment-anything fixes: avoid normalizing twice. (#767 ) * Segment-anything fixes: avoid normalizing twice. * More fixes for the image aspect ratio.	2023-09-07 21:45:16 +01:00
Laurent Mazare	7396b8ed1a	Segment Anything - process images (#766 ) * Start processing images. * Add LayerNorm2d. * Properly use LayerNorm2d. * Tweak eps. * Use LayerNorm on inputs with a rank different from 3. * Window partitioning. * Fix a couple todos. * More todos. * Hard-code the einsums. * More padding support. * Some sizes tweaks. * Use the hub to get the weights. * Use a batch matmul. * Tweaks. * More fixes. * Get some predictions to be generated.	2023-09-07 19:22:45 +01:00
Laurent Mazare	7b50f3e106	More segment-anything again. (#764 ) * More segment-anything again. * Transformer block forward. * Two-ways transformer. * Position embeddings. * Sketch the prompt encoder. * More prompt-encoder. * More prompt-encoder. * Add the main sam module. * Embed the transformer. * And hook the transformer forward step. * Build the model. * Handle the global attn indexes. * Get the model to load.	2023-09-07 12:06:55 +01:00
Laurent Mazare	8c991df394	More segment-anything. (#763 ) * More segment-anything. * Split the model in multiple files. * Start adding the transformer. * Add the attention block. * Move the MLP Block.	2023-09-07 07:28:30 +01:00
Laurent Mazare	6527ab81a3	Sketch the segment anything model. (#759 ) * Sketch the segment anything model. * Fix some clippy lint. * Add the mask decoder.	2023-09-07 05:34:05 +01:00
Laurent Mazare	dcf708559d	Fix for cudnn to work with img2img. (#753 )	2023-09-06 07:49:28 +01:00
Laurent Mazare	7299a68353	img2img pipeline for stable diffusion. (#752 ) * img2img pipeline for stable diffusion. * Rename the arguments + fix. * Fix for zero strength. * Another fix. * Another fix. * Revert. * Include the backtrace. * Noise scaling. * Fix the height/width.	2023-09-06 07:06:49 +01:00
Laurent Mazare	1c9e5394a5	Add a custom softmax implementation. (#744 ) * Add a custom softmax implementation. * Add softmaxlastdim to the benchmarks. * And add a test. * Support more dtypes. * Polish the code. * Use the slow implementation on cuda. * Add a todo for the cuda kernel.	2023-09-05 14:20:23 +01:00
Laurent Mazare	9c61b0fc9b	Proper log buckets for t5. (#727 ) * Proper log buckets for t5. * Properly pass the position bias.	2023-09-03 20:33:50 +01:00
Laurent Mazare	26cd266e65	Musicgen text embeddings. (#726 ) * Musicgen text embeddings. * Bugfix for layer norm. * Proper position bias. * Expose the weights.	2023-09-03 18:27:48 +01:00
Laurent Mazare	bbec527bb9	Fix the musicgen example. (#724 ) * Fix the musicgen example. * Retrieve the weights from the hub.	2023-09-03 14:50:39 +01:00
Laurent Mazare	2c1df6bba1	Add a repeat penality to the llama2-c command line example. (#713 ) * Add a repeat penality to the llama2-c command line example. * Another fix attempt.	2023-09-01 20:38:58 +01:00
Laurent Mazare	19042962d5	Whisper fix (#711 ) * Remove unnecessary file. * Whisper fix.	2023-09-01 20:04:07 +01:00
Laurent Mazare	7529531056	Add the optimizer trait. (#702 )	2023-09-01 12:55:39 +01:00

1 2 3 4 5 ...

344 Commits