80f0482f26
Fix the stable-diffusion VAE. (#398)
* Fix the stable-diffusion VAE.
* Fix for saving images.
2023-08-10 18:24:31 +01:00
385f0d261c
Normalize embeddings in the BERT example. (#390)
2023-08-10 13:05:55 +01:00
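For reference, the normalization in question is plain L2 normalization of the sentence embeddings, which makes cosine similarity a simple dot product. A minimal sketch in plain Rust (illustrative, not the example's actual code):

```rust
// Hypothetical sketch: L2-normalize an embedding vector so that
// cosine similarity between embeddings reduces to a dot product.
fn l2_normalize(v: &mut [f32]) {
    let norm = v.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm > 0.0 {
        for x in v.iter_mut() {
            *x /= norm;
        }
    }
}

fn main() {
    let mut embedding = vec![3.0f32, 4.0];
    l2_normalize(&mut embedding);
    assert_eq!(embedding, vec![0.6, 0.8]); // now unit norm
}
```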
c3a0761e62
Add some tracing to the whisper example. (#375)
2023-08-09 19:58:36 +01:00
a3b1699409
Embed the mel filters in the whisper binary. (#373)
2023-08-09 18:27:26 +01:00
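Embedding the filters in the binary avoids a run-time file dependency; in Rust this is typically done with the `include_bytes!` macro. A minimal sketch (the file name here is illustrative):

```rust
// Hypothetical sketch: bake a binary asset into the executable at
// compile time instead of loading it from disk at run time.
// `melfilters.bytes` is an illustrative file name.
static MEL_FILTERS: &[u8] = include_bytes!("melfilters.bytes");

fn main() {
    // The raw bytes are available without any file I/O.
    println!("embedded {} bytes of mel filters", MEL_FILTERS.len());
}
```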
3a62aee91f
Write the generated images using the image crate. (#363)
* Use the image crate to write the generated images.
* Make the dependency optional.
2023-08-09 15:26:44 +01:00
be21d7e75a
Fix the padding used in stable diffusion. (#362)
2023-08-09 13:23:59 +01:00
89d3926c9b
Fixes for the stable diffusion example. (#342)
* Fixes for the stable diffusion example.
* Bugfix.
* Another fix.
* Fix for group-norm.
* More fixes to get SD to work.
2023-08-08 14:57:09 +01:00
fc265d9dcf
Some CLIP fixes for stable diffusion. (#338)
* Some CLIP fixes for stable diffusion.
* Add the avg-pool2d operation on CPU.
2023-08-07 18:31:45 +01:00
2345b8ce3f
Skeleton for the avg-pool2d and upsample-nearest2d ops. (#337)
* Skeleton for the avg-pool2d and upsample-nearest2d ops.
* Preliminary conv2d support.
2023-08-07 16:15:38 +01:00
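For reference, avg-pool2d just averages each pooling window. A plain-Rust sketch under the assumption of a square kernel with stride equal to the kernel size (illustrative, not the candle kernel):

```rust
// Hypothetical sketch: average pooling with a square k x k kernel and
// a stride equal to the kernel size, on a single-channel f32 image
// stored row-major.
fn avg_pool2d(input: &[f32], h: usize, w: usize, k: usize) -> Vec<f32> {
    let (oh, ow) = (h / k, w / k);
    let mut out = vec![0.0f32; oh * ow];
    for i in 0..oh {
        for j in 0..ow {
            let mut sum = 0.0;
            for di in 0..k {
                for dj in 0..k {
                    sum += input[(i * k + di) * w + (j * k + dj)];
                }
            }
            out[i * ow + j] = sum / (k * k) as f32;
        }
    }
    out
}

fn main() {
    let img = [1.0, 2.0, 3.0, 4.0]; // 2x2 image
    assert_eq!(avg_pool2d(&img, 2, 2, 2), vec![2.5]);
}
```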
f53a333ea9
Simple pad support. (#336)
* Simple pad support.
* Fix the tensor indexing when padding.
2023-08-07 15:24:56 +01:00
5bb2fce998
Implement group-norm. (#334)
* Implement group-norm.
* Add some testing for group-norm.
2023-08-07 06:53:05 +01:00
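Group normalization splits the channels into groups and normalizes each group with its own mean and variance. A plain-Rust sketch of the math (illustrative, not the candle implementation):

```rust
// Hypothetical sketch: group normalization over a single example with
// `c` channels of `n` elements each, channels stored contiguously.
// Each group of `c / groups` channels gets its own mean and variance.
fn group_norm(x: &mut [f32], c: usize, n: usize, groups: usize, eps: f32) {
    let group_size = (c / groups) * n;
    for g in x.chunks_mut(group_size) {
        let len = g.len() as f32;
        let mean = g.iter().sum::<f32>() / len;
        let var = g.iter().map(|v| (v - mean) * (v - mean)).sum::<f32>() / len;
        let inv_std = 1.0 / (var + eps).sqrt();
        for v in g.iter_mut() {
            *v = (*v - mean) * inv_std;
        }
    }
}

fn main() {
    // 4 channels of 2 elements, 2 groups -> each group normalized alone.
    let mut x = vec![1.0, 2.0, 3.0, 4.0, 10.0, 20.0, 30.0, 40.0];
    group_norm(&mut x, 4, 2, 2, 1e-5);
    println!("{x:?}"); // each half now has ~zero mean and unit variance
}
```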
141df4ad2b
Main diffusion loop for the SD example. (#332)
2023-08-06 21:39:53 +01:00
166bfd5847
Add the recip op + use it in stable-diffusion. (#331)
* Add the recip unary op.
* Fix the CUDA kernel.
* Use the recip op in sigmoid.
2023-08-06 21:14:52 +01:00
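The last bullet above expresses sigmoid through the new op: sigmoid(x) = recip(1 + exp(-x)). A scalar sketch of that identity in plain Rust:

```rust
// Hypothetical sketch: sigmoid(x) = 1 / (1 + exp(-x)), i.e. the
// reciprocal of (1 + exp(-x)), which is how a `recip` op lets a
// framework express sigmoid without a dedicated kernel.
fn recip(x: f32) -> f32 {
    1.0 / x
}

fn sigmoid(x: f32) -> f32 {
    recip(1.0 + (-x).exp())
}

fn main() {
    assert!((sigmoid(0.0) - 0.5).abs() < 1e-6);
    println!("sigmoid(2.0) = {}", sigmoid(2.0)); // ~0.8808
}
```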
1c062bf06b
Add the DDIM scheduler. (#330)
2023-08-06 20:44:00 +01:00
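For reference, the deterministic DDIM update (eta = 0) predicts the clean sample from the model's noise estimate and re-noises it to the previous timestep. A per-element sketch in plain Rust (illustrative; alpha values are the cumulative products of 1 - beta):

```rust
// Hypothetical sketch: one deterministic DDIM step (eta = 0) on a
// single scalar. `alpha_t` and `alpha_prev` are the cumulative
// products of (1 - beta) at the current and previous timesteps.
fn ddim_step(x_t: f32, eps: f32, alpha_t: f32, alpha_prev: f32) -> f32 {
    // Predicted clean sample from the model's noise estimate.
    let pred_x0 = (x_t - (1.0 - alpha_t).sqrt() * eps) / alpha_t.sqrt();
    // Deterministically re-noise to the previous (less noisy) timestep.
    alpha_prev.sqrt() * pred_x0 + (1.0 - alpha_prev).sqrt() * eps
}

fn main() {
    let x_prev = ddim_step(0.3, 0.1, 0.5, 0.9);
    println!("x_prev = {x_prev}");
}
```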
d34039e352
Add a stable diffusion example (#328)
* Start adding a stable-diffusion example.
* Proper computation of the causal mask.
* Add the chunk operation.
* Work in progress: port the attention module.
* Add some dummy modules for conv2d and group-norm, get the attention module to compile.
* Re-enable the 2d convolution.
* Add the embeddings module.
* Add the resnet module.
* Add the unet blocks.
* Add the unet.
* And add the variational auto-encoder.
* Use the pad function from utils.
2023-08-06 17:49:43 +01:00
b278834267
Support the Accelerate BLAS on macOS. (#325)
* Add the accelerate feature.
* FFI tweaks.
2023-08-05 17:25:24 +01:00
620f83cf66
Add the candle-datasets crate (#322)
* Move the vision datasets to a separate crate.
* Move the batcher bits.
* Update the readme.
* Move the tiny-stories bits.
---------
Co-authored-by: Jane Doe <jane.doe@example.org>
2023-08-05 08:56:50 +01:00
f7b2a0391d
Transpose the weight matrices for llama2.c. (#321)
2023-08-04 13:32:20 +01:00
df6667ba88
Add some tracing to llama. (#318)
2023-08-03 13:52:22 +01:00
a79286885c
Support safetensors weights in llama2.c inference. (#317)
2023-08-03 11:10:58 +01:00
4f17290ce0
Use AdamW in the llama2 training. (#308)
2023-08-02 14:14:02 +01:00
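AdamW decouples the weight decay from the adaptive gradient update. A plain-Rust sketch of the update rule for a flat parameter slice (illustrative, not candle's optimizer):

```rust
// Hypothetical sketch: one AdamW step. Weight decay is applied
// directly to the parameters, decoupled from the moment estimates.
struct AdamW {
    lr: f32, beta1: f32, beta2: f32, eps: f32, weight_decay: f32,
    m: Vec<f32>, v: Vec<f32>, t: i32,
}

impl AdamW {
    fn step(&mut self, params: &mut [f32], grads: &[f32]) {
        self.t += 1;
        let bc1 = 1.0 - self.beta1.powi(self.t); // bias corrections
        let bc2 = 1.0 - self.beta2.powi(self.t);
        for i in 0..params.len() {
            self.m[i] = self.beta1 * self.m[i] + (1.0 - self.beta1) * grads[i];
            self.v[i] = self.beta2 * self.v[i] + (1.0 - self.beta2) * grads[i] * grads[i];
            let m_hat = self.m[i] / bc1;
            let v_hat = self.v[i] / bc2;
            params[i] -= self.lr * (m_hat / (v_hat.sqrt() + self.eps)
                + self.weight_decay * params[i]);
        }
    }
}

fn main() {
    let mut opt = AdamW {
        lr: 1e-3, beta1: 0.9, beta2: 0.999, eps: 1e-8, weight_decay: 0.01,
        m: vec![0.0; 2], v: vec![0.0; 2], t: 0,
    };
    let mut params = vec![1.0f32, -1.0];
    opt.step(&mut params, &[0.5, -0.5]);
    println!("{params:?}");
}
```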
51e51da896
Rename the candle crate to candle-core (#301)
* Rename to candle-core.
* More candle-core renaming.
2023-08-02 08:20:22 +01:00
4b3bd79fbd
Remove the embedding ops in favor of index-select. (#299)
* Remove the embedding ops in favor of index-select.
* Also remove the CUDA kernels.
2023-08-02 05:42:11 +01:00
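An embedding lookup is just an index-select into the weight matrix along its first dimension, which is why the dedicated op is redundant. A plain-Rust sketch (illustrative):

```rust
// Hypothetical sketch: an embedding lookup expressed as index-select,
// gathering rows of a (vocab, dim) weight matrix stored row-major.
fn index_select(weights: &[f32], dim: usize, ids: &[u32]) -> Vec<f32> {
    let mut out = Vec::with_capacity(ids.len() * dim);
    for &id in ids {
        let row = id as usize * dim;
        out.extend_from_slice(&weights[row..row + dim]);
    }
    out
}

fn main() {
    // vocab = 3, dim = 2
    let weights = [0.0, 0.1, 1.0, 1.1, 2.0, 2.1];
    let tokens = [2u32, 0];
    assert_eq!(index_select(&weights, 2, &tokens), vec![2.0, 2.1, 0.0, 0.1]);
}
```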
ff876c2103
More llama training (#297)
* Rework the var-builder to handle initializations.
* Add some helper functions for layer creation.
* Improve the layer initializations.
* Get initialized variables.
* Precompute the rotary embeddings when training llamas.
2023-08-01 19:53:41 +01:00
a27239f3d9
Add training for the llama2.c example (#296)
* Rework the commands and run inference by default.
* Add the training module and load the training dataset.
* Random dataset iterator.
* Proper validation-loss computation.
* Compute the evaluation loss.
* Add more substance to the training loop.
2023-08-01 17:23:07 +01:00
75e0448114
Move the weight bits into a separate module. (#295)
2023-08-01 10:37:06 +01:00
614f911e9e
Add some batcher variants that handle errors. (#294)
2023-08-01 09:40:34 +01:00
e1e8127f15
Add the batcher. (#293)
2023-08-01 09:16:10 +01:00
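A batcher of this kind wraps an iterator of samples and yields fixed-size batches. A minimal sketch in plain Rust (illustrative, not the candle API; this variant drops a trailing partial batch):

```rust
// Hypothetical sketch: wrap any iterator of samples and yield
// fixed-size batches, dropping a trailing partial batch.
struct Batcher<I: Iterator> {
    inner: I,
    batch_size: usize,
}

impl<I: Iterator> Iterator for Batcher<I> {
    type Item = Vec<I::Item>;

    fn next(&mut self) -> Option<Self::Item> {
        let batch: Vec<_> = self.inner.by_ref().take(self.batch_size).collect();
        if batch.len() == self.batch_size { Some(batch) } else { None }
    }
}

fn main() {
    let batcher = Batcher { inner: (0..7), batch_size: 3 };
    for batch in batcher {
        println!("{batch:?}"); // [0, 1, 2] then [3, 4, 5]
    }
}
```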
fa98ca0c35
Use subcommands in llama2. (#292)
2023-08-01 05:57:41 +01:00
1a07ff8d17
Pre-tokenized evaluation mode for llama2.c. (#291)
2023-08-01 05:36:25 +01:00
f28558d0b7
Evaluate on the pre-tokenized file. (#290)
2023-07-31 21:31:38 +01:00
6b98b66eb3
Remove the end-of-text tokens. (#289)
2023-07-31 20:43:57 +01:00
9ae1f6afee
Add an eval mode to llama2-c (#288)
* Add an eval mode to llama2-c.
* Encode line by line.
* Get the eval to run.
2023-07-31 17:22:14 +01:00
ffeafbfc43
Make the NLL op closer to the PyTorch version + add a test. (#286)
2023-07-31 14:14:01 +01:00
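PyTorch's nll_loss takes log-probabilities and averages the negative log-probability of the target class over the batch. A plain-Rust sketch of that convention (illustrative, not the candle op):

```rust
// Hypothetical sketch: negative log-likelihood in the PyTorch
// convention. Inputs are log-probabilities of shape (batch, classes)
// flattened row-major; the loss is the mean of -log_prob[target].
fn nll_loss(log_probs: &[f32], classes: usize, targets: &[usize]) -> f32 {
    let sum: f32 = targets
        .iter()
        .enumerate()
        .map(|(i, &t)| -log_probs[i * classes + t])
        .sum();
    sum / targets.len() as f32
}

fn main() {
    // Two samples, three classes, already log-softmaxed.
    let log_probs = [-0.1f32, -2.5, -3.0, -1.2, -0.4, -2.0];
    let loss = nll_loss(&log_probs, 3, &[0, 1]);
    assert!((loss - 0.25).abs() < 1e-6); // mean of 0.1 and 0.4
}
```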
b3ea96b62b
Add a prompt and support more models in llama2-c. (#285)
* Support more models in llama2-c.
* Add a prompt.
2023-07-31 13:09:30 +01:00
94a43faaca
Use the hub models for llama2.c (#284)
2023-07-31 12:51:14 +01:00
62a9b03715
Add a flag to set the number of epochs in the mnist training (#283)
* Add a flag to change the number of epochs for the mnist training.
* Increase the learning rate for the MLP.
2023-07-31 10:32:14 +01:00
a8d8f9f206
Load a trained checkpoint in the mnist example. (#280)
2023-07-30 17:01:45 +01:00
38ff693af0
Add a flag to save the trained weights. (#279)
2023-07-30 15:41:42 +01:00
c950a5c6b1
CUDA support for the mnist training. (#277)
* CUDA support for the mnist training.
* min/max fix + testing.
* Add the argmin/argmax tests.
* More CUDA support for argmin/argmax.
* CUDA kernels for argmin and argmax.
2023-07-29 19:48:04 +01:00
16c33383eb
Improve the mnist training example. (#276)
* Improve the mnist training example.
* Add some initialization routines that can be used for nn.
* Proper initialization in the mnist example.
2023-07-29 16:28:22 +01:00
40c80bfbb2
Merge branch 'main' into update_multiprocess
2023-07-29 16:38:35 +02:00
07eb899729
More mnist training. (#275)
2023-07-29 13:29:31 +01:00
4bf2ebf836
Use u8 tensors for masks. (#273)
2023-07-29 11:32:58 +01:00
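A causal attention mask fits naturally in a u8 tensor: 1 where attention is allowed, 0 where it is masked, applied by pushing masked scores to -inf before the softmax. A plain-Rust sketch (illustrative):

```rust
// Hypothetical sketch: a causal mask as u8 values (1 = attend,
// 0 = masked), applied by setting masked attention scores to -inf.
fn causal_mask(seq_len: usize) -> Vec<u8> {
    let mut mask = vec![0u8; seq_len * seq_len];
    for i in 0..seq_len {
        for j in 0..=i {
            mask[i * seq_len + j] = 1; // position i may attend to j <= i
        }
    }
    mask
}

fn apply_mask(scores: &mut [f32], mask: &[u8]) {
    for (s, &m) in scores.iter_mut().zip(mask) {
        if m == 0 {
            *s = f32::NEG_INFINITY;
        }
    }
}

fn main() {
    let mask = causal_mask(3);
    assert_eq!(mask, vec![1, 0, 0, 1, 1, 0, 1, 1, 1]);
    let mut scores = vec![0.5f32; 9];
    apply_mask(&mut scores, &mask);
    println!("{scores:?}");
}
```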
97d8712ba5
Remove single function.
2023-07-28 23:31:25 +02:00
97181a77c0
Make multiprocess require flash-attn.
2023-07-28 23:31:24 +02:00
50d8273ae4
Support both llama v1 and llama v2. (#272)
2023-07-28 18:40:59 +01:00
7513a5e005
Line up the llama implementation with the python-transformers one. (#271)
* Line up the llama implementation with the python-transformers one.
* Also line up the multiprocess version.
2023-07-28 18:31:28 +01:00
cb8dd5cd53
Back to using the main branch now that the PR has been merged. (#270)
2023-07-28 16:22:44 +01:00
a0e47aba98
Fix the revision used in starcoder to use the safetensors PR. (#269)
2023-07-28 14:02:31 +01:00