* Start processing images.
* Add LayerNorm2d (see the sketch after this list).
* Properly use LayerNorm2d.
* Tweak eps.
* Use LayerNorm on inputs with a rank different from 3.
* Window partitioning (sketch after the list).
* Fix a couple todos.
* More todos.
* Hard-code the einsums.
* More padding support.
* Some size tweaks.
* Use the hub to get the weights.
* Use a batch matmul.
* Tweaks.
* More fixes.
* Generate some predictions.
* Add a custom softmax implementation (sketch after the list).
* Add softmaxlastdim to the benchmarks.
* And add a test.
* Support more dtypes.
* Polish the code.
* Use the slow implementation on cuda.
* Add a todo for the cuda kernel.
* Add the dilation parameter (sketch after the list).
* Restore the basic optimizer example.
* Dilation support in cudnn.
* Use the dilation parameter in the cpu backend.
* More dilation support.
* No support for dilation in transposed convolutions.
* Add dilation to a test.
* Remove a print.
* Helper function.
* Add a groups parameter to convolutions (sketch after the list).
* Avoid some unnecessary groups checks.
* Move the tensor convolution bits.
* Proper handling of groups.
* Bump the crate version.
* And add a changelog.
* Some fixes for yolo-v3.
* Use the running stats for inference in the batch-norm layer (sketch after the list).
* Get some proper predictions for yolo.
* Avoid the quadratic insertion.
* Start adding a stable-diffusion example.
* Proper computation of the causal mask (sketch after the list).
* Add the chunk operation (sketch after the list).
* Work in progress: port the attention module.
* Add some dummy modules for conv2d and group-norm, get the attention module to compile.
* Re-enable the 2d convolution.
* Add the embeddings module.
* Add the resnet module.
* Add the unet blocks.
* Add the unet.
* And add the variational auto-encoder.
* Use the pad function from utils.
* Move the vision datasets to a separate crate.
* Move the batcher bits.
* Update the readme.
* Move the tiny-stories bits.
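A few of the commits above concern LayerNorm2d, which normalizes an NCHW activation over its channel dimension at every spatial position, with `eps` guarding the division. A minimal sketch in plain Rust over a flat buffer; the function name and layout are illustrative, not the crate's actual API:

```rust
// Illustrative LayerNorm2d over an NCHW tensor stored as a flat Vec<f32>.
// For each (n, h, w) position, normalize across the C channels, then apply
// a per-channel scale and bias. `eps` guards against division by zero.
fn layer_norm_2d(
    x: &[f32],
    (n, c, h, w): (usize, usize, usize, usize),
    gamma: &[f32], // per-channel scale, len == c
    beta: &[f32],  // per-channel bias, len == c
    eps: f32,
) -> Vec<f32> {
    let mut out = vec![0f32; x.len()];
    let plane = h * w;
    for ni in 0..n {
        for p in 0..plane {
            // Index of channel `ci` at this spatial position.
            let idx = |ci: usize| ni * c * plane + ci * plane + p;
            let mean = (0..c).map(|ci| x[idx(ci)]).sum::<f32>() / c as f32;
            let var = (0..c)
                .map(|ci| (x[idx(ci)] - mean).powi(2))
                .sum::<f32>() / c as f32;
            let inv_std = 1.0 / (var + eps).sqrt();
            for ci in 0..c {
                out[idx(ci)] = (x[idx(ci)] - mean) * inv_std * gamma[ci] + beta[ci];
            }
        }
    }
    out
}
```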
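Window partitioning splits a feature map into non-overlapping square windows so that attention can be computed per window. A sketch assuming a row-major (H, W, C) buffer whose height and width are already padded to multiples of the window size:

```rust
// Illustrative window partitioning: split an (H, W, C) map, stored row-major,
// into (H/P * W/P) windows of shape (P, P, C). Assumes H and W are multiples
// of the window size P; the real code also pads when they are not.
fn window_partition(x: &[f32], h: usize, w: usize, c: usize, p: usize) -> Vec<Vec<f32>> {
    assert!(h % p == 0 && w % p == 0, "pad H and W to multiples of P first");
    let mut windows = Vec::with_capacity((h / p) * (w / p));
    for wh in (0..h).step_by(p) {
        for ww in (0..w).step_by(p) {
            let mut win = Vec::with_capacity(p * p * c);
            for dh in 0..p {
                for dw in 0..p {
                    let base = ((wh + dh) * w + (ww + dw)) * c;
                    win.extend_from_slice(&x[base..base + c]);
                }
            }
            windows.push(win);
        }
    }
    windows
}
```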
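The custom softmax runs over the last dimension and subtracts each row's maximum before exponentiating so that large logits do not overflow; a scalar reference of this shape is what the benchmark and test entries above would compare against. A sketch (the name `softmax_last_dim` is illustrative):

```rust
// Numerically stable softmax over the last dimension of a (rows, dim) buffer:
// subtract the row max before exponentiating so large logits do not overflow.
fn softmax_last_dim(x: &mut [f32], dim: usize) {
    for row in x.chunks_mut(dim) {
        let max = row.iter().copied().fold(f32::NEG_INFINITY, f32::max);
        let mut sum = 0f32;
        for v in row.iter_mut() {
            *v = (*v - max).exp();
            sum += *v;
        }
        for v in row.iter_mut() {
            *v /= sum;
        }
    }
}
```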
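Dilation spaces the kernel taps `dilation` elements apart, growing the receptive field without adding weights; the effective kernel width becomes `dilation * (k - 1) + 1`. A 1d sketch with stride 1 and no padding (names are illustrative):

```rust
// Illustrative 1d convolution with a dilation parameter: kernel taps are
// spaced `dilation` elements apart. Effective kernel width: dilation * (k - 1) + 1.
fn conv1d_dilated(input: &[f32], kernel: &[f32], dilation: usize) -> Vec<f32> {
    let k = kernel.len();
    let span = dilation * (k - 1) + 1; // effective kernel width
    if input.len() < span {
        return Vec::new();
    }
    (0..=input.len() - span)
        .map(|i| {
            kernel
                .iter()
                .enumerate()
                .map(|(j, &w)| w * input[i + j * dilation])
                .sum::<f32>()
        })
        .collect()
}
```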
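With a groups parameter, input and output channels are split into `groups` independent slices, each output slice computed from its matching input slice only; both channel counts must be divisible by `groups`. A sketch for the 1x1 case, with assumed layout and names:

```rust
// Illustrative grouped 1x1 convolution: each of the `groups` output slices
// only sees its own input slice, so weights shrink by a factor of `groups`.
fn conv1x1_grouped(
    x: &[f32],      // (c_in, len), row-major
    weight: &[f32], // (c_out, c_in / groups)
    c_in: usize,
    c_out: usize,
    len: usize,
    groups: usize,
) -> Vec<f32> {
    assert!(c_in % groups == 0 && c_out % groups == 0);
    let (gin, gout) = (c_in / groups, c_out / groups);
    let mut out = vec![0f32; c_out * len];
    for g in 0..groups {
        for oc in 0..gout {
            for ic in 0..gin {
                let w = weight[(g * gout + oc) * gin + ic];
                for p in 0..len {
                    out[(g * gout + oc) * len + p] += w * x[(g * gin + ic) * len + p];
                }
            }
        }
    }
    out
}
```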
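Using the running stats for inference means batch-norm normalizes each channel with the mean and variance accumulated during training instead of the current batch's statistics. A per-channel sketch (names are illustrative):

```rust
// At inference, batch-norm normalizes each channel with the *running* mean
// and variance accumulated during training, not the batch statistics.
fn batch_norm_infer(
    x: &mut [f32],        // (channels, len), row-major
    len: usize,
    running_mean: &[f32], // per-channel
    running_var: &[f32],
    gamma: &[f32],
    beta: &[f32],
    eps: f32,
) {
    for (c, chan) in x.chunks_mut(len).enumerate() {
        let inv_std = 1.0 / (running_var[c] + eps).sqrt();
        for v in chan.iter_mut() {
            *v = (*v - running_mean[c]) * inv_std * gamma[c] + beta[c];
        }
    }
}
```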
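The causal mask prevents position i from attending to later positions: entries above the diagonal of the (t, t) attention-score matrix are set to negative infinity before the softmax, which maps them to a weight of zero. A sketch:

```rust
// Illustrative causal mask: position i may only attend to positions j <= i,
// so entries above the diagonal of the (t, t) score matrix become -inf.
fn apply_causal_mask(scores: &mut [f32], t: usize) {
    for i in 0..t {
        for j in (i + 1)..t {
            scores[i * t + j] = f32::NEG_INFINITY;
        }
    }
}
```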
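The chunk operation splits a tensor into n pieces along a given dimension. A sketch along the first dimension of a row-major (rows, cols) buffer, assuming even divisibility (the real op also handles a smaller trailing chunk):

```rust
// Illustrative chunk: split a (rows, cols) buffer into `n` equal pieces
// along the first dimension.
fn chunk(x: &[f32], rows: usize, cols: usize, n: usize) -> Vec<Vec<f32>> {
    assert_eq!(rows % n, 0, "sketch only handles evenly divisible chunks");
    let step = (rows / n) * cols;
    x.chunks(step).map(|c| c.to_vec()).collect()
}
```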
---------
Co-authored-by: Jane Doe <jane.doe@example.org>
* Rework the var-builder to handle initializations (sketch after the list).
* Add some helper functions for layer creation.
* Improve the layer initializations.
* Get initialized variables.
* Precompute the rotary embeddings when training llamas (sketch after the list).
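Reworking the var-builder to handle initializations means a variable is returned if it already exists and created with the requested init otherwise. A hypothetical sketch; `VarBuilder`, `Init`, and `get_or_init` are illustrative names, not the crate's actual API:

```rust
use std::collections::HashMap;

// Hypothetical var-builder: hand back an existing variable, or create it
// with the requested initialization on first use.
enum Init {
    Zeros,
    Const(f32),
}

struct VarBuilder {
    vars: HashMap<String, Vec<f32>>,
}

impl VarBuilder {
    fn get_or_init(&mut self, name: &str, len: usize, init: Init) -> &[f32] {
        self.vars.entry(name.to_string()).or_insert_with(|| match init {
            Init::Zeros => vec![0f32; len],
            Init::Const(c) => vec![c; len],
        })
    }
}
```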
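Rotary embeddings depend only on the position index and the head dimension, so their cos/sin tables can be computed once up front rather than on every training step. A sketch using the common base of 10000 (names are illustrative):

```rust
// Precompute the rotary-embedding cos/sin tables, each of shape
// (seq_len, head_dim / 2), once before training starts.
fn precompute_rope(seq_len: usize, head_dim: usize) -> (Vec<f32>, Vec<f32>) {
    let half = head_dim / 2;
    let inv_freq: Vec<f32> = (0..half)
        .map(|i| 1.0 / 10_000f32.powf(2.0 * i as f32 / head_dim as f32))
        .collect();
    let mut cos = Vec::with_capacity(seq_len * half);
    let mut sin = Vec::with_capacity(seq_len * half);
    for pos in 0..seq_len {
        for &f in &inv_freq {
            let angle = pos as f32 * f;
            cos.push(angle.cos());
            sin.push(angle.sin());
        }
    }
    (cos, sin)
}
```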