Commit Graph

1071 Commits

Author SHA1 Message Date
af552a5274 Fix the rnn tests for accelerate. (#704) 2023-09-01 13:21:38 +01:00
7529531056 Add the optimizer trait. (#702) 2023-09-01 12:55:39 +01:00
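Commit 7529531056 introduces an optimizer trait. As a rough illustration of the idea — a shared interface over parameter-update rules, with plain SGD as one implementation — here is a minimal Python sketch; the names and method shape are hypothetical, not candle's actual Rust trait:

```python
# Illustrative sketch of an optimizer interface; names are made up,
# not candle's actual trait or method names.
class Optimizer:
    def step(self, params, grads):
        raise NotImplementedError

class SGD(Optimizer):
    def __init__(self, lr):
        self.lr = lr

    def step(self, params, grads):
        # Plain gradient-descent update: p <- p - lr * g
        return [p - self.lr * g for p, g in zip(params, grads)]

opt = SGD(lr=0.1)
new_params = opt.step([1.0, 2.0], [0.5, -0.5])
```

Abstracting the update step this way lets training loops swap optimizers without changing the loop body.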
f2d476ca65 Replace the discord link. (#701) 2023-09-01 09:43:55 +01:00
f9f482d4e5 Add some doc to the varbuilder. (#700) 2023-09-01 08:28:35 +01:00
9736236175 Allow retrieving and setting prefix of VarBuilder (#699) 2023-09-01 08:08:41 +01:00
30a4b593d7 More ops again. (#697) 2023-08-31 22:28:48 +01:00
949f1eae6f Implement a couple more binary ops. (#693) 2023-08-31 21:30:15 +01:00
7cef35c84d Tweak some quantized args (#692)
* Print the args + change the default temp/repeat penalty.

* Minor formatting tweak.
2023-08-31 17:25:21 +01:00
7509c98970 Interactive mode for the quantized model. (#690) 2023-08-31 10:52:42 +01:00
94aa234dfd Add the kv-cache to the whisper wasm version. (#689)
* Add the kv-cache to the whisper wasm version.

* Improve the handling of special tokens.
2023-08-31 09:37:44 +01:00
db59816087 Add a GRU layer. (#688)
* Add a GRU layer.

* Fix the n gate computation.
2023-08-31 08:43:10 +01:00
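Commit db59816087 adds a GRU layer and fixes the "n" (candidate) gate. A scalar sketch of the standard GRU cell equations, with the reset gate applied to the hidden contribution before the tanh; this is the textbook formulation in illustrative Python, not candle's Rust implementation, and the weight names are made up:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def gru_cell(x, h, w_ir, w_iz, w_in, w_hr, w_hz, w_hn):
    # Scalar GRU cell, biases omitted; illustrative only.
    r = sigmoid(w_ir * x + w_hr * h)        # reset gate
    z = sigmoid(w_iz * x + w_hz * h)        # update gate
    # The "n" (candidate) gate applies the reset gate to the hidden
    # contribution *inside* the tanh, per the standard GRU formulation.
    n = math.tanh(w_in * x + r * (w_hn * h))
    return (1.0 - z) * n + z * h            # new hidden state

h_next = gru_cell(1.0, 0.0, 0.5, 0.5, 0.5, 0.5, 0.5, 0.5)
```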
d210c71d77 Set the learning rate. (#687) 2023-08-31 08:03:40 +01:00
8e84d8a59b Llama2.c wasm module. (#686) 2023-08-31 07:44:32 +01:00
9bd486fb96 Add Yolo Pose to JS Example (#684)
* add support for yolo pose models

* fix copy
2023-08-31 06:32:57 +01:00
eaf760a751 Add a python variant for the lstm test. (#682) 2023-08-30 22:32:08 +01:00
1d0bb48fae Improve Whisper WASM UI example (#669)
* wip add module and js worker example

* params

* clean up, send error

* final UI with whisper webworker

* add simple instructions
2023-08-30 20:35:41 +02:00
21e1c73892 Add a LSTM test. (#681)
* Add a LSTM test.

* Clippy.
2023-08-30 20:05:42 +02:00
2047d34b7c More robust tests (so that they pass on accelerate). (#679) 2023-08-30 18:10:10 +01:00
9874d843f1 Fix the accelerate build (#678)
* Cosmetic changes.

* Fix the accelerate build for tanh.
2023-08-30 18:31:14 +02:00
7d753d3acd Mnist training dropout (#677)
* Use dropout in the mnist training.

* Fix.
2023-08-30 16:41:01 +01:00
3159982a89 Add a Dropout layer (#676)
* Add a dropout layer.

* Add an actual layer.
2023-08-30 16:19:28 +01:00
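Commit 3159982a89 adds a Dropout layer. The standard "inverted dropout" technique it presumably implements can be sketched as follows — a minimal Python illustration, not candle's code:

```python
import random

def dropout(xs, p, train, rng=random.random):
    # Inverted-dropout sketch: at train time, zero each element with
    # probability p and scale survivors by 1/(1-p) so the expected
    # value is unchanged; at eval time it is the identity.
    if not train or p == 0.0:
        return list(xs)
    scale = 1.0 / (1.0 - p)
    return [0.0 if rng() < p else x * scale for x in xs]

# Eval mode leaves the input untouched.
ys = dropout([1.0, 2.0, 3.0], p=0.5, train=False)
```

The `train` flag matters: forgetting to disable dropout at inference time is a classic source of degraded eval accuracy.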
ad8a62dbf5 Add tanh. (#675)
* Add tanh.

* Use tanh in the lstm block.

* Add a test for tanh forward and backward passes.
2023-08-30 13:54:50 +01:00
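Commit ad8a62dbf5 adds tanh along with a forward/backward test. The backward pass follows from the identity d/dx tanh(x) = 1 - tanh(x)^2; a minimal Python sketch of that gradient, checked against a finite difference (illustrative, not candle's kernel):

```python
import math

def tanh_backward(x, grad_out=1.0):
    # d/dx tanh(x) = 1 - tanh(x)^2, chained with the upstream gradient.
    t = math.tanh(x)
    return grad_out * (1.0 - t * t)

# Sanity-check the analytic gradient against a central finite difference.
x, eps = 0.3, 1e-6
numeric = (math.tanh(x + eps) - math.tanh(x - eps)) / (2 * eps)
analytic = tanh_backward(x)
```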
f35b9f6baa Add some recurrent neural networks (#674)
* Add the rnn module.

* More LSTM.

* Implement the RNN forward pass.

* More forward pass for LSTM.
2023-08-30 13:27:09 +01:00
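Commit f35b9f6baa adds the rnn module with an LSTM forward pass. The standard LSTM cell it is based on can be sketched in scalar form — purely illustrative Python with all weights tied to one value and biases omitted, not candle's Rust API:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def lstm_cell(x, h, c, w=0.5):
    # Scalar LSTM cell; the single tied weight `w` is an illustrative
    # simplification, not how a real layer is parameterized.
    i = sigmoid(w * x + w * h)      # input gate
    f = sigmoid(w * x + w * h)      # forget gate
    g = math.tanh(w * x + w * h)    # candidate cell state
    o = sigmoid(w * x + w * h)      # output gate
    c_next = f * c + i * g          # new cell state
    h_next = o * math.tanh(c_next)  # new hidden state
    return h_next, c_next

h_next, c_next = lstm_cell(1.0, 0.0, 0.0)
```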
618f4e4c78 Add some documentation. (#673)
* Add some documentation.

* Bump the crate version.
2023-08-30 11:54:00 +01:00
5ac0a98f01 Changelog update. (#672) 2023-08-30 09:27:56 +01:00
393690387f Support dilation in conv-transpose2d. (#671) 2023-08-30 09:22:00 +01:00
9b25113393 Small cleanups (avoid some possible mutations) (#670)
* More mut cleanup.

* Factor out some common bits.
2023-08-30 08:54:00 +01:00
a1a5ab8b0a Neon optimized vecdot (#666)
* Q5k vecdot.

* Add the q3k vecdot.

* Q2k vecdot.

* Move the quantized model to its own file.
2023-08-29 22:28:46 +01:00
59b731de99 Add the powf op. (#664)
* Add the powf op.

* Cuda kernels and backprop.

* Add a test.
2023-08-29 20:48:18 +01:00
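Commit 59b731de99 adds the powf op with CUDA kernels and backprop. The gradient follows the power rule d/dx x^p = p * x^(p-1); a short Python sketch checked against a finite difference (illustrative, not candle's kernel code):

```python
def powf_backward(x, p, grad_out=1.0):
    # Power rule: d/dx x^p = p * x^(p-1), chained with the upstream grad.
    return grad_out * p * x ** (p - 1)

# Verify against a central finite difference of x^p at x=2, p=1.5.
x, p, eps = 2.0, 1.5, 1e-6
numeric = ((x + eps) ** p - (x - eps) ** p) / (2 * eps)
analytic = powf_backward(x, p)
```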
2d3fcad267 Simplify usage of the pool functions. (#662)
* Simplify usage of the pool functions.

* Small tweak.

* Attempt at using apply to simplify the convnet definition.
2023-08-29 19:12:16 +01:00
b31d41e26a Add a convnet training example. (#661)
* Add a convnet example.

* Dataset fix.

* Randomize batches.
2023-08-29 18:23:01 +01:00
71221559d3 Fix the dilated convolutions. (#659) 2023-08-29 16:37:42 +01:00
a044907ffc Dilated convolutions (#657)
* Add the dilation parameter.

* Restore the basic optimizer example.

* Dilation support in cudnn.

* Use the dilation parameter in the cpu backend.

* More dilation support.

* No support for dilation in transposed convolutions.

* Add dilation to a test.

* Remove a print.

* Helper function.
2023-08-29 16:12:11 +01:00
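Commit a044907ffc threads a dilation parameter through the conv code paths. Dilation inserts gaps between kernel taps, so the effective kernel span grows to `dilation * (k - 1) + 1`, which feeds into the standard output-length formula — sketched here in Python as an illustration, not candle's shape code:

```python
def conv_out_len(n, k, stride=1, padding=0, dilation=1):
    # Standard convolution output-length formula with dilation:
    # the effective kernel span is dilation * (k - 1) + 1.
    return (n + 2 * padding - dilation * (k - 1) - 1) // stride + 1

# A dilated 3-tap kernel covers 5 input positions, shrinking the output.
plain = conv_out_len(10, 3)               # dense kernel
dilated = conv_out_len(10, 3, dilation=2) # same taps, wider span
```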
ee8bb1bde1 Add avx implementations of q2k, q3k and q5k vec-dot functions (#654)
* `q2k` avx implementation

* `q3k` avx implementation

* `q5k` avx implementation

* `avx` make masks constant

* clippy stuff
2023-08-29 13:35:56 +01:00
3d2d3c7edb Merge pull request #658 from huggingface/upgrade_hf_hub2
Upgrading hf-hub (for windows support, removing symlink requirement).
2023-08-29 14:32:15 +02:00
1aca6fa291 Upgrading hf-hub. 2023-08-29 14:18:54 +02:00
4ed202447e Upgrading hf-hub. 2023-08-29 14:14:26 +02:00
1d6bff53fc Changelog update. (#656) 2023-08-29 12:55:56 +01:00
14b4d456e8 Merge pull request #439 from huggingface/training_hub_dataset
[Book] Add small error management + start training (with generic dataset inclusion).
2023-08-29 13:10:05 +02:00
2d5b7a735d Update the book with new layout of datasets. 2023-08-29 12:51:59 +02:00
62ef494dc1 Use multiple transformer layers in the same cross-attn blocks. (#653)
* Use multiple transformer layers in the same cross-attn blocks.

* Make the context contiguous if required.
2023-08-29 11:13:43 +01:00
d0a330448d Backprop support for pooling ops. (#652)
* Backprop support for pooling ops.

* max-pool gradient.
2023-08-29 10:17:59 +01:00
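Commit d0a330448d adds backprop for the pooling ops, including the max-pool gradient. The key property is that max-pool routes each output gradient entirely to the input position that won the max; a 1-D, non-overlapping-window sketch in Python (illustrative, not candle's kernel):

```python
def max_pool_backward(xs, window, grad_out):
    # Route each output gradient to the argmax of its window; every
    # other input in the window receives zero gradient. 1-D sketch
    # with non-overlapping windows, for illustration only.
    grads = [0.0] * len(xs)
    for i, g in enumerate(grad_out):
        start = i * window
        chunk = xs[start:start + window]
        argmax = start + chunk.index(max(chunk))
        grads[argmax] += g
    return grads

g = max_pool_backward([1.0, 3.0, 2.0, 0.5], window=2, grad_out=[1.0, 1.0])
```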
4b8d57ba15 AVX version of the q4k vecdot. (#651) 2023-08-29 09:41:17 +01:00
d5a525f7a7 Fix clippy + save_image. 2023-08-29 10:19:44 +02:00
33c23c19b6 Preliminary support for SDXL. (#647)
* Preliminary support for SDXL.

* More SDXL support.

* More SDXL.

* Use the proper clip config.

* Querying for existing tensors.

* More robust test.
2023-08-29 09:00:04 +01:00
Lei 49326fb925 Update .gitignore (#649) 2023-08-29 08:41:33 +01:00
fd3131a4ce Fix the debug implementation. (#648) 2023-08-28 22:51:39 +01:00
037b41c9dc Cuda conv transpose (#645)
* Cuda kernel for conv-transpose.

* Fix the cuda kernel.

* Fix the tests.
2023-08-28 20:58:49 +01:00
72fae3140c Optimize the conv2d transpose cpu kernel. (#644)
* Optimize the conv2d transpose cpu kernel.

* Use multiple cores.
2023-08-28 20:06:31 +01:00
ca26198b95 Fix the cpu kernel for conv-transpose. (#643) 2023-08-28 16:45:12 +01:00
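The conv-transpose commits above (ca26198b95, 72fae3140c, 037b41c9dc) implement and optimize transposed convolution kernels. The operation can be pictured as each input element scattering a scaled copy of the kernel into the output; a naive 1-D Python sketch with no padding, purely for intuition and not candle's implementation:

```python
def conv_transpose1d(xs, kernel, stride=1):
    # Naive 1-D transposed convolution (no padding): each input element
    # scatters a scaled copy of the kernel into the output, offset by
    # i * stride. Output length is (len(xs) - 1) * stride + len(kernel).
    out = [0.0] * ((len(xs) - 1) * stride + len(kernel))
    for i, x in enumerate(xs):
        for j, k in enumerate(kernel):
            out[i * stride + j] += x * k
    return out

# Overlapping scatters sum in the middle of the output.
y = conv_transpose1d([1.0, 2.0], [1.0, 1.0], stride=1)
```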