15 Commits

Author SHA1 Message Date
bb3471ea31 Adapt more examples to the updated safetensors api. (#947)
* Simplify the safetensors usage.

* Convert more examples.

* Move more examples.

* Adapt stable-diffusion.
2023-09-23 21:26:03 +01:00
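The updated loading path in these examples reads model weights from safetensors files. A minimal sketch of what that can look like, assuming the weights are exposed through `candle_nn::VarBuilder::from_mmaped_safetensors` (the file path and dtype below are illustrative, not taken from any specific example):

```rust
use candle_core::{DType, Device, Result};
use candle_nn::VarBuilder;

fn main() -> Result<()> {
    let device = Device::Cpu;
    // Illustrative path; the real examples fetch the file from the Hugging Face Hub.
    let weights = vec!["model.safetensors".to_string()];
    // Memory-map the safetensors file(s) and expose the named tensors through a
    // VarBuilder, which a model constructor then pulls its parameters from.
    let vb = unsafe { VarBuilder::from_mmaped_safetensors(&weights, DType::F32, &device)? };
    let _ = vb; // in a real example, pass `vb` into the model's constructor
    Ok(())
}
```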
e6f040d6e3 Readme gallery (#834)
* More readme tweaks.

* Update README.md
2023-09-13 09:05:47 +01:00
e82fcf1c59 Add more example readmes. (#828)
* Add more readmes.

* Add a readme for dinov2.

* Add some skeleton files for a couple more examples.

* More whisper details.
2023-09-12 17:21:24 +01:00
805bf9ffa7 Implement top_p / nucleus sampling (#819)
* Implement top_p / nucleus sampling

* Update changelog

* rustfmt

* Add tests

* Fix clippy warning

* Fix another clippy error
2023-09-12 18:10:16 +02:00
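Nucleus (top-p) sampling keeps only the smallest set of highest-probability tokens whose cumulative probability reaches `top_p`, renormalizes over that set, and samples from it. A standalone sketch of the idea in plain Rust (illustrative only, not the code from #819; it assumes the `rand` crate):

```rust
use rand::Rng;

/// Sample a token index from `probs` using nucleus (top-p) sampling.
/// `probs` is assumed to already be a normalized probability distribution.
fn sample_top_p(probs: &[f32], top_p: f32, rng: &mut impl Rng) -> usize {
    // Sort token indices by probability, highest first.
    let mut indices: Vec<usize> = (0..probs.len()).collect();
    indices.sort_by(|&a, &b| probs[b].total_cmp(&probs[a]));

    // Keep the smallest prefix whose cumulative probability reaches top_p.
    let mut cumulative = 0.0;
    let mut cutoff = indices.len();
    for (i, &idx) in indices.iter().enumerate() {
        cumulative += probs[idx];
        if cumulative >= top_p {
            cutoff = i + 1;
            break;
        }
    }
    let nucleus = &indices[..cutoff];

    // Renormalize over the nucleus and draw a sample.
    let total: f32 = nucleus.iter().map(|&i| probs[i]).sum();
    let mut r = rng.gen::<f32>() * total;
    for &idx in nucleus {
        r -= probs[idx];
        if r <= 0.0 {
            return idx;
        }
    }
    nucleus[nucleus.len() - 1]
}
```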
d3f05eae8c Move some models to candle-transformers so that they're easier to re-use. (#794)
* Move some models to candle-transformers so that they can be shared.

* Also move falcon.

* Move Llama.

* Move whisper (partial).
2023-09-10 09:40:27 +01:00
a1812f934f Add a yolo-v3 example. (#528)
* Add a couple of functions required for yolo.

* Add the yolo-v3 example.

* Add minimum and maximum.

* Use the newly introduced maximum.

* Cuda support for min/max + add some testing.

* Allow for more tests to work with accelerate.

* Fix a typo.
2023-08-20 18:19:37 +01:00
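The yolo-v3 box decoding relies on the element-wise minimum/maximum introduced here, with CPU, CUDA, and accelerate coverage. A hedged sketch of their use, assuming the ops are exposed as `Tensor::minimum` / `Tensor::maximum` as they are in current candle:

```rust
use candle_core::{Device, Result, Tensor};

fn main() -> Result<()> {
    let device = Device::Cpu; // Device::new_cuda(0)? would exercise the CUDA path
    // Two same-shape tensors, e.g. box edge coordinates from two predictions.
    let a = Tensor::new(&[0.2f32, 0.7, 1.3, -0.1], &device)?;
    let b = Tensor::new(&[0.5f32, 0.4, 0.9, 0.0], &device)?;

    // Element-wise max/min, as needed when intersecting bounding boxes.
    let upper = a.maximum(&b)?;
    let lower = a.minimum(&b)?;
    println!("{upper}\n{lower}");
    Ok(())
}
```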
c78ce76501 Add a simple Module trait and implement it for the various nn layers (#500)
* Start adding the module trait.

* Use the module trait.

* Implement module for qmatmul.
2023-08-18 09:38:22 +01:00
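The `Module` trait gives each layer a single `forward` entry point so layers can be composed and called uniformly. A small sketch of implementing it for a custom layer, assuming the trait shape used by candle-nn (`fn forward(&self, xs: &Tensor) -> Result<Tensor>`):

```rust
use candle_core::{Result, Tensor};
use candle_nn::Module;

/// A toy layer that just scales its input.
struct Scale {
    factor: f64,
}

impl Module for Scale {
    fn forward(&self, xs: &Tensor) -> Result<Tensor> {
        // y = factor * x + 0
        xs.affine(self.factor, 0.0)
    }
}
```

Per the commit above, the built-in nn layers and `QMatMul` implement the same trait, so a model's forward pass can chain `forward` calls without caring about the concrete layer type.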
c84883ecf2 Add a cuda kernel for upsampling. (#441)
* Add a cuda kernel for upsampling.

* Update for the latest tokenizers version.
2023-08-14 13:12:17 +01:00
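For reference, nearest-neighbour upsampling operates on NCHW tensors, and on a CUDA device the call below would now dispatch to the new kernel. This sketch assumes the op is exposed as `Tensor::upsample_nearest2d(h, w)`, as it is in current candle:

```rust
use candle_core::{Device, Result, Tensor};

fn main() -> Result<()> {
    let device = Device::Cpu; // Device::new_cuda(0)? would hit the new CUDA kernel
    // A small NCHW tensor: batch=1, channels=1, 2x2 spatial.
    let t = Tensor::arange(0f32, 4f32, &device)?.reshape((1, 1, 2, 2))?;
    // Nearest-neighbour upsampling to 4x4.
    let up = t.upsample_nearest2d(4, 4)?;
    println!("{up}");
    Ok(())
}
```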
4bf2ebf836 Use u8 tensors for masks. (#273) 2023-07-29 11:32:58 +01:00
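Storing masks as u8 keeps them compact and makes the keep/drop intent explicit. A hedged sketch of building a causal mask as a u8 tensor and applying it to attention scores, assuming `Tensor::where_cond` with a u8 condition tensor (shapes and values are illustrative):

```rust
use candle_core::{DType, Device, Result, Tensor};

fn main() -> Result<()> {
    let device = Device::Cpu;
    let t = 4usize; // sequence length

    // Lower-triangular causal mask stored as u8: 1 = keep, 0 = mask out.
    let mask: Vec<u8> = (0..t)
        .flat_map(|i| (0..t).map(move |j| u8::from(j <= i)))
        .collect();
    let mask = Tensor::from_vec(mask, (t, t), &device)?;

    // Apply the mask to a matrix of attention scores.
    let scores = Tensor::zeros((t, t), DType::F32, &device)?;
    let neg_inf = Tensor::full(f32::NEG_INFINITY, (t, t), &device)?;
    let masked = mask.where_cond(&scores, &neg_inf)?;
    println!("{masked}");
    Ok(())
}
```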
cb8dd5cd53 Back to using the main branch now that the PR has been merged. (#270) 2023-07-28 16:22:44 +01:00
a0e47aba98 Fix the revision used in starcoder to use the safetensors PR. (#269) 2023-07-28 14:02:31 +01:00
3eb2bc6d07 Softmax numerical stability. (#267)
* Softmax numerical stability.

* Fix the flash-attn test.
2023-07-28 13:13:01 +01:00
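The standard stability trick is to subtract the row maximum before exponentiating: the result is mathematically unchanged, but `exp` can no longer overflow for large logits. A plain-Rust illustration of the idea (not candle's implementation):

```rust
/// Numerically stable softmax over a slice of logits.
fn softmax(logits: &[f32]) -> Vec<f32> {
    // Subtracting the max shifts the largest term to exp(0) = 1, so exp()
    // never overflows; ratios between terms, and thus the output, are unchanged.
    let max = logits.iter().copied().fold(f32::NEG_INFINITY, f32::max);
    let exps: Vec<f32> = logits.iter().map(|&x| (x - max).exp()).collect();
    let sum: f32 = exps.iter().sum();
    exps.iter().map(|&e| e / sum).collect()
}
```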
68eab38de6 Cuda fix for starcoder. (#266)
* Cuda fix for starcoder.

* Nicer output.
2023-07-28 12:13:41 +01:00
3e89df938c Starcoder fix (#264)
* Bugfix for starcoder.

* Get some proper code generation.

* Slightly simpler softmax.
2023-07-28 11:17:49 +01:00
6a54ca115e Add the Bigcode model (#260)
* Start sketching the bigcode gpt model.

* Sketch the bigcode model.

* Implement the attention mechanism.

* Random reshaping.

* Sketch more of the example.

* Add some kv cache.

* Properly generate the position ids.

* Proper attention mask.

* Bail on upcasting.

* Properly apply the attention mask.

* Add the smaller starcoder variants.

* Update for the new hub api.

* Fix a shape issue.

* Fix another shape issue.

* Get some logits out.

* Adjust the weight names.
2023-07-28 09:57:32 +01:00
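At each decoding step the attention block appends the new keys/values to a cache and attends over the full past-plus-current sequence, which is also what the position ids and attention mask above have to account for. A hedged sketch of the cache append, assuming candle's `Tensor::cat` along the sequence dimension (the struct and layout are illustrative, not the #260 code):

```rust
use candle_core::{Result, Tensor};

/// Illustrative kv cache: keeps the keys/values seen so far and
/// concatenates the new step along the sequence dimension.
struct KvCache {
    k: Option<Tensor>, // (batch, heads, seq, head_dim)
    v: Option<Tensor>,
}

impl KvCache {
    fn append(&mut self, k: &Tensor, v: &Tensor) -> Result<(Tensor, Tensor)> {
        let k = match &self.k {
            Some(prev) => Tensor::cat(&[prev, k], 2)?, // grow along the seq axis
            None => k.clone(),
        };
        let v = match &self.v {
            Some(prev) => Tensor::cat(&[prev, v], 2)?,
            None => v.clone(),
        };
        self.k = Some(k.clone());
        self.v = Some(v.clone());
        Ok((k, v))
    }
}
```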