Commit Graph

37 Commits

SHA1 Message Date
096dee7073 Bump the version to 0.3.0. (#1014)
* Bump the version to 0.3.0.

* Changelog update.
2023-10-01 13:51:57 +01:00
9571b200c9 fix firstToken, minor ui changes (#971) 2023-09-27 06:01:59 +01:00
a084f65f9a fix rep penalty min value (#963) 2023-09-26 05:23:50 +01:00
7dd8e12472 Bump the crate versions to v0.2.3. (#886)
* Bump the crate version.

* Also update the python bindings.
2023-09-18 12:14:03 +01:00
5cefbba757 minor UI fixes (#856)
* fixes

* remove listener

* remove event listener
2023-09-15 06:30:50 +01:00
805bf9ffa7 Implement top_p / nucleus sampling (#819)
* Implement top_p / nucleus sampling

* Update changelog

* rustfmt

* Add tests

* Fix clippy warning

* Fix another clippy error
2023-09-12 18:10:16 +02:00
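Note on the technique in the commit above: top_p / nucleus sampling restricts sampling to the smallest set of tokens whose cumulative probability reaches p, renormalizes within that set, and draws from it. A minimal sketch in plain Rust, independent of the candle API (the function name, signature, and the externally supplied uniform draw `u` are illustrative):

```rust
// Minimal sketch of top-p (nucleus) sampling: keep the shortest prefix of
// tokens (by descending probability) whose cumulative mass reaches `top_p`,
// renormalize, then sample from it using a uniform draw `u` in [0, 1).
fn sample_top_p(probs: &[f32], top_p: f32, u: f32) -> usize {
    // Sort token indices by descending probability.
    let mut idx: Vec<usize> = (0..probs.len()).collect();
    idx.sort_by(|&a, &b| probs[b].partial_cmp(&probs[a]).unwrap());

    // Find the cut-off: the nucleus is the shortest prefix with mass >= top_p.
    let mut cumsum = 0.0;
    let mut cutoff = idx.len();
    for (i, &t) in idx.iter().enumerate() {
        cumsum += probs[t];
        if cumsum >= top_p {
            cutoff = i + 1;
            break;
        }
    }

    // Sample within the nucleus, renormalizing by the truncated mass.
    let mut acc = 0.0;
    for &t in &idx[..cutoff] {
        acc += probs[t] / cumsum;
        if u < acc {
            return t;
        }
    }
    idx[cutoff - 1]
}

fn main() {
    let probs = [0.5, 0.3, 0.1, 0.05, 0.05];
    println!("{}", sample_top_p(&probs, 0.9, 0.42));
}
```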
2257f4d475 Bump the crate version + update the changelog. (#822) 2023-09-12 06:39:24 +01:00
16bf44f6e9 force model cache (#751) 2023-09-06 05:53:31 +02:00
6a40decc76 Minor WASM UI improvements (#748)
* add stats

* random seed btn

* minor UI improvements
2023-09-05 19:24:43 +01:00
0d00c06a83 Fix clippy lint. (#736) 2023-09-04 16:09:19 +01:00
8395152d20 Llama2c WASM UI improvements (#732)
* pass seed, expose model seq_len

* wip new llama2.c ui

* final new UI example

* small copy

* copy
2023-09-04 15:59:22 +01:00
e2f9f60ac2 Avoid some redundant clone. (#731) 2023-09-04 09:18:32 +02:00
2c1df6bba1 Add a repeat penalty to the llama2-c command line example. (#713)
* Add a repeat penalty to the llama2-c command line example.

* Another fix attempt.
2023-09-01 20:38:58 +01:00
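Note on the repeat penalty added above: a common formulation (the one this sketch assumes, not necessarily the exact candle code) scales down the logits of tokens already present in the context so the sampler is less likely to repeat them. Plain Rust, with an illustrative function name:

```rust
// Minimal sketch of a repeat penalty applied to raw logits: tokens already
// seen in the context get their probability pushed down for penalty > 1.
fn apply_repeat_penalty(logits: &mut [f32], penalty: f32, context: &[usize]) {
    for &token in context {
        if let Some(logit) = logits.get_mut(token) {
            // Dividing a positive logit (or multiplying a negative one) by the
            // penalty lowers that token's post-softmax probability.
            if *logit >= 0.0 {
                *logit /= penalty;
            } else {
                *logit *= penalty;
            }
        }
    }
}

fn main() {
    let mut logits = vec![2.0, -1.0, 0.5];
    apply_repeat_penalty(&mut logits, 1.1, &[0, 1]);
    println!("{logits:?}"); // [1.818..., -1.1, 0.5]
}
```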
4d56cef583 Handle the empty sequence case properly. (#712)
* Handle the empty sequence case properly.

* Proper fix.
2023-09-01 20:12:30 +01:00
2fef14cb14 Add a repeat penalty to the llama2.c wasm example. (#709) 2023-09-01 19:32:28 +01:00
8e84d8a59b Llama2.c wasm module. (#686) 2023-08-31 07:44:32 +01:00
618f4e4c78 Add some documentation. (#673)
* Add some documentation.

* Bump the crate version.
2023-08-30 11:54:00 +01:00
a3f97c143d Bump the crate version + update CHANGELOG. (#628) 2023-08-27 18:17:11 +01:00
aba1e90797 Add a groups parameter to convolutions. (#566)
* Add a groups parameter to convolutions.

* Avoid some unnecessary groups checks.

* Move the tensor convolution bits.

* Proper handling of groups.

* Bump the crate version.

* And add a changelog.
2023-08-23 12:58:55 +01:00
a8f61e66cc Bump the crates version to 0.1.2. (#522) 2023-08-20 08:07:07 +01:00
c78ce76501 Add a simple Module trait and implement it for the various nn layers (#500)
* Start adding the module trait.

* Use the module trait.

* Implement module for qmatmul.
2023-08-18 09:38:22 +01:00
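Note on the Module trait introduced above: the idea is a single `forward` entry point shared by the nn layers so they can be composed generically. The sketch below is a simplified stand-in in plain Rust; the `Tensor` type, the `Scale` layer, and the infallible `forward` signature are illustrative and do not mirror the actual candle API:

```rust
// Simplified sketch of a shared `Module` trait. `Tensor` is a stand-in type,
// not the candle tensor.
#[derive(Debug, Clone)]
struct Tensor(Vec<f32>);

trait Module {
    fn forward(&self, xs: &Tensor) -> Tensor;
}

// A toy "layer" that scales its input, just to show the composition pattern.
struct Scale(f32);

impl Module for Scale {
    fn forward(&self, xs: &Tensor) -> Tensor {
        Tensor(xs.0.iter().map(|v| v * self.0).collect())
    }
}

// Generic helper that works for any layer implementing the trait.
fn apply<M: Module>(m: &M, xs: &Tensor) -> Tensor {
    m.forward(xs)
}

fn main() {
    let layer = Scale(2.0);
    println!("{:?}", apply(&layer, &Tensor(vec![1.0, 2.0])));
}
```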
13401df4d1 Add an abstract type for RmsNorm. (#499) 2023-08-18 08:52:14 +01:00
d32e8199cd Layer norm tweaks (#482)
* Add some options to make layer-norm more configurable.

* Add the rms-norm variant.

* Replace the RmsNorm with the shared bits.
2023-08-17 10:07:13 +01:00
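Note on the rms-norm variant mentioned above: unlike layer norm it skips the mean subtraction and bias, rescaling by the root mean square of the activations before applying a learned weight. A minimal sketch in plain Rust (the function name and flat-slice signature are illustrative):

```rust
// Sketch of RMS normalization over a single vector of activations:
// y_i = x_i / sqrt(mean(x^2) + eps) * weight_i
fn rms_norm(xs: &[f32], weight: &[f32], eps: f32) -> Vec<f32> {
    let mean_sq = xs.iter().map(|v| v * v).sum::<f32>() / xs.len() as f32;
    let scale = 1.0 / (mean_sq + eps).sqrt();
    xs.iter().zip(weight).map(|(x, w)| x * scale * w).collect()
}

fn main() {
    println!("{:?}", rms_norm(&[1.0, 2.0, 3.0], &[1.0, 1.0, 1.0], 1e-5));
}
```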
531f23b4d0 Rename vec-dot to vec-ops. (#449)
* Rename vec-dot to vec-ops.

* Also bump the crate version.

* Add a currently empty readme.
2023-08-15 10:48:57 +01:00
52414ba5c8 Bugfix for the llama2 wasm example. (#310)
* Clean-up the llama2.c wasm example.

* Use a proper tokenizer.

* Add a prompt.

* Bugfix for the llama2 wasm example.
2023-08-02 17:32:36 +01:00
186c308d51 Wasm llama2 tweaks (#309)
* Clean-up the llama2.c wasm example.

* Use a proper tokenizer.
2023-08-02 15:49:43 +01:00
4fe8a02f88 Update the repo location. (#305) 2023-08-02 11:12:18 +01:00
d38943aadc Add version numbers for all the candle crates (#303)
* Switch to candle-gemm for the time being.

* Add the missing versions.
2023-08-02 10:52:13 +01:00
51e51da896 Rename the candle crate to candle-core (#301)
* Rename to candle-core.

* More candle-core renaming.
2023-08-02 08:20:22 +01:00
ba2254556c Display the temperature being used for text generation. (#278) 2023-07-30 09:53:05 +01:00
4bf2ebf836 Use u8 tensors for masks. (#273) 2023-07-29 11:32:58 +01:00
3eb2bc6d07 Softmax numerical stability. (#267)
* Softmax numerical stability.

* Fix the flash-attn test.
2023-07-28 13:13:01 +01:00
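Note on the softmax numerical-stability fix above: the standard trick is to subtract the maximum logit before exponentiating, which avoids overflow without changing the result. A minimal sketch in plain Rust (illustrative, not the candle kernel):

```rust
// Numerically stable softmax: exp(x - max) instead of exp(x).
fn softmax(logits: &[f32]) -> Vec<f32> {
    let max = logits.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    let exps: Vec<f32> = logits.iter().map(|&x| (x - max).exp()).collect();
    let sum: f32 = exps.iter().sum();
    exps.into_iter().map(|e| e / sum).collect()
}

fn main() {
    // Without the max subtraction, exp(1000.0) would overflow to infinity.
    println!("{:?}", softmax(&[1000.0, 999.0, 998.0]));
}
```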
81bfa46702 Updated. 2023-07-26 15:21:50 +02:00
035372248e Simple QOL.
- Add ms/token on llama2.c (15ms/token on my personal machine)
- Hide `Run` buttons while models are not ready
- Add dummy `progress` while weights are downloading (I briefly looked
  at putting in a real progress bar, but nothing easy enough came up.)
2023-07-26 15:17:32 +02:00
97990f4afc Add number of tokens. 2023-07-26 14:57:20 +02:00
160ba09d30 Polish the llama2 wasm ui. (#232)
* Polish the llama2 wasm ui.

* readme update.
2023-07-24 15:28:27 +01:00
5a26cba733 Re-organize the wasm examples (#231)
* Move the whisper example.

* More renaming.

* Add llama2 as a new wasm example.

* Live generation.

* More of the llama wasm example.

* Formatting.
2023-07-24 12:36:02 +01:00