candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-16 10:38:54 +00:00

Author	SHA1	Message	Date
Laurent Mazare	0b175fcbbd	Fix the pyo3 build for macos. (#324 ) * Fix the pyo3 build for macos. * rustfmt fix.	2023-08-05 14:53:57 +01:00
Laurent Mazare	620f83cf66	Add the candle-datasets crate (#322 ) * Move the vision datasets to a separate crate. * Move the batcher bits. * Update the readme. * Move the tiny-stories bits. --------- Co-authored-by: Jane Doe <jane.doe@example.org>	2023-08-05 08:56:50 +01:00
Laurent Mazare	f7b2a0391d	Transpose the weight matrixes for llama2.c. (#321 )	2023-08-04 13:32:20 +01:00
Laurent Mazare	8b6f5be1cc	Support q5k quantized data. (#320 )	2023-08-04 09:51:30 +01:00
Laurent Mazare	df6667ba88	Add some tracing to llama. (#318 )	2023-08-03 13:52:22 +01:00
Laurent Mazare	a79286885c	Support safetensors weights in llama2.c inference. (#317 )	2023-08-03 11:10:58 +01:00
Laurent Mazare	74845a4dcd	Use the assert! function as it turns out to be const. (#316 )	2023-08-03 10:03:43 +01:00
Laurent Mazare	aa76b783eb	Q6K dequantization. (#315 )	2023-08-03 09:31:20 +01:00
Laurent Mazare	25564357f7	Support some ggml quantized types (#314 ) * Add the quantized types for GGML loading. * Support quantization for Q2K. * More quantization support. * Fix some clippy lints.	2023-08-03 09:16:26 +01:00
Laurent Mazare	634700d84a	Use some consts for ggml values. (#312 )	2023-08-02 22:03:05 +01:00
Laurent Mazare	e635f18eda	Initial support for reading ggml files. (#311 ) * Start adding support for reading ggml files. * Compute the proper tensor size. * Print the read tensors. * Fix file reading.	2023-08-02 21:59:02 +01:00
Laurent Mazare	52414ba5c8	Bugfix for the llama2 wasm example. (#310 ) * Clean-up the llama2.c wasm example. * Use a proper tokenizer. * Add a prompt. * Bugfix for the llama2 wasm example.	2023-08-02 17:32:36 +01:00
Laurent Mazare	186c308d51	Wasm llama2 tweaks (#309 ) * Clean-up the llama2.c wasm example. * Use a proper tokenizer.	2023-08-02 15:49:43 +01:00
Laurent Mazare	4f17290ce0	Use AdamW in the llama2 training. (#308 )	2023-08-02 14:14:02 +01:00
Laurent Mazare	0902846f25	Add the AdamW optimizer. (#307 ) * Add the AdamW optimizer. * Add some AdamW test validated against PyTorch.	2023-08-02 14:03:49 +01:00
Laurent Mazare	e2acbe1e72	Update the wasm example locations in the readme. (#306 )	2023-08-02 11:36:43 +01:00
Laurent Mazare	4fe8a02f88	Update the repo location. (#305 )	2023-08-02 11:12:18 +01:00
Laurent Mazare	03a421f714	Add some missing readme files. (#304 )	2023-08-02 10:57:12 +01:00
Laurent Mazare	d38943aadc	Add version numbers for all the candle crates (#303 ) * Switch to candle-gemm for the time being. * Add the missing versions.	2023-08-02 10:52:13 +01:00
Laurent Mazare	51e51da896	Rename the candle crate to candle-core (#301 ) * Rename to candle-core. * More candle-core renaming.	2023-08-02 08:20:22 +01:00
Laurent Mazare	6e33ff62d6	Update cudarc now that it includes the cublas-f16 and nccl changes. (#300 )	2023-08-02 05:54:28 +01:00
Laurent Mazare	4b3bd79fbd	Remove the embedding ops in favor of index-select. (#299 ) * Remove the embedding ops in favor of index-select. * Also remove the cuda kernels.	2023-08-02 05:42:11 +01:00
Laurent Mazare	cc76c63202	Use index-select for the embeddings as it supports backprop. (#298 )	2023-08-01 20:44:43 +01:00
Laurent Mazare	ff876c2103	Llama more training (#297 ) * Rework the var-builder to handle initializations. * Add some helper functions for layer creation. * Improve the layer initializations. * Get initialized variables. * Precompute the rot embeddings when training lamas.	2023-08-01 19:53:41 +01:00
Laurent Mazare	a27239f3d9	Add training for the llama2.c example (#296 ) * Rework the commands and run inference by default. * Add the training module and load the training dataset. * Random dataset iterator. * Proper valid-loss computation. * Compute the evaluation loss. * Add more substance to the training loop.	2023-08-01 17:23:07 +01:00
Nicolas Patry	babee9f011	Merge pull request #259 from LaurentMazare/book_2 Book 2 (load/save)	2023-08-01 17:26:57 +02:00
Nicolas Patry	afb5e24a63	Remove map ownership from `save`.	2023-08-01 17:19:22 +02:00
Nicolas Patry	89d1fd03e5	Adding new surface for savetensors (global load, global save).	2023-08-01 15:00:38 +02:00
Nicolas Patry	310094310b	Modifying `safetensors` export to get simple load and save.	2023-08-01 15:00:38 +02:00
Nicolas Patry	836ba3e090	Merge pull request #258 from LaurentMazare/start_book Starting the book.	2023-08-01 14:59:34 +02:00
Nicolas Patry	091e781977	Grammarly pass.	2023-08-01 14:26:02 +02:00
Nicolas Patry	5cead227ef	Adressed comments.	2023-08-01 14:26:02 +02:00
Nicolas Patry	ebd0315623	Typo.	2023-08-01 14:26:02 +02:00
Nicolas Patry	ad9d8fe400	Complexifying our hello world	2023-08-01 14:26:02 +02:00
Nicolas Patry	5bc5716b85	Revert "Making sure the CI actually works" This reverts commit 699346b603cec1f279d94e9aa3210c193ba973f8.	2023-08-01 14:26:02 +02:00
Nicolas Patry	ba37de94d4	Making sure the CI actually works	2023-08-01 14:26:02 +02:00
Nicolas Patry	6242a1470e	Starting the book.	2023-08-01 14:26:02 +02:00
Laurent Mazare	75e0448114	Move the weight bits in a separate module. (#295 )	2023-08-01 10:37:06 +01:00
Laurent Mazare	614f911e9e	Add some batcher variants that handle errors. (#294 )	2023-08-01 09:40:34 +01:00
Laurent Mazare	e1e8127f15	Add the batcher. (#293 )	2023-08-01 09:16:10 +01:00
Laurent Mazare	fa98ca0c35	Use subcommands in llama2. (#292 )	2023-08-01 05:57:41 +01:00
Laurent Mazare	1a07ff8d17	Pre-tokenized evaluation mode for llama2.c. (#291 )	2023-08-01 05:36:25 +01:00
Laurent Mazare	f28558d0b7	Evaluate on the pre-tokenized file. (#290 )	2023-07-31 21:31:38 +01:00
Laurent Mazare	6b98b66eb3	Remove the end of text tokens. (#289 )	2023-07-31 20:43:57 +01:00
Laurent Mazare	9ae1f6afee	Add an eval mode to llama2-c (#288 ) * Add an eval mode to llama2-c. * Encode line by line. * Get the eval to run.	2023-07-31 17:22:14 +01:00
Laurent Mazare	1064b9b031	Add the cross-entropy loss. (#287 )	2023-07-31 14:26:36 +01:00
Laurent Mazare	ffeafbfc43	Make the nll op closer to the pytorch version + add a test. (#286 )	2023-07-31 14:14:01 +01:00
Laurent Mazare	b3ea96b62b	Add a prompt and support more models in llama2-c. (#285 ) * Support more models in llama2-c. * Add a prompt.	2023-07-31 13:09:30 +01:00
Laurent Mazare	94a43faaca	Use the hub models for llama2.c (#284 )	2023-07-31 12:51:14 +01:00
Laurent Mazare	62a9b03715	Add a flag to set the number of epochs in the mnist training (#283 ) * Add a flag to change the number of epochs for the mnist training. * Increase the learning rate for the MLP.	2023-07-31 10:32:14 +01:00

1 2 3 4 5 ...

733 Commits