candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-16 10:38:54 +00:00

Author	SHA1	Message	Date
Laurent Mazare	ba35d895e7	Sketch the candle-transformers crate. (#147) * Sketch the candle-transformers crate. * Format the empty files.	2023-07-12 13:49:31 +01:00
Laurent Mazare	9ce0f1c010	Sketch the candle-nn crate. (#115) * Sketch the candle-nn crate. * Tweak the cuda dependencies. * More cuda tweaks.	2023-07-10 08:50:09 +01:00
Laurent Mazare	4afa461b34	Sketch the Falcon model. (#93) * Sketch the Falcon model. * Add more substance to the falcon example. * Falcon (wip). * Falcon (wip again). * Falcon inference. * Get the weights from the api and properly generate the model. * Use the proper model. * Fix the file/revision names. * Fix bias handling. * Recompute the rot embeddings. * Fix the input shape. * Add the release-with-debug profile. * Silly bugfix. * More bugfixes. * Stricter shape checking in matmul.	2023-07-06 19:01:21 +01:00
laurent	fdb1acd2ff	Move llama in a cargo-examples directory.	2023-07-03 11:30:58 +01:00
laurent	ebb0fedf14	Very simple pyo3 bindings for candle.	2023-07-01 20:36:44 +01:00
laurent	af66f0829e	Revert the new profile.	2023-06-29 19:08:50 +01:00
laurent	3232df9458	Add some KV cache to llama.	2023-06-29 15:29:40 +01:00
Nicolas Patry	1a82bc50c9	[Tmp] Adding candle-hub	2023-06-27 13:58:23 +02:00
Nicolas Patry	d7f729fb8f	Refactor the hierarchy.	2023-06-27 11:57:27 +02:00
laurent	22da2c7e02	More f16 and bf16 support.	2023-06-26 20:52:01 +01:00
laurent	a31411fd91	Start adding f16/bf16 support.	2023-06-26 19:37:47 +01:00
laurent	11696e6377	Faster model weight loading.	2023-06-26 07:40:11 +01:00
laurent	96c098b6cd	Remove the unnecessary features.	2023-06-24 18:15:44 +01:00
laurent	a7f80e258f	Read and write npy files.	2023-06-24 18:12:10 +01:00
Nicolas Patry	04cf14f35a	Moving to `gemm` and adding matmul backprop. - Tentative `T` operator.	2023-06-22 12:37:02 +02:00
Nicolas Patry	9ea220fc6e	Fixing tokenizers dep.	2023-06-22 12:25:58 +02:00
Nicolas Patry	ce977b489e	Adding matmul?	2023-06-22 12:25:58 +02:00
laurent	083ced4428	Integrate the kernels bits.	2023-06-22 09:59:00 +01:00
laurent	7adffafeda	Abstract the gradient storage.	2023-06-21 14:29:48 +01:00
laurent	9698211d56	Add some very basic tensor type.	2023-06-19 17:26:50 +01:00

20 Commits