mirror of
https://github.com/huggingface/candle.git
synced 2025-06-16 10:38:54 +00:00

* Make it easier to use samples from the repo. * Use f32 for accumulation in the f16/bf16 kernels.
candle-kernels
This crate contains CUDA kernels used from candle. Some of these implementations come from the dfdx crate.