Mirror of https://github.com/huggingface/candle.git (synced 2025-06-16 10:38:54 +00:00)

* Improved launch config for layer-norm/rms-norm.
* Add more testing for the fused layer/rms norm kernels.
candle-kernels
This crate contains the CUDA kernels used by candle. Some of these implementations come from the dfdx crate.
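One way to test a fused kernel such as the rms-norm kernel mentioned above is to compare its output against a plain CPU reference. The sketch below is only an illustration of that idea, using the standard RMS-norm definition y[i] = x[i] / sqrt(mean(x^2) + eps) * weight[i]. The name `rms_norm_ref` and its signature are hypothetical and are not part of this crate, and the actual kernels may differ in accumulation precision and memory layout.

```rust
/// CPU reference for RMS norm over a single row (hypothetical helper,
/// not part of candle-kernels):
/// y[i] = x[i] / sqrt(mean(x^2) + eps) * weight[i]
fn rms_norm_ref(x: &[f32], weight: &[f32], eps: f32) -> Vec<f32> {
    assert!(!x.is_empty(), "input row must not be empty");
    assert_eq!(x.len(), weight.len(), "x and weight must have the same length");

    // Accumulate the mean of squares in f64 to limit rounding error.
    let mean_sq = x.iter().map(|&v| (v as f64) * (v as f64)).sum::<f64>() / x.len() as f64;
    let inv_rms = 1.0 / (mean_sq + eps as f64).sqrt();

    // Normalize each element, then apply the per-element scale.
    x.iter()
        .zip(weight.iter())
        .map(|(&v, &w)| ((v as f64) * inv_rms) as f32 * w)
        .collect()
}

fn main() {
    let x = [3.0f32, 4.0];
    let weight = [1.0f32, 1.0];
    let y = rms_norm_ref(&x, &weight, 0.0);
    println!("{:?}", y);
}
```

A GPU result would typically be compared to such a reference with a small absolute or relative tolerance rather than exact equality, since the order of floating-point reductions on the device differs from the sequential sum used here.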