mirror of
https://github.com/huggingface/candle.git
synced 2025-06-15 18:28:24 +00:00

* Add the rope THD kernel. * Cuda kernel for rope-thd. * Add the metal kernels. * Add a dedicated test.
candle-kernels
This crate contains CUDA kernels used from candle. Some of these implementations come from the dfdx crate.