Files
candle/candle-kernels
Laurent Mazare 1df2bddccf Add the layernorm specialized op. (#2212)
* Add the layernorm cuda kernels.

* Dedicated layer norm op.

* Add the slower variant.

* Plug the cuda implementation.

* Add the metal variant.

* Add a dedicated test.

* Bugfix.
2024-05-24 15:58:01 +02:00
..

candle-kernels

This crate contains CUDA kernels used from candle. Some of these implementations come from the dfdx crate.