candle/candle-kernels
Laurent Mazare af7f8b87d3 Custom op for RmsNorm (#1890)
* Trying out a custom RmsNorm cuda kernel.

* CPU implementation for rms-norm.

* Cuda wrappers.

* Add some validation.

* Add some testing.

* More testing.
2024-03-21 06:36:28 +01:00

candle-kernels

This crate contains the CUDA kernels used by candle. Some of these implementations come from the dfdx crate.