Mirror of https://github.com/huggingface/candle.git (synced 2025-06-17 19:18:50 +00:00)

* Trying out a custom RmsNorm cuda kernel.
* CPU implementation for rms-norm.
* Cuda wrappers.
* Add some validation.
* Add some testing.
* More testing.
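
For readers unfamiliar with the operation, here is a minimal reference sketch of RMS normalization over the last dimension on the CPU. It is not the code from this commit: the function name `rms_norm_reference`, its signature, and the flat row-major `f32` layout are assumptions made for illustration, and the shape checks only hint at the kind of validation the commit message mentions.

```rust
/// Hypothetical reference implementation (not candle's actual kernel).
/// Normalizes each contiguous row of length `dim` by its root mean square,
/// then scales element-wise by `alpha`.
fn rms_norm_reference(xs: &[f32], alpha: &[f32], dim: usize, eps: f32) -> Vec<f32> {
    // Basic shape validation: rows must tile the buffer exactly,
    // and there must be one scale weight per element of a row.
    assert!(dim > 0 && xs.len() % dim == 0, "input length must be a multiple of dim");
    assert_eq!(alpha.len(), dim, "alpha must have one weight per element of a row");

    let mut out = Vec::with_capacity(xs.len());
    for row in xs.chunks(dim) {
        // Mean of squares over the row.
        let mean_sq = row.iter().map(|v| v * v).sum::<f32>() / dim as f32;
        // eps keeps the division stable when the row is (close to) all zeros.
        let scale = 1.0 / (mean_sq + eps).sqrt();
        out.extend(row.iter().zip(alpha).map(|(v, a)| v * scale * a));
    }
    out
}
```

A plain loop like this can serve as a baseline to compare a custom kernel's output against within a small tolerance, which is one plausible way to test such a kernel.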