Files
candle/candle-kernels
Laurent Mazare e676f85f00 Sketch a fast cuda kernel for reduce-sum. (#109)
* Sketch a fast cuda kernel for reduce-sum.

* Sketch the rust support code for the fast sum kernel.

* More work on the fast kernel.

* Add some testing ground.

* A couple fixes for the fast sum kernel.
2023-07-08 12:43:56 +01:00
..
2023-07-05 18:12:17 +02:00
2023-06-27 11:57:27 +02:00
2023-06-27 11:57:27 +02:00

candle-kernels

This crate contains CUDA kernels used from candle. Some of these implementations come from the dfdx crate.