candle/candle-kernels
Laurent Mazare 2817643db9 Add the mmv kernels for small batch sizes. (#2075)
* Add the mmv kernels for smaller sizes.

* Support more mmv kernels.

* Use the new kernels.

* Fix the call.

* Silly fix.

* Improve the testing.

* Fix for dmmv.

* Add another dedicated test for the batching mmv.
2024-04-16 21:30:51 +02:00
candle-kernels

This crate contains the CUDA kernels used by candle. Some of these implementations come from the dfdx crate.