Files
candle/candle-kernels
Laurent Mazare 4f92420132 Add some flash attn test (#253)
* Add some flash-attn test.

* Add the cpu test.

* Fail when the head is not a multiple of 8.

* Polish the flash attention test.
2023-07-26 20:56:00 +01:00
..
2023-07-26 20:56:00 +01:00
2023-06-27 11:57:27 +02:00

candle-kernels

This crate contains CUDA kernels used from candle. Some of these implementations come from the dfdx crate.