Files
candle/candle-transformers
Laurent Mazare 36cf54525d Fix the fast bf16 gemm cublas kernels. (#2274)
* Use flash-attn in gemma.

* Fix for the fast bf16 cublas gemm.

* Fix some clippy lints.

* Fix another lint.

* Proper clippy fix.
2024-06-18 23:46:58 +02:00
..
2024-03-23 15:26:09 +01:00
2024-03-02 18:50:01 +01:00

candle-transformers