Files
candle/candle-kernels/src
Laurent Mazare eb26e2467e Add the cuda dequantize f16 kernels. (#2137)
* Add the cuda dequantize f16 kernels.

* Expose the cuda kernels.

* Add some testing + fix.

* Test the other cases too.

* A few more tests.

* Add an environment variable to enable the dequantize f16 + matmul behavior.
2024-04-28 20:05:05 +02:00
..
2024-03-20 18:32:55 +01:00
2023-08-23 10:42:19 +01:00
2023-08-10 17:46:47 +02:00
2024-02-12 15:03:18 +01:00
2024-04-27 20:17:35 +02:00
2024-04-05 08:32:58 +02:00
2024-04-27 20:17:35 +02:00
2023-08-23 10:42:19 +01:00