Files
candle/candle-core/src
Laurent Mazare eb26e2467e Add the cuda dequantize f16 kernels. (#2137)
* Add the cuda dequantize f16 kernels.

* Expose the cuda kernels.

* Add some testing + fix.

* Test the other cases too.

* A few more tests.

* Add an environment variable to enable the dequantize f16 + matmul behavior.
2024-04-28 20:05:05 +02:00
..
2023-09-19 19:54:28 +01:00
2024-04-23 13:23:27 +02:00
2024-04-23 13:23:27 +02:00
2024-04-27 20:17:35 +02:00
2024-04-23 13:23:27 +02:00
2023-08-23 10:42:19 +01:00
2024-04-23 13:23:27 +02:00
2024-04-23 13:23:27 +02:00
2024-04-23 13:23:27 +02:00
2024-04-23 13:23:27 +02:00
2023-12-17 05:20:05 -06:00
2024-04-27 20:17:35 +02:00
2024-02-18 19:33:55 +01:00
2024-03-26 17:05:26 +01:00
2024-04-28 08:18:04 +02:00
2024-04-23 13:23:27 +02:00
2023-11-20 14:38:35 +01:00