Laurent Mazare
eb26e2467e
Add the cuda dequantize f16 kernels. ( #2137 )
...
* Add the cuda dequantize f16 kernels.
* Expose the cuda kernels.
* Add some testing + fix.
* Test the other cases too.
* A few more tests.
* Add an environment variable to enable the dequantize f16 + matmul behavior.
2024-04-28 20:05:05 +02:00
..
2023-09-19 19:54:28 +01:00
2024-04-23 13:23:27 +02:00
2024-04-23 13:23:27 +02:00
2024-04-27 20:17:35 +02:00
2024-04-28 20:05:05 +02:00
2024-02-14 10:27:22 +01:00
2024-04-23 13:23:27 +02:00
2024-04-18 14:31:41 +02:00
2024-02-18 21:28:07 +01:00
2023-08-23 10:42:19 +01:00
2024-03-23 14:16:19 +01:00
2024-04-23 13:23:27 +02:00
2024-03-08 15:04:18 +01:00
2024-04-23 13:23:27 +02:00
2024-04-23 13:23:27 +02:00
2024-04-23 13:23:27 +02:00
2024-04-20 22:19:46 +02:00
2023-12-17 05:20:05 -06:00
2024-03-08 10:52:22 +01:00
2024-04-27 20:17:35 +02:00
2024-02-14 10:27:22 +01:00
2024-04-22 16:23:27 +02:00
2024-04-04 22:32:47 +02:00
2024-02-18 19:33:55 +01:00
2023-09-23 22:57:42 +01:00
2023-09-08 20:13:29 +01:00
2024-03-26 17:05:26 +01:00
2024-04-28 08:18:04 +02:00
2024-03-25 11:48:16 +01:00
2023-07-17 13:41:09 +01:00
2024-03-30 13:22:00 +01:00
2024-04-23 13:23:27 +02:00
2023-11-20 14:38:35 +01:00
2023-11-10 08:35:48 +01:00
2024-02-13 14:26:32 +01:00