Commit Graph

4 Commits

Author SHA1 Message Date
4a95d34c83 Compat windows. 2023-08-10 17:46:47 +02:00
66d1c093e0 This is duplicated code on Cuda 12.2.
Without it we can compile for 52 (but I get Operation Not supported
when actually trying to use those kernels).
2023-08-10 09:20:18 +02:00
ec79fc43f2 Add the bf16 cuda kernels. 2023-06-29 23:12:02 +01:00
d7f729fb8f Refactor the hierarchy. 2023-06-27 11:57:27 +02:00