Commit Graph

108 Commits

Author SHA1 Message Date
8ad47907f3 Add the kernels. 2023-06-30 10:26:56 +01:00
6486a6d7b2 Avoid some cast kernels. 2023-06-29 23:23:44 +01:00
ec79fc43f2 Add the bf16 cuda kernels. 2023-06-29 23:12:02 +01:00
1ea08a19cb Rerun on new files. 2023-06-29 15:59:58 +00:00
b5bdbef53a Fixing kernel cache (a bit brutal for now, but if build triggers,
rebuild ALL kernels).
2023-06-29 15:51:08 +00:00
1ce3843cab Add the relu op. 2023-06-28 09:38:54 +01:00
380d61e990 Fix two cuda bugs (matmul and where_cond). 2023-06-27 11:31:04 +01:00
d7f729fb8f Refactor the hierarchy. 2023-06-27 11:57:27 +02:00