|
1a90f9d3a6
|
Cuda implementation for copying data around.
|
2023-06-23 11:18:29 +01:00 |
|
|
065b7a19c7
|
Stride support for unary ops.
|
2023-06-22 15:46:34 +01:00 |
|
|
5b1ab5b687
|
Support strides in affine.
|
2023-06-22 15:38:42 +01:00 |
|
|
5276755fb3
|
Add cuda support for unary ops.
|
2023-06-22 15:12:59 +01:00 |
|
|
b8f514d9c6
|
Add more binary kernels.
|
2023-06-22 14:07:02 +01:00 |
|
|
e1eb86db61
|
Add some first binary op (add).
|
2023-06-22 13:52:02 +01:00 |
|
|
83d6198009
|
Simplify the binary kernels.
|
2023-06-22 13:16:03 +01:00 |
|
|
4b1c3405e9
|
Add a couple cuda kernels from dfdx.
|
2023-06-22 12:56:29 +01:00 |
|
|
083ced4428
|
Integrate the kernels bits.
|
2023-06-22 09:59:00 +01:00 |
|