Files
candle/candle-metal-kernels/src
Thomas Santerre 0067fe00a8 Metal Unary: Add benchmarks and process kernels in a tile based fashion (#2056)
* add basic unary bench for sqrt

* process unary commands in tiles of 4

* re-enable all benchmarks

* rename helper to unary

* modify approach to split up tiled and non-tiled operations

* undo bench ignore for other tests

* update tile size to 2

* only perform the optimization on the contiguous even numbered element case
2024-04-21 00:10:33 +02:00
..
2024-01-22 15:15:19 +00:00
2024-01-17 10:27:58 +01:00
2024-04-05 08:32:58 +02:00