0e2c8c17fb
UG metal integration. (#2580)
2024-10-27 15:20:37 +01:00
af2104078f
Metal commands refactoring (#2489)
* Split out the commands part of the metal device.
* Make most fields private.
* Move the allocator back.
* Rework the encoder provider type.
2024-09-21 13:18:42 +02:00
72d649058b
Hook the MLX matmul kernels in candle-core. (#2473)
2024-09-12 13:52:59 +02:00
25960676ca
Add a basic metal example with capture (#2324)
* Add some tracing.
* Get the trace to work.
2024-07-09 12:38:11 +02:00
01794dc16e
Use write rather than try-write on the metal rw-locks. (#2162)
2024-05-05 07:22:46 +02:00
f135b7963d
Fix for the batch dim in the quantized matmul example. (#2073)
* Fix for the batch dim in the quantized matmul example.
* Enable more tests on cuda.
* Add a test for qmm with a batch.
* Fix the zeros-dim test on metal.
2024-04-15 20:00:28 +02:00
665da30487
Backend refactoring. (#1966)
* Backend refactoring.
* Metal tweaks.
* Move the cudnn module.
2024-03-29 23:02:11 +01:00