Files
candle/candle-examples/examples
Laurent Mazare cd29c7ccd4 More ggml cuda kernels (#1977)
* Add more cuda kernels for quantized matmul.

* Add the vec-dot bits.

* Expose the quantized matmul-vec kernels.

* Also include the quantize-q8-1 kernel.

* Glue code for the q8-1 quantization.

* mm-vec product via q8-1 quantization.

* Add a test.

* Add a mm test.

* Get the test to return some sensible results.

* Also test dmmv.

* Fix the launch params.

* Allow for tweaking the force_dmmv parameter while it's experimental.
2024-04-01 00:15:48 +02:00
..
2024-01-17 10:27:58 +01:00
2024-02-09 17:36:50 +01:00
2023-11-24 15:09:14 +00:00
2024-03-13 21:41:36 +01:00
2024-02-22 10:22:03 +01:00
2024-01-17 10:27:58 +01:00
2024-04-01 00:15:48 +02:00
2024-03-28 23:24:46 +01:00
2024-01-17 10:27:58 +01:00
2024-03-09 11:21:48 +01:00
2024-02-10 16:14:50 +01:00
2024-03-21 12:54:09 +01:00