Fix for the batch dim in the quantized matmul example. (#2073)

* Fix for the batch dim in the quantized matmul example.

* Enable more tests on cuda.

* Add a test for qmm with a batch.

* Fix the zeros-dim test on metal.
Laurent Mazare
2024-04-15 20:00:28 +02:00
committed by GitHub
parent af955f260c
commit f135b7963d
3 changed files with 38 additions and 38 deletions

@@ -464,7 +464,7 @@ impl QCudaStorage {
 /* x_rows */ n,
 /* x_cols */ k,
 /* y_rows */ k,
-/* y_cols */ m,
+/* y_cols */ b * m,
 self.device(),
 )?
 };
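
The hunk above changes `y_cols` from `m` to `b * m`. A plausible reading is that the input carries a leading batch dimension that gets folded into the column count, so one 2D product covers every batch. The sketch below illustrates that idea on the CPU with plain `f32` slices. It is not the CUDA kernel from this commit: the function names `matmul_2d` and `matmul_batched`, the row-major layout, and the contiguous `(b, m, k)` input are all assumptions made for illustration.

```rust
/// Hypothetical CPU reference, not the kernel from the commit.
/// `w` is an (n, k) row-major weight matrix and `y` holds `cols` input
/// vectors of length `k`, stored one after another. The result holds
/// `cols` output vectors of length `n`.
fn matmul_2d(w: &[f32], y: &[f32], n: usize, k: usize, cols: usize) -> Vec<f32> {
    assert_eq!(w.len(), n * k);
    assert_eq!(y.len(), cols * k);
    let mut out = vec![0f32; cols * n];
    for c in 0..cols {
        for r in 0..n {
            let mut acc = 0f32;
            for i in 0..k {
                acc += w[r * k + i] * y[c * k + i];
            }
            out[c * n + r] = acc;
        }
    }
    out
}

/// Same product computed one batch at a time over an input assumed to be
/// a contiguous (b, m, k) buffer.
fn matmul_batched(w: &[f32], y: &[f32], n: usize, k: usize, b: usize, m: usize) -> Vec<f32> {
    let mut out = Vec::with_capacity(b * m * n);
    for bi in 0..b {
        // Each batch is a contiguous block of m vectors of length k.
        let batch = &y[bi * m * k..(bi + 1) * m * k];
        out.extend(matmul_2d(w, batch, n, k, m));
    }
    out
}
```

Under these assumptions, `matmul_batched(w, y, n, k, b, m)` and `matmul_2d(w, y, n, k, b * m)` return the same buffer, whereas a call that passes only `m` as the column count covers just the first batch.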