|
ef33df7ae2
|
No need for the even constraint on vecdot-q40-q80. (#1202)
|
2023-10-28 07:23:59 +01:00 |
|
|
dac73edb34
|
AVX optimized q8k vecdot. (#1024)
|
2023-10-03 12:10:58 +01:00 |
|
|
9b25113393
|
Small cleanups (avoid some possible mutations) (#670)
* More mut cleanup.
* Factor out some common bits.
|
2023-08-30 08:54:00 +01:00 |
|
|
ee8bb1bde1
|
Add avx implementations of q2k, q3k and q5k vec-dot functions (#654)
* `q2k` avx implementation
* `q3k` avx implementation
* `q5k` avx implementation
* `avx` make masks constant
* clippy stuff
|
2023-08-29 13:35:56 +01:00 |
|
|
4b8d57ba15
|
AVX version of the q4k vecdot. (#651)
|
2023-08-29 09:41:17 +01:00 |
|
|
afc10a3232
|
AVX version for the q8-0 multiplications. (#598)
|
2023-08-25 10:14:49 +01:00 |
|
|
fc81af1712
|
AVX version of the q6k vec-dot. (#493)
* AVX version of the q6k vec-dot.
* Use the avx sum.
|
2023-08-17 20:13:18 +01:00 |
|
|
d99cac3ec3
|
Move the avx specific bits to a separate file. (#481)
|
2023-08-17 09:01:06 +01:00 |
|