mirror of
https://github.com/huggingface/candle.git
synced 2025-06-18 03:28:50 +00:00
Handle transposed matrixes in cublas.
@@ -1,4 +1,5 @@
// TODO: Use a proper distributed reduction rather than atomicAdd.
// https://people.maths.ox.ac.uk/gilesm/cuda/prac4/reduction.pdf
#include "cuda_utils.cuh"
#include<stdint.h>