candle/cuda_basics.rs at 50b0946a2dff2a65f8319ff6d798f12b2ea2a6fb - candle - Gitea: Git with a cup of tea

huggingface/candle

mirror of https://github.com/huggingface/candle.git synced 2025-06-17 02:58:50 +00:00

Files

Laurent Mazare e676f85f00 Sketch a fast cuda kernel for reduce-sum. (#109 )

* Sketch a fast cuda kernel for reduce-sum.

* Sketch the rust support code for the fast sum kernel.

* More work on the fast kernel.

* Add some testing ground.

* A couple fixes for the fast sum kernel.

2023-07-08 12:43:56 +01:00

16 lines

343 B

Rust

Raw Blame History

 #[cfg(feature = "mkl")]
 extern crate intel_mkl_src;
 use anyhow::Result;
 use candle::{Device, Tensor};
 fn main() -> Result<()> {
     let device = Device::new_cuda(0)?;
     let t = Tensor::new(&[[1f32, 2., 3., 4.2]], &device)?;
     let sum = t.sum(&[0])?;
     println!("{sum}");
     let sum = t.sum(&[1])?;
     println!("{sum}");
     Ok(())
 }