mirror of https://github.com/huggingface/candle.git synced 2025-06-16 10:38:54 +00:00

Files

Laurent Mazare eb26e2467e Add the cuda dequantize f16 kernels. (#2137 )

* Add the cuda dequantize f16 kernels.

* Expose the cuda kernels.

* Add some testing + fix.

* Test the other cases too.

* A few more tests.

* Add an environment variable to enable the dequantize f16 + matmul behavior.

2024-04-28 20:05:05 +02:00

src

Add the cuda dequantize f16 kernels. (#2137 )

2024-04-28 20:05:05 +02:00

build.rs

Ensure that the kernels get rebuilt on cuh changes. (#1954 )

2024-03-28 06:56:48 +01:00

Cargo.toml

Bumping the version number to 0.5.0. (#2009 )

2024-04-04 17:48:45 +02:00

README.md

Revert "Add the layer norm files. (#222 )" (#223 )

2023-07-22 16:51:11 +01:00

README.md

candle-kernels

This crate contains CUDA kernels used from candle. Some of these implementations come from the dfdx crate.