huggingface / candle
Mirror of https://github.com/huggingface/candle.git, synced 2025-06-15 10:26:33 +00:00.
candle / candle-flash-attn, at commit abc4f698c5afd695c16bb495121b04bc011dce83
History

Latest commit 75629981bc by OlivierDehaene (2023-10-16 15:37:38 +01:00):
feat: parse Cuda compute cap from env (#1066)
* feat: add support for multiple compute caps
* Revert to one compute cap
* fmt
* fix
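
Commit #1066 lets the target CUDA compute capability be supplied through an environment variable at build time rather than hard-coding it. Below is a minimal build.rs sketch of that pattern; the CUDA_COMPUTE_CAP variable name and the sm_75 fallback are illustrative assumptions, and the actual build script additionally compiles the CUDA kernel sources with nvcc.

```rust
// Minimal build.rs sketch: pick the CUDA compute capability from the
// environment instead of hard-coding it. CUDA_COMPUTE_CAP and the
// sm_75 fallback are assumptions for illustration only.
fn main() {
    // Rebuild when the variable changes between cargo invocations.
    println!("cargo:rerun-if-env-changed=CUDA_COMPUTE_CAP");

    let compute_cap: u32 = std::env::var("CUDA_COMPUTE_CAP")
        .ok()
        .and_then(|v| v.trim().parse().ok())
        .unwrap_or(75); // e.g. 75 = Turing, 80 = Ampere, 90 = Hopper

    // A real script would forward this to nvcc, roughly:
    //   nvcc --gpu-architecture=sm_<compute_cap> ...
    println!("cargo:warning=flash-attn kernels targeting sm_{compute_cap}");
}
```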
Directory contents:

| Entry                | Last commit                                      | Date                       |
|----------------------|--------------------------------------------------|----------------------------|
| cutlass @ c4f6b8c6bc | Add flash attention (#241)                       | 2023-07-26 07:48:10 +01:00 |
| kernels              | Add back the bf16 flash-attn kernels. (#730)     | 2023-09-04 07:50:52 +01:00 |
| src                  | Properly set the is_bf16 flag. (#738)            | 2023-09-04 16:45:26 +01:00 |
| tests                | Flash attention without padding (varlen). (#281) | 2023-07-31 09:45:39 +01:00 |
| build.rs             | feat: parse Cuda compute cap from env (#1066)    | 2023-10-16 15:37:38 +01:00 |
| Cargo.toml           | Bump the version to 0.3.0. (#1014)               | 2023-10-01 13:51:57 +01:00 |
| README.md            | Add some missing readme files. (#304)            | 2023-08-02 10:57:12 +01:00 |
README.md
candle-flash-attn
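
The crate wraps the flash-attention CUDA kernels (built against the vendored cutlass submodule) behind a small Rust API. Here is a minimal usage sketch, under the assumption that the crate exposes flash_attn(q, k, v, softmax_scale, causal) over half-precision tensors in (batch, seqlen, num_heads, head_dim) layout; verify the exact signature against the crate docs. Tensor creation below uses candle-core, which the workspace may import under the shorter name candle.

```rust
// Usage sketch for candle-flash-attn. Assumed API: flash_attn(&q, &k, &v,
// softmax_scale, causal) over f16/bf16 CUDA tensors shaped
// (batch, seqlen, num_heads, head_dim); confirm against the crate docs.
use candle_core::{DType, Device, Result, Tensor};

fn main() -> Result<()> {
    let device = Device::new_cuda(0)?; // the kernels are CUDA-only
    let (b, s, h, d) = (1, 128, 8, 64);

    // Random Q, K, V in f16, as the kernels expect half precision.
    let q = Tensor::randn(0f32, 1.0, (b, s, h, d), &device)?.to_dtype(DType::F16)?;
    let k = Tensor::randn(0f32, 1.0, (b, s, h, d), &device)?.to_dtype(DType::F16)?;
    let v = Tensor::randn(0f32, 1.0, (b, s, h, d), &device)?.to_dtype(DType::F16)?;

    // Fused scaled-dot-product attention with a causal mask.
    let scale = 1.0 / (d as f32).sqrt();
    let out = candle_flash_attn::flash_attn(&q, &k, &v, scale, true)?;
    println!("{:?}", out.dims()); // expected: [1, 128, 8, 64]
    Ok(())
}
```

Per the tests entry in the listing above, a padding-free "varlen" variant was also added (#281) for packed batches of unequal sequence lengths.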