huggingface / candle
Mirror of https://github.com/huggingface/candle.git, synced 2025-06-15 02:16:37 +00:00
Branch: operators-argmin-argmax-leakyrelu
Path: candle / candle-flash-attn
History

Latest commit: 7ebc3548e1 by Laurent Mazare, 2024-05-18 19:18:59 +02:00
Use flash-attn in gemma. (#2195)
* Use flash-attn in gemma.
* Fix flash-attn for head dim 256.
| Name | Last commit | Date |
|------|-------------|------|
| cutlass @ c4f6b8c6bc | Add flash attention (#241) | 2023-07-26 07:48:10 +01:00 |
| kernels | Use flash-attn in gemma. (#2195) | 2024-05-18 19:18:59 +02:00 |
| src | Use flash-attn in gemma. (#2195) | 2024-05-18 19:18:59 +02:00 |
| tests | Flash attention without padding (varlen). (#281) | 2023-07-31 09:45:39 +01:00 |
| build.rs | Moving to a proper build crate bindgen_cuda. (#1531) | 2024-01-07 12:29:24 +01:00 |
| Cargo.toml | Bump the version number to 0.5.1. (#2155) | 2024-05-03 11:17:05 +02:00 |
| README.md | Add some missing readme files. (#304) | 2023-08-02 10:57:12 +01:00 |
README.md
candle-flash-attn
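
The crate packages CUDA flash-attention kernels for candle. As a minimal usage sketch only, assuming the crate exposes a `flash_attn(q, k, v, softmax_scale, causal)` entry point operating on `(batch, seq_len, num_heads, head_dim)` tensors (check the crate documentation for the exact signature):

```rust
// Hypothetical sketch: the flash_attn signature and tensor layout are assumptions,
// not taken from this page. Flash attention requires a CUDA device and f16/bf16 tensors.
use candle_core::{DType, Device, Result, Tensor};

fn main() -> Result<()> {
    let device = Device::new_cuda(0)?;
    let (batch, seq_len, num_heads, head_dim) = (1, 128, 8, 64);
    // Random query/key/value tensors in (batch, seq_len, num_heads, head_dim) layout.
    let q = Tensor::randn(0f32, 1.0, (batch, seq_len, num_heads, head_dim), &device)?
        .to_dtype(DType::F16)?;
    let k = q.clone();
    let v = q.clone();
    // Standard attention scaling; `true` enables the causal mask.
    let softmax_scale = 1.0 / (head_dim as f32).sqrt();
    let out = candle_flash_attn::flash_attn(&q, &k, &v, softmax_scale, true)?;
    println!("{:?}", out.dims()); // expected: [1, 128, 8, 64]
    Ok(())
}
```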