Laurent Mazare
0ace420e66
Flash attention without padding (varlen). (#281)
* Expose the seqlen variable for flash-attn without padding.
* Fix the batched call.
* Adapt for the varlen variant.
* No need to set the batch strides when in varlen mode.
* Add a test (disabled at the moment).
* Get the test to work properly.
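
As a rough illustration of what "without padding" means in the varlen setting, the sketch below packs variable-length sequences back to back and describes their boundaries with a cumulative sequence-length array. It assumes the convention used by upstream flash-attention's varlen interface (offsets starting at 0, one more entry than there are sequences, plus a maximum sequence length). The helper names `cumulative_seqlens`, `pack_sequences`, and `unpack_sequence` are hypothetical and are not taken from this commit.

```python
from itertools import accumulate


def cumulative_seqlens(seqlens):
    """Return offsets [0, l0, l0 + l1, ...] for sequences stored back to back."""
    return [0] + list(accumulate(seqlens))


def pack_sequences(sequences):
    """Concatenate variable-length token lists without any padding.

    Returns the packed tokens, the cumulative sequence lengths, and the
    longest sequence length (0 if there are no sequences).
    """
    packed = [tok for seq in sequences for tok in seq]
    seqlens = [len(seq) for seq in sequences]
    return packed, cumulative_seqlens(seqlens), max(seqlens, default=0)


def unpack_sequence(packed, cu_seqlens, i):
    """Recover sequence i from the packed buffer using its two offsets."""
    return packed[cu_seqlens[i]:cu_seqlens[i + 1]]
```

Because every sequence is located through its offsets rather than through a fixed per-batch stride, a packed layout like this has no batch dimension to stride over, which is consistent with the note above about batch strides in varlen mode.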
2023-07-31 09:45:39 +01:00