Skip to content

Commit

Permalink
(temp workaround for Triton bug) (#306)
Browse files Browse the repository at this point in the history
Triton's autotuner had a bug (fixed in
triton-lang/triton@184fb53 )
that we happen to trigger - this PR is a temp. workaround before we
patch/update Triton.
  • Loading branch information
ilia-cher authored Dec 5, 2024
1 parent 68fdfc2 commit ccdb5b8
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion vllm/attention/ops/triton_flash_attention.py
Original file line number Diff line number Diff line change
Expand Up @@ -367,12 +367,12 @@ def attn_fwd(
IS_CAUSAL: tl.constexpr,
BLOCK_M: tl.constexpr,
BLOCK_DMODEL: tl.constexpr,
USE_FP8: tl.constexpr,
BLOCK_N: tl.constexpr,
PRE_LOAD_V: tl.constexpr,
BIAS_TYPE: tl.constexpr,
ENABLE_DROPOUT: tl.constexpr,
RETURN_ENCODED_SOFTMAX: tl.constexpr,
USE_FP8: tl.constexpr,
):
start_m = tl.program_id(0)
off_h_q = tl.program_id(1)
Expand Down

0 comments on commit ccdb5b8

Please sign in to comment.