vllm / tests / kernels / attention
Latest commit: 58b11b24a6 by elvischenv, 2025-07-29 10:34:00 -04:00
[Bugfix] Fix workspace buffer None issue for Flashinfer TRTLLM Backend (#21525)
Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>
conftest.py: …
test_aiter_flash_attn.py: [ROCm][AITER] Enable fp8 kv cache on rocm aiter backend. (#20295), 2025-07-25 06:50:21 -07:00
test_attention_selector.py: [V1] Support any head size for FlexAttention backend (#20467), 2025-07-06 09:54:36 -07:00
test_attention.py: …
test_cache.py: …
test_cascade_flash_attn.py: …
test_encoder_decoder_attn.py: …
test_flash_attn.py: …
test_flashinfer_trtllm_decode_attention.py: [Bugfix] Fix workspace buffer None issue for Flashinfer TRTLLM Backend (#21525), 2025-07-29 10:34:00 -04:00
test_flashinfer.py: [Misc] Add sliding window to flashinfer test (#21282), 2025-07-21 08:37:49 -07:00
test_flashmla.py: …
test_lightning_attn.py: …
test_merge_attn_states.py: …
test_mha_attn.py: …
test_mla_decode_cpu.py: …
test_prefix_prefill.py: …
test_rocm_attention_selector.py: [V0 Deprecation] Deprecate BlockSparse Attention & Phi3-Small (#21217), 2025-07-19 13:53:17 -07:00
test_triton_decode_attention.py: …
test_triton_unified_attention.py: …