xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2026-04-03 17:57:05 +08:00)
vllm/tests/kernels/attention
Matthew Bonanni 4c23690f43 [Attention] FlashAttention ViT support, make default backend (#28763)
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
2025-11-18 20:06:21 -08:00
conftest.py
…
test_aiter_flash_attn.py
…
test_attention_selector.py
[CPU] Refactor CPU attention backend (#27954)
2025-11-12 09:43:06 +08:00
test_attention.py
…
test_cache.py
…
test_cascade_flash_attn.py
[BugFix] Fix FA3 IMA with FULL_AND_PIECEWISE and cascade attention (default) (#28702)
2025-11-14 12:19:22 +00:00
test_cpu_attn.py
[CPU] Refactor CPU attention backend (#27954)
2025-11-12 09:43:06 +08:00
test_cutlass_mla_decode.py
…
test_deepgemm_attention.py
…
test_flash_attn.py
[Attention] FlashAttention ViT support, make default backend (#28763)
2025-11-18 20:06:21 -08:00
test_flashinfer_mla_decode.py
…
test_flashinfer_trtllm_attention.py
Update Flashinfer from v0.4.1 to v0.5.2 (#27952)
2025-11-07 16:24:42 -08:00
test_flashinfer.py
…
test_flashmla_sparse.py
…
test_flashmla.py
…
test_lightning_attn.py
…
test_merge_attn_states.py
Replace torch.cuda.Event with torch.Event for better hardware compatibility (#26985)
2025-11-18 11:34:36 -08:00
test_mha_attn.py
[Attention] FlashAttention ViT support, make default backend (#28763)
2025-11-18 20:06:21 -08:00
test_mla_decode_cpu.py
…
test_pack_unpack_triton.py
…
test_prefix_prefill.py
[CI/Build] Refactor Attention backend for test_prefix_prefill from xformers to SDPA (#28424)
2025-11-12 01:09:47 +08:00
test_rocm_attention_selector.py
…
test_triton_decode_attention.py
…
test_triton_unified_attention.py
…