This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2025-12-10 01:35:01 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
v1
/
attention
History
Cyrus Leung
e83b7e379c
Revert "[Renderer] Separate out
RendererConfig
from
ModelConfig
(
#30145
)" (
#30199
)
2025-12-07 00:00:22 -08:00
..
test_attention_backends_selection.py
…
test_attention_backends.py
[v1] Add real sliding window calculation to FlexAttention direct BlockMask building (
#26015
)
2025-12-01 13:12:51 +00:00
test_attention_splitting.py
[BugFix] Fix DBO assert
assert B_block_table == B_q
(
#29933
)
2025-12-04 14:48:54 -05:00
test_batch_reordering.py
…
test_chunked_local_attention.py
…
test_mla_backends.py
[Attention] Refactor FA
block_size
limitations to hybrid models only (
#29084
)
2025-11-22 06:38:44 -08:00
test_rocm_attention_backends_selection.py
[Attention] Update attention imports (
#29540
)
2025-11-27 11:19:09 -05:00
test_sparse_mla_backends.py
Add TP parameter to attention tests (
#27683
)
2025-11-03 13:04:40 -08:00
utils.py
Revert "[Renderer] Separate out
RendererConfig
from
ModelConfig
(
#30145
)" (
#30199
)
2025-12-07 00:00:22 -08:00