vllm/attention at 408cf42f67dbcd50027fcd0f6ba35df83ced9107 - vllm - 丝路新云 (Silk Road New Cloud) code repository

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-04-01 17:07:05 +08:00

Cyrus Leung e83b7e379c

Revert "[Renderer] Separate out RendererConfig from ModelConfig (#30145)" (#30199)

2025-12-07 00:00:22 -08:00

test_attention_backends_selection.py

…

test_attention_backends.py

[v1] Add real sliding window calculation to FlexAttention direct BlockMask building (#26015)

2025-12-01 13:12:51 +00:00

test_attention_splitting.py

[BugFix] Fix DBO assert assert B_block_table == B_q (#29933)

2025-12-04 14:48:54 -05:00

test_batch_reordering.py

…

test_chunked_local_attention.py

…

test_mla_backends.py

[Attention] Refactor FA block_size limitations to hybrid models only (#29084)

2025-11-22 06:38:44 -08:00

test_rocm_attention_backends_selection.py

[Attention] Update attention imports (#29540)

2025-11-27 11:19:09 -05:00

test_sparse_mla_backends.py

Add TP parameter to attention tests (#27683)

2025-11-03 13:04:40 -08:00

utils.py

Revert "[Renderer] Separate out RendererConfig from ModelConfig (#30145)" (#30199)

2025-12-07 00:00:22 -08:00