vllm/attention at 040ae89c5e9d970224568a005e2b6ea0a9bd59ab - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-03-28 16:38:04 +08:00

Qiu a11f4a81e0

[Misc][PCP&DCP] relocate PCP feature check (#30050)

Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>

2025-12-11 03:36:18 -08:00

[Misc][PCP&DCP] relocate PCP feature check (#30050)

2025-12-11 03:36:18 -08:00

[Attention] Make seq_lens_cpu optional in CommonAttentionMetadata to enable true async spec-decode (#29624)

2025-12-09 17:18:10 -08:00

[Perf] Remove sync point in vit torch sdpa attn backend (#30232)

2025-12-08 07:12:42 +00:00

[Attention][UX][1/N] Add AttentionConfig and change attention env vars to CLI arguments (#26315)

2025-12-05 09:48:43 -08:00

__init__.py

[Attention] Remove imports from vllm/attention/__init__.py (#29342)

2025-11-26 10:53:15 -07:00

layer.py

[CI/Build] Make test_mha_attn.py run on correct platform only and check for flash_attn_varlen_func in layer.py (#29145)

2025-12-09 20:18:17 +00:00

selector.py

[Deprecation] Remove deprecated plugin and compilation fields for v0.13 release (#30396)

2025-12-10 19:59:35 -08:00