vllm/attention at b7b6396584ab5565d3c2cbe1d2257fc4d0718599 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-05-03 10:44:33 +08:00

History

zejunchen-zejun d52c5096d7

[Bugfix] fix the alias bug of AttentionBackendEnum when register CUSTOM attention backend to vllm (#30869 )

Signed-off-by: zejunchen-zejun <zejun.chen@amd.com>

2025-12-20 09:03:35 +08:00

..

[Bugfix] fix the alias bug of AttentionBackendEnum when register CUSTOM attention backend to vllm (#30869 )

2025-12-20 09:03:35 +08:00

[MM Encoder]: Migrate legacy ViT MultiHeadAttention to new MMEncoderAttention interface (#30684 )

2025-12-19 02:04:19 +08:00

[Bugfix] [Kernel] Triton attention kernels: mask out V blocks that fall outside sliding window (#30887 )

2025-12-19 21:39:54 +08:00

[Attention][UX][1/N] Add AttentionConfig and change attention env vars to CLI arguments (#26315 )

2025-12-05 09:48:43 -08:00

__init__.py

[Attention] Remove imports from vllm/attention/__init__.py (#29342 )

2025-11-26 10:53:15 -07:00

layer.py

[MM Encoder]: Migrate legacy ViT MultiHeadAttention to new MMEncoderAttention interface (#30684 )

2025-12-19 02:04:19 +08:00

selector.py

[Platform] Refactor Platform attention backend selection to avoid breakpoint for OOT platform (#30212 )

2025-12-15 17:36:07 +00:00