xinyun/vllm (mirror of https://git.datalinker.icu/vllm-project/vllm.git)
vllm/vllm/attention
Latest commit: 11857a00b0 by Matthew Bonanni <mbonanni@redhat.com>, 2025-11-20 20:24:43 -08:00
  [Attention] Add ROCM_AITER_MLA_SPARSE to attention backend registry (#29103)
backends/      [Attention] Add ROCM_AITER_MLA_SPARSE to attention backend registry (#29103)                             2025-11-20 20:24:43 -08:00
layers/        [Bugfix] Fix ChunkedLocalAttention CUDA Graph setting (#28739)                                           2025-11-14 14:14:46 -08:00
ops/           [ROCm] Add AMD GPU support on Deepseek v3.2 and SparseMLA (#26670)                                       2025-11-20 02:54:01 -08:00
utils/         [Misc] Refactor Attention kv transfer methods into decorator (#27816)                                    2025-11-12 16:05:44 +00:00
__init__.py    [Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)  2025-11-19 16:24:55 +00:00
layer.py       [Bugfix] Safeguard against missing backend in AttentionBackendEnum (#28846)                              2025-11-18 10:53:44 +00:00
selector.py    [Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)  2025-11-19 16:24:55 +00:00
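The selector.py module in this directory resolves which attention backend vLLM uses at runtime. As a minimal sketch (not taken from this tree), the backend can typically be pinned through the VLLM_ATTENTION_BACKEND environment variable before the engine is constructed; the backend name FLASH_ATTN and the model used below are illustrative assumptions, not values drawn from this listing:

```python
# Minimal sketch, assuming FLASH_ATTN is a valid backend on this hardware
# and facebook/opt-125m is just a small illustrative model. The selector
# consults VLLM_ATTENTION_BACKEND when resolving a backend, so setting it
# before engine construction pins the choice.
import os

os.environ["VLLM_ATTENTION_BACKEND"] = "FLASH_ATTN"  # assumed backend name

from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")
outputs = llm.generate(["Attention backends in vLLM"],
                       SamplingParams(max_tokens=16))
print(outputs[0].outputs[0].text)
```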