vllm/vllm/attention
Latest commit d44e9df7d4 by Shanshan Shen (2025-11-19 16:24:55 +00:00):
[Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)
Signed-off-by: shen-shanshan <467638484@qq.com>
Name          Last commit                                                                                                           Date
backends      [Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)              2025-11-19 16:24:55 +00:00
layers        [Bugfix] Fix ChunkedLocalAttention CUDA Graph setting (#28739)                                                       2025-11-14 14:14:46 -08:00
ops           [Bugfix][CI/Test][Spec Decode] Fix illegal memory access in offline_inference/spec_decode.py (Issue 27619) (#28432)  2025-11-13 22:34:01 -08:00
utils         …
__init__.py   [Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)              2025-11-19 16:24:55 +00:00
layer.py      [Bugfix] Safeguard against missing backend in AttentionBackendEnum (#28846)                                          2025-11-18 10:53:44 +00:00
selector.py   [Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)              2025-11-19 16:24:55 +00:00