xinyun / vllm
mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2026-01-06 19:04:03 +08:00)
vllm / vllm / attention

History
Latest commit: 11857a00b0 by Matthew Bonanni
[Attention] Add ROCM_AITER_MLA_SPARSE to attention backend registry (#29103)
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
2025-11-20 20:24:43 -08:00

..
backends       [Attention] Add ROCM_AITER_MLA_SPARSE to attention backend registry (#29103)   2025-11-20 20:24:43 -08:00
layers         [Bugfix] Fix ChunkedLocalAttention CUDA Graph setting (#28739)   2025-11-14 14:14:46 -08:00
ops            [ROCm] Add AMD GPU support on Deepseek v3.2 and SparseMLA (#26670)   2025-11-20 02:54:01 -08:00
utils          [Misc] Refactor Attention kv transfer methods into decorator (#27816)   2025-11-12 16:05:44 +00:00
__init__.py    [Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)   2025-11-19 16:24:55 +00:00
layer.py       [Bugfix] Safeguard against missing backend in AttentionBackendEnum (#28846)   2025-11-18 10:53:44 +00:00
selector.py    [Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)   2025-11-19 16:24:55 +00:00
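The file and commit names above revolve around an attention backend registry keyed by an enum (AttentionBackendEnum), with selector.py choosing a backend and a safeguard for missing registry entries (#28846). The snippet below is a minimal, generic sketch of that enum-plus-registry selector pattern, not vLLM's actual implementation: every identifier in it (the enum members, _BACKEND_REGISTRY, select_attn_backend, the backend classes) is hypothetical and only mirrors what the listing suggests.

```python
# Hypothetical sketch of an enum-keyed attention backend registry with a
# selector, loosely mirroring what selector.py and the commit titles
# (AttentionBackendEnum, "attention backend registry") describe.
# None of these names are taken from vLLM's source.
from enum import Enum, auto


class AttentionBackendEnum(Enum):
    """Hypothetical registry keys for attention backends."""
    FLASH_ATTN = auto()
    ROCM_AITER_MLA_SPARSE = auto()


class FlashAttnBackend:
    name = "flash-attn"


class RocmAiterMlaSparseBackend:
    name = "rocm-aiter-mla-sparse"


# Registry mapping enum members to backend classes; adding a backend means
# adding an enum member and one entry here.
_BACKEND_REGISTRY = {
    AttentionBackendEnum.FLASH_ATTN: FlashAttnBackend,
    AttentionBackendEnum.ROCM_AITER_MLA_SPARSE: RocmAiterMlaSparseBackend,
}


def select_attn_backend(choice: AttentionBackendEnum):
    """Return the backend class for `choice`, raising a clear error when the
    enum member has no registered backend (the failure mode #28846's title
    refers to)."""
    backend = _BACKEND_REGISTRY.get(choice)
    if backend is None:
        raise ValueError(f"Attention backend {choice} is not registered")
    return backend


if __name__ == "__main__":
    # Usage example: resolve a backend by enum member.
    print(select_attn_backend(AttentionBackendEnum.ROCM_AITER_MLA_SPARSE).name)
```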