xinyun/vllm (mirror of https://git.datalinker.icu/vllm-project/vllm.git)
vllm/vllm/attention
Latest commit: 11857a00b0 by Matthew Bonanni <mbonanni@redhat.com>, 2025-11-20 20:24:43 -08:00
  [Attention] Add ROCM_AITER_MLA_SPARSE to attention backend registry (#29103)
backends/      [Attention] Add ROCM_AITER_MLA_SPARSE to attention backend registry (#29103)                             2025-11-20 20:24:43 -08:00
layers/        [Bugfix] Fix ChunkedLocalAttention CUDA Graph setting (#28739)                                           2025-11-14 14:14:46 -08:00
ops/           [ROCm] Add AMD GPU support on Deepseek v3.2 and SparseMLA (#26670)                                       2025-11-20 02:54:01 -08:00
utils/         [Misc] Refactor Attention kv transfer methods into decorator (#27816)                                    2025-11-12 16:05:44 +00:00
__init__.py    [Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)  2025-11-19 16:24:55 +00:00
layer.py       [Bugfix] Safeguard against missing backend in AttentionBackendEnum (#28846)                              2025-11-18 10:53:44 +00:00
selector.py    [Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)  2025-11-19 16:24:55 +00:00
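The selector.py module in this directory resolves which attention backend vLLM uses at runtime. As a minimal sketch (not taken from this tree), the backend can typically be pinned through the VLLM_ATTENTION_BACKEND environment variable before the engine is constructed; the backend name FLASH_ATTN and the model used below are illustrative assumptions, not values drawn from this listing:

```python
# Minimal sketch, assuming FLASH_ATTN is a valid backend on this hardware
# and facebook/opt-125m is just a small illustrative model. The selector
# consults VLLM_ATTENTION_BACKEND when resolving a backend, so setting it
# before engine construction pins the choice.
import os

os.environ["VLLM_ATTENTION_BACKEND"] = "FLASH_ATTN"  # assumed backend name

from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")
outputs = llm.generate(["Attention backends in vLLM"],
                       SamplingParams(max_tokens=16))
print(outputs[0].outputs[0].text)
```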