xinyun / vllm
mirror of https://git.datalinker.icu/vllm-project/vllm.git
vllm / vllm / attention
Latest commit: d9d342d214 by Pleaplusone
[Performance][MLA][ROCm] Remove redundant D2D copy in deepseek (#27457)
Signed-off-by: ganyi <ygan@amd.com>
2025-11-26 12:45:28 +08:00
Name          Last commit  Last updated
..
backends      [Core] Deprecate xformers (#29262)  2025-11-24 04:18:55 +00:00
layers        [Core] Generalize Encoder-Decoder seq_lens computation to avoid Whisper hardcoded logic (#29268)  2025-11-25 11:32:11 +00:00
ops           [Performance][MLA][ROCm] Remove redundant D2D copy in deepseek (#27457)  2025-11-26 12:45:28 +08:00
utils         [Misc] Refactor Attention kv transfer methods into decorator (#27816)  2025-11-12 16:05:44 +00:00
__init__.py   [Model][Mamba] Add selector for mamba attention backend and make it pluggable for other device (#26487)  2025-11-19 16:24:55 +00:00
layer.py      [Core] Deprecate xformers (#29262)  2025-11-24 04:18:55 +00:00
selector.py   [Core] Deprecate xformers (#29262)  2025-11-24 04:18:55 +00:00