xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2026-01-03 19:34:13 +08:00
vllm/vllm/attention
Benjamin Chislett  bf3ffb61e6  [Bugfix] Fix ChunkedLocalAttention CUDA Graph setting (#28739)  2025-11-14 14:14:46 -08:00
Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
backends     [CI Failure] Fix backend selection for encoder-only models (#28534)  2025-11-13 10:11:27 -05:00
layers       [Bugfix] Fix ChunkedLocalAttention CUDA Graph setting (#28739)  2025-11-14 14:14:46 -08:00
ops          [Bugfix][CI/Test][Spec Decode] Fix illegal memory access in offline_inference/spec_decode.py (Issue 27619) (#28432)  2025-11-13 22:34:01 -08:00
utils        [Misc] Refactor Attention kv transfer methods into decorator (#27816)  2025-11-12 16:05:44 +00:00
__init__.py  …
layer.py     [CI Failure] Fix backend selection for encoder-only models (#28534)  2025-11-13 10:11:27 -05:00
selector.py  [CI Failure] Fix backend selection for encoder-only models (#28534)  2025-11-13 10:11:27 -05:00