This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2025-12-09 20:28:42 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
models
/
language
History
Thomas Parnell
61f67d8acd
[V1] [Hybrid] Enable Full CUDA Graph (decode-only) for Mamba layers (
#21401
)
...
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
2025-08-09 20:16:11 -07:00
..
generation
[V1] [Hybrid] Enable Full CUDA Graph (decode-only) for Mamba layers (
#21401
)
2025-08-09 20:16:11 -07:00
pooling
[Bugfix] Fix ModernBert cuda graph capturing in v1 (
#21901
)
2025-08-08 22:17:22 -07:00
__init__.py
[CI/Build] Reorganize models tests (
#17459
)
2025-04-30 23:03:08 -07:00