This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-05-11 23:09:10 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
v1
/
core
/
sched
History
Woosuk Kwon
6825d9a998
[BugFix][Spec Decode] Improve Prefix Caching Logic in Speculative Decoding (
#18668
)
...
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2025-05-24 17:33:46 -07:00
..
__init__.py
[V1] Scheduler Refactoring [1/N] - Add Scheduler Interface (
#15250
)
2025-03-20 17:50:43 -07:00
interface.py
[P/D] NIXL Integration (
#17751
)
2025-05-12 09:46:16 -07:00
output.py
[v1] Redo "Support multiple KV cache groups in GPU model runner (
#17945
)" (
#18593
)
2025-05-23 09:39:47 -07:00
scheduler.py
[BugFix][Spec Decode] Improve Prefix Caching Logic in Speculative Decoding (
#18668
)
2025-05-24 17:33:46 -07:00
utils.py
[V1] Scheduler Refactoring [1/N] - Add Scheduler Interface (
#15250
)
2025-03-20 17:50:43 -07:00