This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-05-12 09:01:37 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
v1
/
core
/
sched
History
Nicolò Lucchesi
75eb302a2e
[Bugfix] Whisper fix number of allocated CrossAttn blocks per-request (
#30772
)
...
Signed-off-by: NickLucche <nlucches@redhat.com>
2025-12-16 14:20:19 +00:00
..
__init__.py
…
async_scheduler.py
[Core] Support reseting all running requests' KV while calling
reset_prefix_cache
(
#28827
)
2025-12-02 02:25:05 +00:00
interface.py
[KVConnector][Feature] Support KV connector cache reset via /reset_prefix_cache (
#27170
)
2025-12-05 18:33:26 +00:00
output.py
[BugFix] Fix ValueError in NewRequestData repr methods (
#29392
)
2025-11-28 13:42:30 +08:00
request_queue.py
[Bugfix] fix --scheduling-policy=priority & n>1 crashes engine (
#29764
)
2025-12-02 22:42:28 +00:00
scheduler.py
[Bugfix] Whisper fix number of allocated CrossAttn blocks per-request (
#30772
)
2025-12-16 14:20:19 +00:00
utils.py
[Scheduer] Simplify stop checking for pooling models (
#30591
)
2025-12-13 09:45:26 +00:00