This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-04-22 21:37:08 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
v1
/
core
History
Nicolò Lucchesi
75eb302a2e
[Bugfix] Whisper fix number of allocated CrossAttn blocks per-request (
#30772
)
...
Signed-off-by: NickLucche <nlucches@redhat.com>
2025-12-16 14:20:19 +00:00
..
sched
[Bugfix] Whisper fix number of allocated CrossAttn blocks per-request (
#30772
)
2025-12-16 14:20:19 +00:00
__init__.py
…
block_pool.py
[P/D] KV Load Failure Recovery/Abort Configuration (
#26813
)
2025-12-10 11:00:52 -08:00
encoder_cache_manager.py
[Core] Whisper Enable Encoder Batching (
#29421
)
2025-12-11 21:06:51 +00:00
kv_cache_coordinator.py
[Core][Observability] Add KV cache residency metrics (
#27793
)
2025-12-01 18:27:53 +00:00
kv_cache_manager.py
[P/D] KV Load Failure Recovery/Abort Configuration (
#26813
)
2025-12-10 11:00:52 -08:00
kv_cache_metrics.py
[Core][Observability] Add KV cache residency metrics (
#27793
)
2025-12-01 18:27:53 +00:00
kv_cache_utils.py
[Bugfix] fix confusing OOM errors during v1 init (
#28051
)
2025-12-10 23:17:41 +00:00
single_type_kv_cache_manager.py
[Hybrid Allocator] Support KV cache groups with different block_size (
#29143
)
2025-11-25 10:30:57 -05:00