vllm/core at 621ca2c0aba8268d72d380fa3e479ddafa529479 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-05 06:07:08 +08:00

History

Chen Zhang aabcd2cae3

[v1] Introduce KVCacheBlocks as interface between Scheduler and KVCacheManager (#17479 )

Signed-off-by: Chen Zhang <zhangch99@outlook.com>

2025-05-06 08:50:34 -07:00

..

[v1] Introduce KVCacheBlocks as interface between Scheduler and KVCacheManager (#17479 )

2025-05-06 08:50:34 -07:00

__init__.py

[V1] Implement vLLM V1 [1/N] (#9289 )

2024-10-22 01:24:07 -07:00

block_pool.py

[V1][Metrics] add support for kv event publishing (#16750 )

2025-04-30 07:44:45 -07:00

encoder_cache_manager.py

Enforce valid max_num_batched_tokens when disable_chunked_mm_input=True (#16447 )

2025-04-11 08:09:52 +00:00

kv_cache_manager.py

[v1] Introduce KVCacheBlocks as interface between Scheduler and KVCacheManager (#17479 )

2025-05-06 08:50:34 -07:00

kv_cache_utils.py

[Core] Prevent side-channel attacks via cache salting (#17045 )

2025-04-30 20:27:21 +08:00

specialized_manager.py

[v1][Spec Decode] Make sliding window compatible with eagle prefix caching (#17398 )

2025-04-30 18:25:53 +00:00