vllm/core at dc464a3d3937e30267514e1fc5b988a35dd9dbdf - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-05-27 04:27:53 +08:00

History

Jialin Ouyang 4f8c4b890a

[Core] Use KVCacheBlock as much as possible instead of dict[block_id, KVCacheBlock] (#24830 )

Signed-off-by: Jialin Ouyang <Jialin.Ouyang@gmail.com>

2025-09-23 15:11:14 -07:00

..

__init__.py

Implement Async Scheduling (#19970 )

2025-07-14 23:01:46 -07:00

test_async_scheduler.py

[Spec Decode] Make propose_draft_token_ids non-blocking for lower TTFT (#23041 )

2025-08-18 17:20:38 -07:00

test_encoder_cache_manager.py

[Multimodal] Remove legacy multimodal fields in favor of MultiModalFeatureSpec (#24548 )

2025-09-12 21:42:23 +08:00

test_kv_cache_utils.py

[Hybrid Allocator] Support full attention with different hidden size (#25101 )

2025-09-19 23:43:59 -07:00

test_prefix_caching.py

[Core] Use KVCacheBlock as much as possible instead of dict[block_id, KVCacheBlock] (#24830 )

2025-09-23 15:11:14 -07:00

test_scheduler_e2e.py

[Misc] unify variable for LLM instance (#20996 )

2025-07-21 12:18:33 +01:00

test_scheduler.py

[Chore] Cleanup guided namespace, move to structured outputs config (#22772 )

2025-09-18 09:20:27 +00:00

test_single_type_kv_cache_manager.py

[Core] Use KVCacheBlock as much as possible instead of dict[block_id, KVCacheBlock] (#24830 )

2025-09-23 15:11:14 -07:00

utils.py

[Core] Use sha256 bytes instead of BlockHash to reduce GC overhead (#23673 )

2025-09-08 21:34:37 -07:00