vllm/v1 at 6550114c9cd23198f3a88094ebe3e0d2bc8cb8de - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-08 17:27:13 +08:00

[v1] Redo "Support multiple KV cache groups in GPU model runner (#17945)" (#18593)

Signed-off-by: Chen Zhang <zhangch99@outlook.com>

2025-05-23 09:39:47 -07:00

core

[v1] Redo "Support multiple KV cache groups in GPU model runner (#17945)" (#18593)

2025-05-23 09:39:47 -07:00

e2e

[V1][Spec Decode] Make eagle compatible with prefix caching. (#17137)

2025-04-27 09:29:43 -07:00

engine

[CI] don't skip fixed test_kv_cache_events() (#18183)

2025-05-14 23:17:16 -07:00

entrypoints

[V1] Structured Outputs + Thinking compatibility (#16577)

2025-05-14 15:45:24 -07:00

kv_connector

[Bugfix] Set KVTransferConfig.engine_id in post_init (#18576)

2025-05-23 02:54:42 +00:00

metrics

[Misc] Add Ray Prometheus logger to V1 (#17925)

2025-05-16 01:02:42 -07:00

sample

[Bugfix] Fix LoRA test (#18518)

2025-05-21 21:48:53 -07:00

shutdown

[V1][Frontend] Improve Shutdown And Logs (#11737)

2025-04-16 19:48:34 -07:00

spec_decode

[V1][Spec Decoding] Use model_loader.get_model() to load models (#18273)

2025-05-23 02:05:44 +00:00

structured_output

[Feature][V1] Support tool_choice: required when using Xgrammar as the StructuredOutputBackend. (#17845)

2025-05-12 23:01:31 -07:00

tpu

[TPU] Fix the test_sampler (#17820)

2025-05-08 05:51:33 -04:00

worker

[v1] Redo "Support multiple KV cache groups in GPU model runner (#17945)" (#18593)

2025-05-23 09:39:47 -07:00

__init__.py

[V1] AsyncLLM Implementation (#9826)

2024-11-11 23:05:38 +00:00

test_async_llm_dp.py

[V1][DP] More robust DP/EP dummy request coordination (#16277)

2025-04-22 19:12:15 -07:00

test_oracle.py

[CI/Build] Update bamba test model location (#18544)

2025-05-22 06:01:07 -07:00

test_serial_utils.py

[V1] Add VLLM_ALLOW_INSECURE_SERIALIZATION env var (#17490)

2025-05-08 13:34:02 +08:00

test_utils.py

Update deprecated Python 3.8 typing (#13971)

2025-03-02 17:34:51 -08:00