mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced 2025-12-10 08:34:56 +08:00
Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>
Disaggregated Serving
This example contains scripts that demonstrate the disaggregated serving features of vLLM.
Files
disagg_proxy_demo.py- Demonstrates XpYd (X prefill instances, Y decode instances).kv_events.sh- Demonstrates KV cache event publishing.