vllm/examples/online_serving/disaggregated_serving
inkcherry 0b0c33d59e fix comments
Signed-off-by: inkcherry <mingzhi.liu@amd.com>
2025-12-22 03:10:03 +00:00
..
2025-12-22 03:10:03 +00:00

Disaggregated Serving

This example contains scripts that demonstrate the disaggregated serving features of vLLM.

Files

  • disagg_proxy_demo.py - Demonstrates XpYd (X prefill instances, Y decode instances).
  • kv_events.sh - Demonstrates KV cache event publishing.