mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced 2025-12-09 17:05:37 +08:00
9 lines
274 B
Markdown
9 lines
274 B
Markdown
# Disaggregated Serving
|
|
|
|
This example contains scripts that demonstrate the disaggregated serving features of vLLM.
|
|
|
|
## Files
|
|
|
|
- `disagg_proxy_demo.py` - Demonstrates XpYd (X prefill instances, Y decode instances).
|
|
- `kv_events.sh` - Demonstrates KV cache event publishing.
|