mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced 2025-12-15 03:25:01 +08:00
Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>
11 lines
661 B
Markdown
11 lines
661 B
Markdown
# Disaggregated Prefill V1
|
|
|
|
This example contains scripts that demonstrate disaggregated prefill in the offline setting of vLLM.
|
|
|
|
## Files
|
|
|
|
- `run.sh` - A helper script that will run `prefill_example.py` and `decode_example.py` sequentially.
|
|
- Make sure you are in the `examples/offline_inference/disaggregated-prefill-v1` directory before running `run.sh`.
|
|
- `prefill_example.py` - A script which performs prefill only, saving the KV state to the `local_storage` directory and the prompts to `output.txt`.
|
|
- `decode_example.py` - A script which performs decode only, loading the KV state from the `local_storage` directory and the prompts from `output.txt`.
|