Mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2026-05-23 08:57:53 +08:00)
[Misc] Update benchmark_prefix_caching.py fixed example usage (#11920)
Signed-off-by: Ren MinMin <renmm6@chinaunicom.cn>
Co-authored-by: Ren MinMin <renmm6@chinaunicom.cn>
parent 46fa98ccad
commit 8a579408f3
@@ -10,7 +10,8 @@ Fixed example usage:
 --model meta-llama/Llama-2-7b-chat-hf \
 --enable-prefix-caching \
 --num-prompts 1 \
---repeat-count 100
+--repeat-count 100 \
+--input-length-range 128:256
 
 ShareGPT example usage:
 # This command samples 20 prompts with input lengths
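The change above adds `--input-length-range 128:256` to the fixed example. As a rough illustration of how a `min:max` value like this can be handled with `argparse`, here is a minimal sketch. It is not the code in benchmark_prefix_caching.py: the helper `parse_length_range` is a hypothetical name, and making the flag required and validating the bounds are assumptions made for the example.

```python
import argparse
from typing import Tuple


def parse_length_range(text: str) -> Tuple[int, int]:
    """Parse a "min:max" string such as "128:256" into a pair of ints.

    Hypothetical helper for illustration only.
    """
    parts = text.split(":")
    if len(parts) != 2:
        raise argparse.ArgumentTypeError(f"expected min:max, got {text!r}")
    try:
        low, high = int(parts[0]), int(parts[1])
    except ValueError:
        raise argparse.ArgumentTypeError(
            f"bounds must be integers, got {text!r}"
        ) from None
    if low <= 0 or low > high:
        raise argparse.ArgumentTypeError(
            f"need 0 < min <= max, got {text!r}"
        )
    return low, high


parser = argparse.ArgumentParser()
parser.add_argument("--repeat-count", type=int, default=1)
# Assumption: the flag is treated as required in this sketch.
parser.add_argument(
    "--input-length-range", type=parse_length_range, required=True
)

# Same two flags as the tail of the fixed example usage.
args = parser.parse_args(
    ["--repeat-count", "100", "--input-length-range", "128:256"]
)
print(args.input_length_range)  # (128, 256)
```

With a required flag like this, omitting `--input-length-range` makes `argparse` exit with a usage error, which is the kind of failure that a complete example command avoids.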