mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced 2025-12-15 01:45:02 +08:00
31 lines
1.2 KiB
Markdown
31 lines
1.2 KiB
Markdown
---
|
|
toc_depth: 3
|
|
---
|
|
|
|
# Engine Arguments
|
|
|
|
Engine arguments control the behavior of the vLLM engine.
|
|
|
|
- For [offline inference](../serving/offline_inference.md), they are part of the arguments to [LLM][vllm.LLM] class.
|
|
- For [online serving](../serving/openai_compatible_server.md), they are part of the arguments to `vllm serve`.
|
|
|
|
The engine argument classes, [EngineArgs][vllm.engine.arg_utils.EngineArgs] and [AsyncEngineArgs][vllm.engine.arg_utils.AsyncEngineArgs], are a combination of the configuration classes defined in [vllm.config][]. Therefore, if you are interested in developer documentation, we recommend looking at these configuration classes as they are the source of truth for types, defaults and docstrings.
|
|
|
|
When passing JSON CLI arguments, the following sets of arguments are equivalent:
|
|
|
|
- `--json-arg '{"key1": "value1", "key2": {"key3": "value2"}}'`
|
|
- `--json-arg.key1 value1 --json-arg.key2.key3 value2`
|
|
|
|
Additionally, list elements can be passed individually using `+`:
|
|
|
|
- `--json-arg '{"key4": ["value3", "value4", "value5"]}'`
|
|
- `--json-arg.key4+ value3 --json-arg.key4+='value4,value5'`
|
|
|
|
## `EngineArgs`
|
|
|
|
--8<-- "docs/argparse/engine_args.md"
|
|
|
|
## `AsyncEngineArgs`
|
|
|
|
--8<-- "docs/argparse/async_engine_args.md"
|