Mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2026-06-08 01:42:14 +08:00)
Documentation about official docker image (#1709)
parent 06e9ebebd5
commit 37c1e3c218
@@ -3,11 +3,25 @@
 Deploying with Docker
 ============================
 
+vLLM offers an official Docker image for deployment.
+The image can be used to run an OpenAI-compatible server.
+The image is available on Docker Hub as `vllm/vllm-openai <https://hub.docker.com/r/vllm/vllm-openai/tags>`_.
+
+.. code-block:: console
+
+    $ docker run --runtime nvidia --gpus all \
+        -v ~/.cache/huggingface:/root/.cache/huggingface \
+        -p 8000:8000 \
+        --env "HUGGING_FACE_HUB_TOKEN=<secret>" \
+        vllm/vllm-openai:latest \
+        --model mistralai/Mistral-7B-v0.1
+
+
 You can build and run vLLM from source via the provided dockerfile. To build vLLM:
 
 .. code-block:: console
 
-    $ DOCKER_BUILDKIT=1 docker build . --target vllm --tag vllm --build-arg max_jobs=8
+    $ DOCKER_BUILDKIT=1 docker build . --target vllm-openai --tag vllm/vllm-openai --build-arg max_jobs=8
 
 To run vLLM:
 
@@ -17,5 +31,5 @@ To run vLLM:
         -v ~/.cache/huggingface:/root/.cache/huggingface \
         -p 8000:8000 \
         --env "HUGGING_FACE_HUB_TOKEN=<secret>" \
-        vllm <args...>
+        vllm/vllm-openai <args...>
 
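
Once a container started as in the diff above is running with ``-p 8000:8000``, the OpenAI-compatible server can be queried over HTTP. The following is a minimal sketch and not part of the commit. It assumes the server is reachable at ``http://localhost:8000``, that it exposes the ``/v1/completions`` route with an OpenAI-style response body, and that it was launched with ``--model mistralai/Mistral-7B-v0.1``. The helper names ``build_completion_request`` and ``complete`` are hypothetical and use only the Python standard library.

```python
import json
import urllib.request

# Assumption: the container publishes the server on localhost:8000 (-p 8000:8000).
BASE_URL = "http://localhost:8000"


def build_completion_request(model, prompt, max_tokens=32, base_url=BASE_URL):
    """Build (but do not send) a POST request for the completions route."""
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url=f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(model, prompt, **kwargs):
    """Send the request and return the text of the first choice.

    Assumes an OpenAI-style response: {"choices": [{"text": ...}, ...]}.
    """
    request = build_completion_request(model, prompt, **kwargs)
    with urllib.request.urlopen(request, timeout=60) as response:
        body = json.load(response)
    return body["choices"][0]["text"]
```

With the container running, calling ``complete("mistralai/Mistral-7B-v0.1", "San Francisco is a")`` should return the generated continuation as a string. Building the request separately from sending it keeps the payload easy to inspect without a live server.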