[Docs] Update docker.md with HF_TOKEN, new model, and podman fix (#21856)

2026-07-06 16:17:16 +08:00 · 2025-07-29 22:45:41 -04:00
commit fb58e3a651
parent 76080cff79
1 changed file with 5 additions and 5 deletions
--- a/docs/deployment/docker.md
+++ b/docs/deployment/docker.md
@@ -10,23 +10,23 @@ The image can be used to run OpenAI compatible server and is available on Docker
 ```bash
 docker run --runtime nvidia --gpus all \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
-    --env "HUGGING_FACE_HUB_TOKEN=<secret>" \
+    --env "HUGGING_FACE_HUB_TOKEN=$HF_TOKEN" \
    -p 8000:8000 \
    --ipc=host \
    vllm/vllm-openai:latest \
-    --model mistralai/Mistral-7B-v0.1
+    --model Qwen/Qwen3-0.6B
 ```

 This image can also be used with other container engines such as [Podman](https://podman.io/).

 ```bash
-podman run --gpus all \
+podman run --device nvidia.com/gpu=all \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  --env "HUGGING_FACE_HUB_TOKEN=$HF_TOKEN" \
  -p 8000:8000 \
  --ipc=host \
-  vllm/vllm-openai:latest \
-  --model mistralai/Mistral-7B-v0.1
+  docker.io/vllm/vllm-openai:latest \
+  --model Qwen/Qwen3-0.6B
 ```

 You can add any other [engine-args](../configuration/engine_args.md) you need after the image tag (`vllm/vllm-openai:latest`).
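
Since the updated commands read the token from `$HF_TOKEN` in the calling shell, an unset variable would silently pass an empty token to the container. The following is a minimal sketch, not part of this commit, of a guard that assembles the `docker run` line from the docs and appends extra engine args after the image tag. The helper name `vllm_docker_cmd` is hypothetical, and `--max-model-len 4096` in the usage note is only an illustrative engine arg.

```bash
# Hypothetical helper (not part of vLLM or this commit): prints the docker
# command from the docs so it can be inspected before running it.
vllm_docker_cmd() {
    # Fail early: an unset HF_TOKEN would otherwise expand to an empty string.
    if [ -z "${HF_TOKEN:-}" ]; then
        echo "HF_TOKEN is not set" >&2
        return 1
    fi
    # Arguments passed to this function are appended after the model,
    # i.e. after the image tag, where engine args belong.
    printf '%s ' docker run --runtime nvidia --gpus all \
        -v "$HOME/.cache/huggingface:/root/.cache/huggingface" \
        --env "HUGGING_FACE_HUB_TOKEN=$HF_TOKEN" \
        -p 8000:8000 \
        --ipc=host \
        vllm/vllm-openai:latest \
        --model Qwen/Qwen3-0.6B \
        "$@"
    printf '\n'
}
```

For example, with `HF_TOKEN` set in the shell, `vllm_docker_cmd --max-model-len 4096` prints the full command with the extra flag at the end, and the function returns a non-zero status without printing a command when `HF_TOKEN` is missing. Note that the printed line contains the token value, so it should not be written to shared logs.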