[DOC] Update production-stack.md (#26177)

Signed-off-by: Elieser Pereira <elieser.pereiraa@gmail.com>
2026-03-16 16:27:15 +08:00 · 2025-10-05 18:32:48 -03:00 · 2025-10-05 18:32:48 -03:00 · f509a20846
commit f509a20846
parent 60bc25e74c
1 changed files with 2 additions and 2 deletions
--- a/docs/deployment/integrations/production-stack.md
+++ b/docs/deployment/integrations/production-stack.md
@ -55,7 +55,7 @@ sudo kubectl port-forward svc/vllm-router-service 30080:80
 And then you can send out a query to the OpenAI-compatible API to check the available models:

 ```bash
-curl -o- http://localhost:30080/models
+curl -o- http://localhost:30080/v1/models
 ```

 ??? console "Output"
@ -78,7 +78,7 @@ curl -o- http://localhost:30080/models
 To send an actual chatting request, you can issue a curl request to the OpenAI `/completion` endpoint:

 ```bash
-curl -X POST http://localhost:30080/completions \
+curl -X POST http://localhost:30080/v1/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "facebook/opt-125m",