mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-30 13:00:05 +08:00

History

[Doc] Update the doc for log probs + prefix caching (#23399 )

Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

2025-08-22 13:20:39 +00:00

faq.md

Stop using title frontmatter and fix doc that can only be reached by search (#20623 )

2025-07-08 03:27:40 -07:00

metrics.md

Remove unnecessary explicit title anchors and use relative links instead (#20620 )

2025-07-08 02:49:13 -07:00

README.md

[Docs] Improve docs navigation (#22720 )

2025-08-12 04:25:55 -07:00

reproducibility.md

[Doc] Update reproducibility doc and example (#18741 )

2025-05-27 07:03:13 +00:00

security.md

[Docs] Switch to better markdown linting pre-commit hook (#21851 )

2025-07-29 19:45:08 -07:00

troubleshooting.md

[Frontend] Expose do_log_stats interval to env (#22905 )

2025-08-15 13:00:20 +00:00

usage_stats.md

Make distinct code and console admonitions so readers are less likely to miss them (#20585 )

2025-07-07 19:55:28 -07:00

v1_guide.md

[Doc] Update the doc for log probs + prefix caching (#23399 )

2025-08-22 13:20:39 +00:00

README.md

Using vLLM

First, vLLM must be installed for your chosen device in either a Python or Docker environment.

Then, vLLM supports the following usage patterns:

Inference and Serving: Run a single instance of a model.
Deployment: Scale up model instances for production.
Training: Train or fine-tune a model.