xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-08-03 16:37:07 +08:00

History

Thomas Parnell 5358cce5ff

[V1] [Doc] Update V1 docs for Mamba models (#20499 )

Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>

2025-07-09 01:02:41 -07:00

..

faq.md

Stop using title frontmatter and fix doc that can only be reached by search (#20623 )

2025-07-08 03:27:40 -07:00

metrics.md

Remove unnecessary explicit title anchors and use relative links instead (#20620 )

2025-07-08 02:49:13 -07:00

README.md

[Doc] Reorganize user guide (#18661 )

2025-05-24 07:25:33 -07:00

reproducibility.md

[Doc] Update reproducibility doc and example (#18741 )

2025-05-27 07:03:13 +00:00

security.md

[Docs] Fix a bullet list in usage/security.md (#19358 )

2025-06-09 13:28:51 +00:00

troubleshooting.md

Stop using title frontmatter and fix doc that can only be reached by search (#20623 )

2025-07-08 03:27:40 -07:00

usage_stats.md

Make distinct code and console admonitions so readers are less likely to miss them (#20585 )

2025-07-07 19:55:28 -07:00

v1_guide.md

[V1] [Doc] Update V1 docs for Mamba models (#20499 )

2025-07-09 01:02:41 -07:00

README.md

Using vLLM

vLLM supports the following usage patterns:

Inference and Serving: Run a single instance of a model.
Deployment: Scale up model instances for production.
Training: Train or fine-tune a model.