xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-01-28 14:27:15 +08:00

README.md

Using vLLM

vLLM supports the following usage patterns:

Inference and Serving: Run a single instance of a model.
Deployment: Scale up model instances for production.
Training: Train or fine-tune a model.
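
As a rough illustration of the first pattern (running a single model instance), the sketch below uses vLLM's offline `LLM` and `SamplingParams` classes. The model name `facebook/opt-125m`, the sampling values, and the helper names `collect_completions` and `run_offline_inference` are assumptions chosen for the example and do not come from this README.

```python
from typing import Iterable


def collect_completions(request_outputs: Iterable) -> dict[str, str]:
    """Map each prompt to the text of its first completion.

    Expects objects shaped like vLLM's RequestOutput: a `.prompt` string and
    an `.outputs` list whose items carry a `.text` attribute.
    """
    return {out.prompt: out.outputs[0].text for out in request_outputs}


def run_offline_inference(
    prompts: list[str],
    model: str = "facebook/opt-125m",  # assumed small model, for illustration only
) -> dict[str, str]:
    """Hypothetical helper: load one model instance and generate completions."""
    # Imported lazily so this module can be loaded without vLLM installed.
    from vllm import LLM, SamplingParams

    # Sampling values are illustrative, not recommended defaults.
    sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=32)

    # A single LLM object corresponds to a single model instance in this process.
    llm = LLM(model=model)
    return collect_completions(llm.generate(prompts, sampling_params))
```

Calling `run_offline_inference(["Hello, my name is"])` requires vLLM to be installed along with hardware it supports, and it downloads the model on first use. Scaling beyond one instance and training are covered under the Deployment and Training patterns rather than by this sketch.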