xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2025-12-09 01:35:01 +08:00

Cyrus Leung 1cb194a018

[Doc] Reorganize user guide (#18661 )

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

2025-05-24 07:25:33 -07:00

298 B

Raw Blame History

Using vLLM

vLLM supports the following usage patterns:

Inference and Serving: Run a single instance of a model.
Deployment: Scale up model instances for production.
Training: Train or fine-tune a model.