vllm/modal.md at 3d19d47d91b1d06a24a1bd7b6f0626a09cc18dce - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2025-12-15 11:36:20 +08:00

Harry Mellor a1fe24d961

Migrate docs from Sphinx to MkDocs (#18145 )

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

2025-05-23 02:09:53 -07:00

9 lines

316 B

Markdown

Raw Blame History

 ---
 title: Modal
 ---
 [](){ #deployment-modal }
 vLLM can be run on cloud GPUs with [Modal](https://modal.com), a serverless computing platform designed for fast auto-scaling.
 For details on how to deploy vLLM on Modal, see [this tutorial in the Modal documentation](https://modal.com/docs/examples/vllm_inference).