mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced 2025-12-12 14:25:59 +08:00
806 B
806 B
(deploying-with-kubeai)=
Deploying with KubeAI
KubeAI is a Kubernetes operator that enables you to deploy and manage AI models on Kubernetes. It provides a simple and scalable way to deploy vLLM in production. Functionality such as scale-from-zero, load based autoscaling, model caching, and much more is provided out of the box with zero external dependencies.
Please see the Installation Guides for environment specific instructions:
Once you have KubeAI installed, you can configure text generation models using vLLM.