From 77c95f72f78e4edefc75cfe42c974e85836ea8f5 Mon Sep 17 00:00:00 2001 From: abhisheksheth28 Date: Mon, 6 Oct 2025 02:30:03 -0700 Subject: [PATCH] [Doc] add KAITO to integrations (#25521) Signed-off-by: "Abhishek Sheth" --- docs/deployment/integrations/kaito.md | 5 +++++ docs/deployment/k8s.md | 1 + 2 files changed, 6 insertions(+) create mode 100644 docs/deployment/integrations/kaito.md diff --git a/docs/deployment/integrations/kaito.md b/docs/deployment/integrations/kaito.md new file mode 100644 index 0000000000000..ff050d3eeaf47 --- /dev/null +++ b/docs/deployment/integrations/kaito.md @@ -0,0 +1,5 @@ +# KAITO + +[KAITO](https://kaito-project.github.io/kaito/docs/) is a Kubernetes operator that supports deploying and serving LLMs with vLLM. It offers managing large models via container images with built-in OpenAI-compatible inference, auto-provisioning GPU nodes and curated model presets. + +Please refer to [quick start](https://kaito-project.github.io/kaito/docs/quick-start) for more details. diff --git a/docs/deployment/k8s.md b/docs/deployment/k8s.md index ca23e0b9fd8af..d3fda7eb6fb6e 100644 --- a/docs/deployment/k8s.md +++ b/docs/deployment/k8s.md @@ -12,6 +12,7 @@ Alternatively, you can deploy vLLM to Kubernetes using any of the following: - [Helm](frameworks/helm.md) - [InftyAI/llmaz](integrations/llmaz.md) +- [KAITO](integrations/kaito.md) - [KServe](integrations/kserve.md) - [KubeRay](integrations/kuberay.md) - [kubernetes-sigs/lws](frameworks/lws.md)