diff --git a/docs/source/index.rst b/docs/source/index.rst index 32929257661a..bdc541cb2d58 100644 --- a/docs/source/index.rst +++ b/docs/source/index.rst @@ -70,6 +70,7 @@ Documentation serving/distributed_serving serving/run_on_sky + serving/deploying_with_kserve serving/deploying_with_triton serving/deploying_with_docker serving/serving_with_langchain diff --git a/docs/source/serving/deploying_with_kserve.rst b/docs/source/serving/deploying_with_kserve.rst new file mode 100644 index 000000000000..7f22766e09ae --- /dev/null +++ b/docs/source/serving/deploying_with_kserve.rst @@ -0,0 +1,8 @@ +.. _deploying_with_kserve: + +Deploying with KServe +============================ + +vLLM can be deployed with `KServe `_ on Kubernetes for highly scalable distributed model serving. + +Please see `this guide `_ for more details on using vLLM with KServe.