vllm/source at 8e7fb5d43ae74e0a75a7da940a63c7891208d268 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-24 02:27:10 +08:00

History

Kante Yin 8e7fb5d43a

Support to serve vLLM on Kubernetes with LWS (#4829 )

Signed-off-by: kerthcet <kerthcet@gmail.com>

2024-05-16 16:37:29 -07:00

..

[Doc] add visualization for multi-stage dockerfile (#4456 )

2024-04-30 17:41:59 +00:00

[Bugfix][Doc] Fix CI failure in docs (#4804 )

2024-05-15 01:57:08 +09:00

[Doc] Add API reference for offline inference (#4710 )

2024-05-13 17:47:42 -07:00

getting_started

Unable to find Punica extension issue during source code installation (#4494 )

2024-05-01 00:42:09 +00:00

[Doc] Shorten README by removing supported model list (#4796 )

2024-05-13 16:23:54 -07:00

offline_inference

[Doc] Add API reference for offline inference (#4710 )

2024-05-13 17:47:42 -07:00

Enable scaled FP8 (e4m3fn) KV cache on ROCm (AMD GPU) (#3290 )

2024-04-03 14:15:55 -07:00

Support to serve vLLM on Kubernetes with LWS (#4829 )

2024-05-16 16:37:29 -07:00

conf.py

[CI] Disable non-lazy string operation on logging (#4326 )

2024-04-26 00:16:58 -07:00

generate_examples.py

Add example scripts to documentation (#4225 )

2024-04-22 16:36:54 +00:00

index.rst

[Doc] Add meetups to the doc (#4798 )

2024-05-13 18:48:00 -07:00