vllm/engine at fd4ea8ef5c17a8b991107402a414f6ed355d854d - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-06 10:35:40 +08:00

History

Zhuohan Li fd4ea8ef5c

Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221 )

2024-01-03 11:30:22 -08:00

..

__init__.py

Change the name to vLLM (#150 )

2023-06-17 03:07:40 -07:00

arg_utils.py

Update Help Text for --gpu-memory-utilization Argument (#2183 )

2023-12-18 11:33:24 -08:00

async_llm_engine.py

Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221 )

2024-01-03 11:30:22 -08:00

llm_engine.py

Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221 )

2024-01-03 11:30:22 -08:00

metrics.py

Add Production Metrics in Prometheus format (#1890 )

2023-12-02 16:37:44 -08:00

ray_utils.py

Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221 )

2024-01-03 11:30:22 -08:00