vllm/engine at c9fadda54353f1b57c3dae9b7cbebda6f0767f8e - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-26 22:37:33 +08:00

History

Woosuk Kwon 30fb0956df

[Minor] Add more detailed explanation on quantization argument (#2145 )

2023-12-17 01:56:16 -08:00

..

__init__.py

Change the name to vLLM (#150 )

2023-06-17 03:07:40 -07:00

arg_utils.py

[Minor] Add more detailed explanation on quantization argument (#2145 )

2023-12-17 01:56:16 -08:00

async_llm_engine.py

Fix typing in AsyncLLMEngine & add toml to requirements-dev (#2100 )

2023-12-14 00:19:41 -08:00

llm_engine.py

Remove dependency on CuPy (#2152 )

2023-12-17 01:49:07 -08:00

metrics.py

Add Production Metrics in Prometheus format (#1890 )

2023-12-02 16:37:44 -08:00

ray_utils.py

Optimize model execution with CUDA graph (#1926 )

2023-12-16 21:12:08 -08:00