xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2026-04-12 04:07:04 +08:00)
vllm/vllm/v1
Latest commit: be39e3cd18 by youkaichao, 2024-12-13 06:57:50 +00:00
[core] clean up cudagraph batchsize padding logic (#10996)
Signed-off-by: youkaichao <youkaichao@gmail.com>
Name               Last updated                Last commit
attention/         2024-12-10 12:40:52 -08:00  [torch.compile] add a flag to track batchsize statistics (#11059)
core/              2024-12-13 06:30:06 +00:00  [Bugfix][V1] Fix 'NoneType' object has no attribute 'hash_value' (#11157)
engine/            2024-12-12 15:51:53 +00:00  [V1] Fix torch profiling for offline inference (#11125)
executor/          2024-12-11 19:12:24 -08:00  [Core] cleanup zmq ipc sockets on exit (#11115)
sample/            2024-12-10 06:28:14 +00:00  [V1] Multiprocessing Tensor Parallel Support for v1 (#9856)
worker/            2024-12-13 06:57:50 +00:00  [core] clean up cudagraph batchsize padding logic (#10996)
__init__.py        2024-11-11 23:05:38 +00:00  [V1] AsyncLLM Implementation (#9826)
outputs.py         2024-12-10 06:28:14 +00:00  [V1] Multiprocessing Tensor Parallel Support for v1 (#9856)
request.py         2024-12-03 10:33:10 +00:00  [V1] VLM - Run the mm_mapper preprocessor in the frontend process (#10640)
serial_utils.py    2024-11-12 08:57:14 -08:00  [V1] Use pickle for serializing EngineCoreRequest & Add multimodal inputs to EngineCoreRequest (#10245)
utils.py           2024-12-12 00:55:30 +00:00  [V1] VLM preprocessor hashing (#11020)