vllm/v1 at 9597a095f2c02670b44f5973635ce4b9852e8eab - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-05-11 12:41:27 +08:00

History

Robert Shaw 9597a095f2

[V1][Core][1/n] Logging and Metrics (#11962 )

Signed-off-by: rshaw@neuralmagic.com <rshaw@neuralmagic.com>

2025-01-12 21:02:02 +00:00

..

[Kernel] Move attn_type to Attention.__init__() (#11690 )

2025-01-07 00:11:28 +08:00

[V1][Core][1/n] Logging and Metrics (#11962 )

2025-01-12 21:02:02 +00:00

[V1][Core][1/n] Logging and Metrics (#11962 )

2025-01-12 21:02:02 +00:00

[V1] Refactor get_executor_cls (#11754 )

2025-01-06 07:59:16 +00:00

[V1][Core][1/n] Logging and Metrics (#11962 )

2025-01-12 21:02:02 +00:00

[Doc] Fix typo (#11666 )

2025-01-01 08:10:10 +00:00

[torch.compile] Hide KV cache behind torch.compile boundary (#11677 )

2025-01-10 13:14:42 +08:00

__init__.py

[V1] AsyncLLM Implementation (#9826 )

2024-11-11 23:05:38 +00:00

outputs.py

[V1] Multiprocessing Tensor Parallel Support for v1 (#9856 )

2024-12-10 06:28:14 +00:00

request.py

[V1] Extend beyond image modality and support mixed-modality inference with Llava-OneVision (#11685 )

2025-01-06 19:58:16 +00:00

serial_utils.py

[V1] Use pickle for serializing EngineCoreRequest & Add multimodal inputs to EngineCoreRequest (#10245 )

2024-11-12 08:57:14 -08:00

utils.py

[V1] Simplify Shutdown (#11659 )

2025-01-03 17:25:38 +00:00