vllm/distributed at 696b01af8fac1819b2409cc0f205c73ef553558c - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-24 16:47:25 +08:00

Cody Yu d11bf435a0

[MISC] Consolidate cleanup() and refactor offline_inference_with_prefix.py (#9510)

2024-10-18 14:30:55 -07:00

device_communicators

[torch.compile] improve allreduce registration (#9061)

2024-10-04 16:43:50 -07:00

__init__.py

[Core][Refactor] move parallel_utils into vllm/distributed (#3950)

2024-04-10 15:33:30 -07:00

communication_op.py

[Bugfix] Fix weight loading for Chameleon when TP>1 (#7410)

2024-08-13 05:33:41 +00:00

parallel_state.py

[MISC] Consolidate cleanup() and refactor offline_inference_with_prefix.py (#9510)

2024-10-18 14:30:55 -07:00

utils.py

[MISC] Introduce pipeline parallelism partition strategies (#6920)

2024-07-31 12:02:17 -07:00