vllm/distributed at cbc53b6b8d87b29949ce13d504750f63714df532 - vllm - 丝路新云 Code Repository

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-01 15:57:05 +08:00

youkaichao 515080ad2f

[bugfix][distributed] fix shm broadcast when the queue size is full (#5801)

2024-06-25 21:56:02 -07:00

device_communicators

[bugfix][distributed] fix shm broadcast when the queue size is full (#5801)

2024-06-25 21:56:02 -07:00

__init__.py

[Core][Refactor] move parallel_utils into vllm/distributed (#3950)

2024-04-10 15:33:30 -07:00

communication_op.py

[Core][Distributed] code deduplication in tp&pp with coordinator (#5293)

2024-06-12 17:27:08 -07:00

parallel_state.py

[Speculative Decoding] Support draft model on different tensor-parallel size than target model (#5414)

2024-06-25 09:56:06 +00:00

utils.py

[Core][Distributed] improve p2p access check (#4992)

2024-05-29 11:29:07 +00:00