vllm/distributed at 19cc9468fd0fa1701e7cb74b5928b329a1d16cf1 - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-01-17 12:54:34 +08:00

[NIXL][BUG FIX] Fix both failing issue and accuracy issue with nixl + host_buffer on CUDA (#30419)

Signed-off-by: Chendi Xue <chendi.xue@intel.com>
Signed-off-by: Chendi.Xue <chendi.xue@intel.com>
Co-authored-by: Nicolò Lucchesi <nlucches@redhat.com>

2025-12-18 22:10:02 +00:00

device_communicators

[Perf] Do FP4 quant before All gather on flashinfer trtllmgen MOE (#30014)

2025-12-16 13:01:48 -08:00

ec_transfer

[Platform] Let EPD work with non-cuda platform (#30225)

2025-12-18 06:45:29 +00:00

eplb

[ROCm][CI] Add "Qwen3-Next-80B-A3B-Instruct MTP Async EPLB Accuracy Test" Back Into AMD CI (#30590)

2025-12-14 06:56:26 +00:00

kv_transfer

[NIXL][BUG FIX] Fix both failing issue and accuracy issue with nixl + host_buffer on CUDA (#30419)

2025-12-18 22:10:02 +00:00

__init__.py

…

communication_op.py

…

kv_events.py

[KVConnector] Add KV events to KV Connectors (#28309)

2025-12-11 15:30:29 +01:00

parallel_state.py

[Perf] Do FP4 quant before All gather on flashinfer trtllmgen MOE (#30014)

2025-12-16 13:01:48 -08:00

tpu_distributed_utils.py

…

utils.py

[UX] Suppress gloo log spam (#29250)

2025-11-25 17:19:35 -08:00