Logo
Explore Help
Sign In
xinyun/vllm
1
0
Fork 0
You've already forked vllm
mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-01-17 12:54:34 +08:00
Code Issues Packages Projects Releases Wiki Activity
vllm/vllm/distributed
History
Chendi.Xue 6ca74bc11a
[NIXL][BUG FIX] Fix both failing issue and accuracy issue with nixl + host_buffer on CUDA (#30419)
Signed-off-by: Chendi Xue <chendi.xue@intel.com>
Signed-off-by: Chendi.Xue <chendi.xue@intel.com>
Co-authored-by: Nicolò Lucchesi <nlucches@redhat.com>
2025-12-18 22:10:02 +00:00
..
device_communicators
[Perf] Do FP4 quant before All gather on flashinfer trtllmgen MOE (#30014)
2025-12-16 13:01:48 -08:00
ec_transfer
[Platform] Let EPD work with non-cuda platform (#30225)
2025-12-18 06:45:29 +00:00
eplb
[ROCm][CI] Add "Qwen3-Next-80B-A3B-Instruct MTP Async EPLB Accuracy Test" Back Into AMD CI (#30590)
2025-12-14 06:56:26 +00:00
kv_transfer
[NIXL][BUG FIX] Fix both failing issue and accuracy issue with nixl + host_buffer on CUDA (#30419)
2025-12-18 22:10:02 +00:00
__init__.py
…
communication_op.py
…
kv_events.py
[KVConnector] Add KV events to KV Connectors (#28309)
2025-12-11 15:30:29 +01:00
parallel_state.py
[Perf] Do FP4 quant before All gather on flashinfer trtllmgen MOE (#30014)
2025-12-16 13:01:48 -08:00
tpu_distributed_utils.py
…
utils.py
[UX] Suppress gloo log spam (#29250)
2025-11-25 17:19:35 -08:00
Powered by Gitea Version: 1.23.1 Page: 1277ms Template: 4ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API