This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-03-20 01:21:19 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
distributed
History
Roger Wang
f5f51e5931
[Core][MM] Optimize encoder cache manager by operating with embeddings only (
#30475
)
...
Signed-off-by: Roger Wang <hey@rogerw.io> Co-authored-by: Sun Kim <sunytokki@gmail.com>
2025-12-16 14:18:17 -08:00
..
device_communicators
[Perf] Do FP4 quant before All gather on flashinfer trtllmgen MOE (
#30014
)
2025-12-16 13:01:48 -08:00
ec_transfer
[Core][MM] Optimize encoder cache manager by operating with embeddings only (
#30475
)
2025-12-16 14:18:17 -08:00
eplb
[ROCm][CI] Add "Qwen3-Next-80B-A3B-Instruct MTP Async EPLB Accuracy Test" Back Into AMD CI (
#30590
)
2025-12-14 06:56:26 +00:00
kv_transfer
[NIXL][BUG FIX] Fix a bug for PD with host_buffer after merging 29665 (
#30420
)
2025-12-14 15:38:28 +00:00
__init__.py
…
communication_op.py
…
kv_events.py
[KVConnector] Add KV events to KV Connectors (
#28309
)
2025-12-11 15:30:29 +01:00
parallel_state.py
[Perf] Do FP4 quant before All gather on flashinfer trtllmgen MOE (
#30014
)
2025-12-16 13:01:48 -08:00
tpu_distributed_utils.py
…
utils.py
[UX] Suppress gloo log spam (
#29250
)
2025-11-25 17:19:35 -08:00