vllm/model_executor at 0384aa7150c4c9778efca041ffd1beb3ad2bd694 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-03-21 16:53:37 +08:00

Jiangyun Zhu 3857eb8725

[Perf] Decouple torch op from GDA to leverage torch.compile (#27871)

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>

2025-10-31 21:35:52 +08:00

[Perf] Decouple torch op from GDA to leverage torch.compile (#27871)

2025-10-31 21:35:52 +08:00

[Feat] Adds runai distributed streamer (#27230)

2025-10-29 21:09:10 -07:00

[Kimi-Linear] Correct prefixes and add compatibility to AWQ quants (#27834)

2025-10-31 17:36:37 +08:00

[BugFix] Stopgap - Flashinfer Autotuner + GPT-OSS + DP/TP (#27762)

2025-10-30 08:24:31 -07:00

__init__.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

custom_op.py

[FrontEnd] UNREVERT CompilationConfig overhaul (#20283): deprecate use_inductor in favor of backend, simplify custom_ops (#26502)

2025-10-13 22:47:16 +00:00

parameter.py

[Docs] Replace rst style double-backtick with md single-backtick (#27091)

2025-10-17 02:47:34 -07:00

utils.py

[Chore] Clean up pytorch helper functions in vllm.utils (#26908)

2025-10-18 09:48:22 -07:00