vllm/model_executor at 3c7fefdeba183e5c5e575f668b797549530f5a3d - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-03-25 19:10:17 +08:00

Jiangyun Zhu 8df98c2161

[perf] Enable concurrent execution of "shared_experts" and "selected_experts" in qwen3-next (#27578)

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>

2025-10-29 08:12:54 +00:00

[Bugfix] Fix non-contiguous tensor error in rocm_unquantized_gemm_impl (#27605)

2025-10-29 00:00:15 -07:00

[Chore]:Extract math and argparse utilities to separate modules (#27188)

2025-10-26 04:03:32 -07:00

[perf] Enable concurrent execution of "shared_experts" and "selected_experts" in qwen3-next (#27578)

2025-10-29 08:12:54 +00:00

[Bugfix] Fix gpt-oss w4a8 DP/EP on B200 (#26729)

2025-10-21 01:51:14 -04:00

__init__.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

custom_op.py

[FrontEnd] UNREVERT CompilationConfig overhaul (#20283): deprecate use_inductor in favor of backend, simplify custom_ops (#26502)

2025-10-13 22:47:16 +00:00

parameter.py

[Docs] Replace rst style double-backtick with md single-backtick (#27091)

2025-10-17 02:47:34 -07:00

utils.py

[Chore] Clean up pytorch helper functions in vllm.utils (#26908)

2025-10-18 09:48:22 -07:00