vllm/model_executor at 0d37450eb7e38ac82df35d4e0f21d4254435049d - vllm - 丝路新云-代码仓

xinyun/vllm

Mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2026-06-03 08:16:38 +08:00

bnellnm 47e66c24e2 [Model] Apply shared experts overlap optimization to all models with shared experts (#26145)

Signed-off-by: Bill Nell <bnell@redhat.com>

2025-10-09 11:31:04 -04:00

[Model] Apply shared experts overlap optimization to all models with shared experts (#26145)

2025-10-09 11:31:04 -04:00

[V0 deprecation] Remove QKVCrossParallelLinear implementation (#26475)

2025-10-09 10:52:27 +00:00

[Model] Apply shared experts overlap optimization to all models with shared experts (#26145)

2025-10-09 11:31:04 -04:00

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

__init__.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

custom_op.py

Revert #26113 "[Frontend] CompilationConfig overhaul (#20283): deprecate use_inductor in favor of backend, simplify custom_ops" (#26472)

2025-10-09 05:43:55 -07:00

parameter.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00

utils.py

Convert formatting to use ruff instead of yapf + isort (#26247)

2025-10-05 07:06:22 -07:00