vllm/model_executor at b26b70bec4faa3dae487db1f3eebaddd86c1d828 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-08-02 00:49:09 +08:00

History

Nicolò Lucchesi b26b70bec4

[Misc] Refactor get_kv_cache_spec into AttentionLayerBase (#26587 )

Signed-off-by: NickLucche <nlucches@redhat.com>

2025-10-18 13:51:21 +00:00

..

[Misc] Refactor get_kv_cache_spec into AttentionLayerBase (#26587 )

2025-10-18 13:51:21 +00:00

[Chore] Separate out vllm.utils.importlib (#27022 )

2025-10-17 00:48:59 +00:00

[Misc] Refactor get_kv_cache_spec into AttentionLayerBase (#26587 )

2025-10-18 13:51:21 +00:00

[Feature] Migrate DeepGEMM API from get_m_alignment_for_contiguous_layout to get_mk_alignment_for_contiguous_layout (#26935 )

2025-10-16 16:46:48 -04:00

__init__.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

custom_op.py

[FrontEnd] UNREVERT CompilationConfig overhaul (#20283 ): deprecate use_inductor in favor of backend, simplify custom_ops (#26502 )

2025-10-13 22:47:16 +00:00

parameter.py

[Docs] Replace rst style double-backtick with md single-backtick (#27091 )

2025-10-17 02:47:34 -07:00

utils.py

disable graph partition in custom op (#26952 )

2025-10-17 11:08:47 +08:00