vllm/model_executor at 77f8001f533021ece46779f5b7e69edc1d3b514f - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-18 08:27:09 +08:00

History

tomeras91 77f8001f53

[Model][Bugfix] fix pipeline parallelism support for NemotronH (#27968 )

Signed-off-by: Tomer Asida <57313761+tomeras91@users.noreply.github.com>

2025-11-04 12:28:36 +00:00

..

Support using Int4PreshuffledTensor after loading (#26066 )

2025-11-04 06:00:57 -05:00

[Feat] Adds runai distributed streamer (#27230 )

2025-10-29 21:09:10 -07:00

[Model][Bugfix] fix pipeline parallelism support for NemotronH (#27968 )

2025-11-04 12:28:36 +00:00

[BugFix][Performance] Restore flashinfer autotuning for all scenarios (#27904 )

2025-11-04 15:56:21 +08:00

__init__.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

custom_op.py

[FrontEnd] UNREVERT CompilationConfig overhaul (#20283 ): deprecate use_inductor in favor of backend, simplify custom_ops (#26502 )

2025-10-13 22:47:16 +00:00

parameter.py

[Docs] Replace rst style double-backtick with md single-backtick (#27091 )

2025-10-17 02:47:34 -07:00

utils.py

[Chore] Clean up pytorch helper functions in vllm.utils (#26908 )

2025-10-18 09:48:22 -07:00