vllm/model_executor at de94289a98d7ec52a5ef02719e01a1db8b505170 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-26 12:47:26 +08:00

Kyle Sayers de94289a98

[Core] Support weight_loader_v2 for UnquantizedLinearMethod (#23036)

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

2025-09-23 18:30:26 -06:00

[Core] Support weight_loader_v2 for UnquantizedLinearMethod (#23036)

2025-09-23 18:30:26 -06:00

[TPU] Deprecate `xm.mark_step` in favor of `torch_xla.sync` (#25254)

2025-09-22 10:12:57 +00:00

Remove redundant mutates_args and dispatch_key for direct_register_custom_op (#25512)

2025-09-23 22:48:40 +00:00

[Bug] Fix AttributeError: 'FusedMoE' object has no attribute 'w13_weight_scale'. Did you mean: 'w13_weight_scale_inv'? (#25519)

2025-09-24 00:07:51 +00:00

__init__.py

[V0 Deprecation] Remove V0 sampling metadata (#25345)

2025-09-21 10:37:11 -07:00

custom_op.py

[V0 deprecation] Deprecate V0 Neuron backend (#21159)

2025-09-06 16:15:18 -07:00

parameter.py

[Core] Support weight_loader_v2 for UnquantizedLinearMethod (#23036)

2025-09-23 18:30:26 -06:00

utils.py

[OOT] Support sync_model_loading for OOT (#25126)

2025-09-19 05:41:53 +00:00