vllm/model_executor at 0e71eaa6447d99e76de8e03213ec22bc1d3b07df - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-04-11 02:27:03 +08:00

History

汪志鹏 0e71eaa644

[Feature] AWQ marlin quantization support for fused moe with lora (#30442 )

Signed-off-by: princepride <wangzhipeng628@gmail.com>

2025-12-11 18:03:32 +00:00

..

[Feature] AWQ marlin quantization support for fused moe with lora (#30442 )

2025-12-11 18:03:32 +00:00

[Model][Quantization] Restore MoE + GGUF models support (incl. Qwen3 MoE) by allowing Sideload Parameters (#30116 )

2025-12-09 05:30:05 +00:00

Add Eagle and Eagle3 support to Transformers modeling backend (#30340 )

2025-12-11 17:02:10 +00:00

[BugFix] Fix AttributeError: 'MergedColumnParallelLinear' object has no attribute 'weight_scale' (#30399 )

2025-12-10 07:59:23 -08:00

__init__.py

…

custom_op.py

…

parameter.py

…

utils.py

[Quantization] FP8 Weight Reloading for Quantized RL Rollout (#28480 )

2025-12-09 13:54:32 -08:00