vllm/lora at f441d36cee366126dc4f6db5b6ca262c1e0cc20c - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-24 03:07:12 +08:00

History

[LoRA] Support FusedMoE LoRA Triton kernel for mxfp4 (#29708 )

Signed-off-by: Xin Yang <xyangx@amazon.com>
Signed-off-by: Xin Yang <105740670+xyang16@users.noreply.github.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>

2025-11-30 10:37:25 +08:00

layers

[LoRA] Support FusedMoE LoRA Triton kernel for mxfp4 (#29708 )

2025-11-30 10:37:25 +08:00

ops

add support for --fully-sharded-loras in fused_moe (#28761 )

2025-11-19 16:32:00 +08:00

punica_wrapper

[Doc]: fixing typos in diverse files (#29492 )

2025-11-27 07:15:50 -08:00

__init__.py

…

lora_weights.py

[LoRA] Continue optimizing MoE LoRA weight loading (#29322 )

2025-11-27 05:56:28 -08:00

models.py

[LoRA] Cleanup LoRA unused code (#29611 )

2025-11-28 22:52:58 -08:00

peft_helper.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

request.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

resolver.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

utils.py

[LoRA] Continue optimizing MoE LoRA weight loading (#29322 )

2025-11-27 05:56:28 -08:00

worker_manager.py

[LoRA] Cleanup LoRA unused code (#29611 )

2025-11-28 22:52:58 -08:00