Mirror of https://git.datalinker.icu/vllm-project/vllm.git
vllm/vllm/lora
Latest commit: d3d6bb13fb, Set weights_only=True when using torch.load() (#12366), Russell Bryant <rbryant@redhat.com>, 2025-01-24 02:17:30 +00:00
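The headline commit (#12366) is a security-minded change: with weights_only=True, torch.load deserializes only tensors and other plain data rather than unpickling arbitrary Python objects. A minimal sketch of the pattern, using a placeholder file name rather than vLLM's actual call site:

```python
import torch

# weights_only=True restricts deserialization to tensors and primitive
# containers, so a malicious checkpoint cannot execute arbitrary code
# through pickle. The file name below is a placeholder for illustration.
state_dict = torch.load("adapter_model.bin", map_location="cpu", weights_only=True)
```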
Name                       Last commit                                                              Date
ops/                       [Hardware][CPU] Multi-LoRA implementation for the CPU backend (#11100)  2025-01-12 13:01:52 +00:00
punica_wrapper/            [Platform] Move get_punica_wrapper() function to Platform (#11516)      2025-01-13 13:12:10 +00:00
__init__.py                [Experimental] Add multi-LoRA support (#1804)                           2024-01-23 15:26:37 -08:00
fully_sharded_layers.py    [Misc][LoRA] Clean up the function interface of Punica (#10917)         2024-12-05 13:22:28 +00:00
layers.py                  Support torchrun and SPMD-style offline inference (#12071)              2025-01-16 19:58:53 +08:00
lora.py                    [Misc][LoRA] Support Rank Stabilized LoRA (RSLoRA) (#6909)              2024-12-30 22:15:58 -08:00
models.py                  Set weights_only=True when using torch.load() (#12366)                  2025-01-24 02:17:30 +00:00
peft_helper.py             [Misc][LoRA] Improve the readability of LoRA error messages (#12102)    2025-01-17 19:32:28 +08:00
request.py                 [Core] Support LoRA lineage and base model metadata management (#6315)  2024-09-20 06:20:56 +00:00
utils.py                   [Misc][LoRA] Fix LoRA weight mapper (#11495)                            2024-12-26 15:52:48 +08:00
worker_manager.py          [Misc][LoRA] Improve the readability of LoRA error messages (#12102)    2025-01-17 19:32:28 +08:00
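Of the entries above, the lora.py commit refers to rank-stabilized LoRA (rsLoRA, #6909). The core idea is a change of scaling: standard LoRA multiplies the low-rank update by alpha / r, while rsLoRA uses alpha / sqrt(r) so the update is not scaled down as aggressively at higher ranks. A hypothetical helper sketching the rule (the name and signature are illustrative, not vLLM's interface):

```python
import math

def lora_scaling(alpha: float, r: int, use_rslora: bool = False) -> float:
    # Multiplier applied to the low-rank update B @ A @ x.
    # Standard LoRA: alpha / r. rsLoRA: alpha / sqrt(r), which keeps the
    # update magnitude stable as the rank r grows.
    # Hypothetical helper for illustration, not vLLM's actual code.
    return alpha / math.sqrt(r) if use_rslora else alpha / r
```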