vllm/model_loader at 250e26a63e241076d8182155b9c7ea4f9f157ea3 - vllm - 丝路新云 code repository

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-05-18 00:46:59 +08:00

Tyler Michael Smith 7342a7d7f8

[Model] Support Mamba (#6484)

2024-10-11 15:40:06 +00:00

__init__.py

[VLM] Refactor MultiModalConfig initialization and profiling (#7530)

2024-08-17 13:30:55 -07:00

loader.py

support bitsandbytes quantization with more models (#9148)

2024-10-08 19:52:19 -06:00

neuron.py

[Hardware][Neuron] Add on-device sampling support for Neuron (#8746)

2024-10-04 16:42:20 -07:00

openvino.py

[OpenVINO] Enable GPU support for OpenVINO vLLM backend (#8192)

2024-10-02 17:50:01 -04:00

tensorizer.py

[CI/Build] Update Ruff version (#8469)

2024-09-18 11:00:56 +00:00

utils.py

[Kernel] Zero point support in fused MarlinMoE kernel + AWQ Fused MoE (#8973)

2024-10-04 12:34:44 -06:00

weight_utils.py

[Model] Support Mamba (#6484)

2024-10-11 15:40:06 +00:00