vllm/model_loader at 836e8ef6eeafcd1e24b25c990da6331f48a95fd2 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-01 10:57:13 +08:00

Shashwat Srijan bb76538bbd

[Hardware][Neuron] Simplify model load for transformers-neuronx library (#9380)

2024-10-17 15:39:39 -07:00

__init__.py

[VLM] Refactor MultiModalConfig initialization and profiling (#7530)

2024-08-17 13:30:55 -07:00

loader.py

support bitsandbytes quantization with more models (#9148)

2024-10-08 19:52:19 -06:00

neuron.py

[Hardware][Neuron] Simplify model load for transformers-neuronx library (#9380)

2024-10-17 15:39:39 -07:00

openvino.py

[OpenVINO] Enable GPU support for OpenVINO vLLM backend (#8192)

2024-10-02 17:50:01 -04:00

tensorizer.py

[CI/Build] Update Ruff version (#8469)

2024-09-18 11:00:56 +00:00

utils.py

[Kernel] Zero point support in fused MarlinMoE kernel + AWQ Fused MoE (#8973)

2024-10-04 12:34:44 -06:00

weight_utils.py

[Misc] Print stack trace using logger.exception (#9461)

2024-10-17 13:55:48 +00:00