This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-01-10 11:34:45 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
model_executor
/
model_loader
History
Tyler Michael Smith
7342a7d7f8
[Model] Support Mamba (
#6484
)
2024-10-11 15:40:06 +00:00
..
__init__.py
[VLM] Refactor
MultiModalConfig
initialization and profiling (
#7530
)
2024-08-17 13:30:55 -07:00
loader.py
support bitsandbytes quantization with more models (
#9148
)
2024-10-08 19:52:19 -06:00
neuron.py
[Hardware][Neuron] Add on-device sampling support for Neuron (
#8746
)
2024-10-04 16:42:20 -07:00
openvino.py
[OpenVINO] Enable GPU support for OpenVINO vLLM backend (
#8192
)
2024-10-02 17:50:01 -04:00
tensorizer.py
[CI/Build] Update Ruff version (
#8469
)
2024-09-18 11:00:56 +00:00
utils.py
[Kernel] Zero point support in fused MarlinMoE kernel + AWQ Fused MoE (
#8973
)
2024-10-04 12:34:44 -06:00
weight_utils.py
[Model] Support Mamba (
#6484
)
2024-10-11 15:40:06 +00:00