This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-03-25 00:29:13 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
model_executor
/
model_loader
History
Shashwat Srijan
bb76538bbd
[Hardwware][Neuron] Simplify model load for transformers-neuronx library (
#9380
)
2024-10-17 15:39:39 -07:00
..
__init__.py
[VLM] Refactor
MultiModalConfig
initialization and profiling (
#7530
)
2024-08-17 13:30:55 -07:00
loader.py
support bitsandbytes quantization with more models (
#9148
)
2024-10-08 19:52:19 -06:00
neuron.py
[Hardwware][Neuron] Simplify model load for transformers-neuronx library (
#9380
)
2024-10-17 15:39:39 -07:00
openvino.py
[OpenVINO] Enable GPU support for OpenVINO vLLM backend (
#8192
)
2024-10-02 17:50:01 -04:00
tensorizer.py
[CI/Build] Update Ruff version (
#8469
)
2024-09-18 11:00:56 +00:00
utils.py
[Kernel] Zero point support in fused MarlinMoE kernel + AWQ Fused MoE (
#8973
)
2024-10-04 12:34:44 -06:00
weight_utils.py
[Misc] Print stack trace using
logger.exception
(
#9461
)
2024-10-17 13:55:48 +00:00