This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2025-12-23 04:55:01 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
model_executor
/
model_loader
History
liangel-02
1d642872a2
[torchao] fix safetensors for sharding (
#28169
)
...
Signed-off-by: Angel Li <liangel@meta.com>
2025-11-19 16:39:45 -08:00
..
__init__.py
…
base_loader.py
…
bitsandbytes_loader.py
…
default_loader.py
[torchao] fix safetensors for sharding (
#28169
)
2025-11-19 16:39:45 -08:00
dummy_loader.py
…
gguf_loader.py
[Model] Add Gemma3 GGUF multimodal support (
#27772
)
2025-11-18 08:56:29 -08:00
online_quantization.py
Move online quantization to
model.load_weights
(
#26327
)
2025-11-18 16:52:41 -08:00
runai_streamer_loader.py
[Feat] Adds runai distributed streamer (
#27230
)
2025-10-29 21:09:10 -07:00
sharded_state_loader.py
…
tensorizer_loader.py
…
tensorizer.py
[V0 deprecation] Remove VLLM_USE_V1 usage in most modules (
#27955
)
2025-11-04 20:51:16 -08:00
tpu.py
…
utils.py
Move online quantization to
model.load_weights
(
#26327
)
2025-11-18 16:52:41 -08:00
weight_utils.py
[torchao] fix safetensors for sharding (
#28169
)
2025-11-19 16:39:45 -08:00