This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-06-09 11:29:06 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
model_executor
/
layers
/
quantization
/
quark
History
Harry Mellor
13698db634
Improve configs -
ModelConfig
(
#17130
)
...
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-04-30 10:38:22 +08:00
..
schemes
[Bugfix] Fix bugs of running Quark quantized models (
#16236
)
2025-04-11 10:18:32 -04:00
__init__.py
[Misc][Quark] Upstream Quark format to VLLM (
#10765
)
2025-01-15 11:05:15 -05:00
quark_moe.py
Upstream Llama4 Support to Main (
#16113
)
2025-04-07 08:06:27 -07:00
quark.py
Improve configs -
ModelConfig
(
#17130
)
2025-04-30 10:38:22 +08:00
utils.py
[Model][Quant] Fix GLM, Fix fused module mappings for quantization (
#12634
)
2025-02-05 05:32:06 +00:00