This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-04-16 22:17:07 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
vllm
/
model_executor
/
layers
/
quantization
/
kernels
History
czhu-cohere
2c2b140ae8
[quantization] use channel scales for w4a8 + misc fixes (
#23570
)
...
Signed-off-by: czhu-cohere <conway.zhu@cohere.com>
2025-08-26 18:23:23 -07:00
..
mixed_precision
[quantization] use channel scales for w4a8 + misc fixes (
#23570
)
2025-08-26 18:23:23 -07:00
scaled_mm
[CPU] Refactor CPU W8A8 scaled_mm (
#23071
)
2025-08-21 09:34:24 +08:00
__init__.py
[TPU][Quantization] TPU
W8A8
(
#11785
)
2025-01-08 19:33:29 +00:00