This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-03-24 10:10:15 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
docs
/
source
/
features
/
quantization
History
Michael Goin
ed37599544
Update supported_hardware.md for TPU INT8 (
#16437
)
2025-04-11 12:28:07 +08:00
..
auto_awq.md
[Docs] Add GPTQModel (
#14056
)
2025-03-03 21:59:09 +00:00
bnb.md
[Misc] Auto detect bitsandbytes pre-quantized models (
#16027
)
2025-04-04 23:30:45 -07:00
fp8.md
…
gguf.md
doc: fix some typos in doc (
#16154
)
2025-04-07 05:32:06 +00:00
gptqmodel.md
[Docs] Add GPTQModel (
#14056
)
2025-03-03 21:59:09 +00:00
index.md
Torchao (
#14231
)
2025-04-07 19:39:28 -04:00
int4.md
…
int8.md
…
quantized_kvcache.md
…
quark.md
[Doc] Quark quantization documentation (
#15861
)
2025-04-01 08:32:45 -07:00
supported_hardware.md
Update supported_hardware.md for TPU INT8 (
#16437
)
2025-04-11 12:28:07 +08:00
torchao.md
Torchao (
#14231
)
2025-04-07 19:39:28 -04:00