This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-03-19 09:47:11 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
docs
/
source
/
features
/
quantization
History
Qubitium-ModelCloud
cd1d3c3df8
[Docs] Add GPTQModel (
#14056
)
...
Signed-off-by: mgoin <mgoin64@gmail.com> Co-authored-by: mgoin <mgoin64@gmail.com>
2025-03-03 21:59:09 +00:00
..
auto_awq.md
[Docs] Add GPTQModel (
#14056
)
2025-03-03 21:59:09 +00:00
bnb.md
…
fp8.md
[Doc] Convert docs to use colon fences (
#12471
)
2025-01-29 11:38:29 +08:00
gguf.md
[Model] Deepseek GGUF support (
#13167
)
2025-02-27 02:08:35 -08:00
gptqmodel.md
[Docs] Add GPTQModel (
#14056
)
2025-03-03 21:59:09 +00:00
index.md
[Docs] Add GPTQModel (
#14056
)
2025-03-03 21:59:09 +00:00
int4.md
[Doc] int4 w4a16 example (
#12585
)
2025-01-31 15:38:48 -08:00
int8.md
[Doc] int4 w4a16 example (
#12585
)
2025-01-31 15:38:48 -08:00
quantized_kvcache.md
…
supported_hardware.md
[Doc]: Improve feature tables (
#13224
)
2025-02-18 18:52:39 +08:00