This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-01-18 00:34:35 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
quantization
History
Luka Govedič
172d1cd276
[Kernel] AQ AZP 4/4: Integrate asymmetric quantization to linear method (
#7271
)
2024-09-27 14:25:10 -04:00
..
__init__.py
…
test_bitsandbytes.py
[[Misc]Upgrade bitsandbytes to the latest version 0.44.0 (
#8768
)
2024-09-24 17:08:55 -07:00
test_compressed_tensors.py
[Kernel] AQ AZP 4/4: Integrate asymmetric quantization to linear method (
#7271
)
2024-09-27 14:25:10 -04:00
test_configs.py
…
test_cpu_offload.py
…
test_experts_int8.py
…
test_fp8.py
[CI/Build] Avoid CUDA initialization (
#8534
)
2024-09-18 10:38:11 +00:00
test_lm_head.py
…
utils.py
[CI/Build] Avoid CUDA initialization (
#8534
)
2024-09-18 10:38:11 +00:00