vllm/quantization at d67cc21b787dc594f6b168c4e44a0c6ecb415385 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-23 03:07:42 +08:00

History

Harry Mellor f2b20fe491

Consolidate Llama model usage in tests (#13094 )

2025-02-13 22:18:03 -08:00

..

__init__.py

[CI/Build] Move test_utils.py to tests/utils.py (#4425 )

2024-05-13 23:50:09 +09:00

test_bitsandbytes.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

test_compressed_tensors.py

[Bugfix/CI] Turn test_compressed_tensors_2of4_sparse back on (#13250 )

2025-02-13 20:19:25 -08:00

test_configs.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

test_cpu_offload.py

[CI] Fix failing FP8 cpu offload test (#13170 )

2025-02-12 19:16:06 +00:00

test_experts_int8.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

test_fp8.py

[ROCm] [Feature] [Doc] [Dockerfile] [BugFix] Support Per-Token-Activation Per-Channel-Weight FP8 Quantization Inferencing (#12501 )

2025-02-07 08:13:43 -08:00

test_gptq_dynamic.py

[CORE] [QUANT] Support for GPTQModel's dynamic quantization per module override/control (#7086 )

2025-02-12 09:19:43 -08:00

test_ipex_quant.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

test_lm_head.py

[CORE] [QUANT] Support for GPTQModel's dynamic quantization per module override/control (#7086 )

2025-02-12 09:19:43 -08:00

test_ptpc_fp8.py

[ROCm] [Feature] [Doc] [Dockerfile] [BugFix] Support Per-Token-Activation Per-Channel-Weight FP8 Quantization Inferencing (#12501 )

2025-02-07 08:13:43 -08:00

test_quark.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00

test_register_quantization_config.py

Consolidate Llama model usage in tests (#13094 )

2025-02-13 22:18:03 -08:00

utils.py

[Misc] Add SPDX-License-Identifier headers to python source files (#12628 )

2025-02-02 11:58:18 -08:00