vllm/quantization at 34cda778a091d4e1fd204cfde4a0f5e2b5616ac2 - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-19 13:37:20 +08:00

History

[Perf] Use Triton instead of Torch for DeepGEMM Per Token Group Quant (#20841 )

Signed-off-by: yewentao256 <zhyanwentao@126.com>

2025-07-12 19:38:45 -07:00

nvfp4_utils.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_allspark_gemm.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_aqlm.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_awq_triton.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_awq.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_block_fp8.py

[Perf] Use Triton instead of Torch for DeepGEMM Per Token Group Quant (#20841 )

2025-07-12 19:38:45 -07:00

test_block_int8.py

[Kernels] MoE refactor (#19636 )

2025-07-02 06:08:27 -07:00

test_cutlass_2of4_sparse.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_cutlass_scaled_mm.py

[Kernel] Integrate CUTLASS MoE kernel with PPLX (#18762 )

2025-06-06 18:26:11 -07:00

test_fp8_quant.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_ggml.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_gguf.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_gptq.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_int8_kernel.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_int8_quant.py

[Perf] Vectorize static / dynamic INT8 quant kernels (#19233 )

2025-06-12 06:51:41 -07:00

test_machete_mm.py

Enable group size 64 for Machete (#20290 )

2025-07-01 18:05:44 -07:00

test_marlin_gemm.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_nvfp4_quant.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_nvfp4_scaled_mm.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_rocm_skinny_gemms.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_triton_scaled_mm.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00