vllm/kernels at 9a3b88328f7e434cac35b90ee463de6689f9a833 - vllm - 丝路新云 Code Repository

xinyun/vllm

Mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2026-05-19 00:37:00 +08:00

22quinn 4671ac6e2a

[Bugfix][Benchmark] Fix Marlin benchmark (#19929)

2025-06-24 07:25:12 +09:00

..

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

bench_fp8_gemm.py

[Benchmark] Refactor benchmark script for fp8 & int8 (#19627)

2025-06-15 15:15:37 +08:00

bench_int8_gemm.py

[Benchmark] Refactor benchmark script for fp8 & int8 (#19627)

2025-06-15 15:15:37 +08:00

benchmark_aqlm.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

benchmark_bitblas.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

benchmark_cutlass_fp4_moe.py

[Hardware][NVIDIA] FP4 MoE kernel optimization (#19110)

2025-06-05 09:48:26 -07:00

benchmark_grouped_gemm_cutlass.py

[Kernel] Integrate CUTLASS MoE kernel with PPLX (#18762)

2025-06-06 18:26:11 -07:00

benchmark_layernorm.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

benchmark_lora.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

benchmark_machete.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

benchmark_marlin.py

[Bugfix][Benchmark] Fix Marlin benchmark (#19929)

2025-06-24 07:25:12 +09:00

benchmark_moe_align_block_size.py

[Frontend] Expose custom args in OpenAI APIs (#16862)

2025-06-18 17:41:11 -07:00

benchmark_moe_permute_unpermute.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

benchmark_moe.py

[Bugfix] Fix benchmark_moe.py (#19016)

2025-06-09 18:04:36 -07:00

benchmark_paged_attention.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

benchmark_quant.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

benchmark_rmsnorm.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

benchmark_rope.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

benchmark_shapes.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

benchmark_w8a8_block_fp8.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

graph_machete_bench.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

requirements.txt

[Kernel] (2/N) Machete - Integrate into CompressedTensorsWNA16 and GPTQMarlin (#7701)

2024-09-23 13:46:26 -04:00

utils.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00

weight_shapes.py

[Misc] Add SPDX-FileCopyrightText (#19100)

2025-06-03 11:20:17 -07:00