Logo
Explore Help
Sign In
xinyun/vllm
1
0
Fork 0
You've already forked vllm
mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-01-07 16:56:32 +08:00
Code Issues Packages Projects Releases Wiki Activity
vllm/csrc/quantization/cutlass_w8a8
History
Harry Mellor ba5c5e5404
[Docs] Switch to better markdown linting pre-commit hook (#21851)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
2025-07-29 19:45:08 -07:00
..
c3x
[Kernel] SM90 CUTLASS FP8 GEMM: add support for swap AB + kernel tuning (#20396)
2025-07-28 23:13:58 +00:00
moe
[Bug] Fix Compressed Tensor NVFP4 cutlass_fp4_group_mm illegal memory access (#21465)
2025-07-24 08:13:24 -07:00
Epilogues.md
[Docs] Switch to better markdown linting pre-commit hook (#21851)
2025-07-29 19:45:08 -07:00
scaled_mm_c2x_sm75_dispatch.cuh
…
scaled_mm_c2x_sm80_dispatch.cuh
…
scaled_mm_c2x_sm89_fp8_dispatch.cuh
[Bugfix] Fix cutlass dispatch for fp8/int8 to properly invoke M<=16 c… (#16751)
2025-04-27 19:38:42 -07:00
scaled_mm_c2x_sm89_int8_dispatch.cuh
[Bugfix] Fix cutlass dispatch for fp8/int8 to properly invoke M<=16 c… (#16751)
2025-04-27 19:38:42 -07:00
scaled_mm_c2x.cu
…
scaled_mm_c2x.cuh
…
scaled_mm_c3x_sm90.cu
Add cutlass support for blackwell fp8 blockwise gemm (#14383)
2025-05-08 15:09:55 -07:00
scaled_mm_c3x_sm100.cu
Add cutlass support for blackwell fp8 blockwise gemm (#14383)
2025-05-08 15:09:55 -07:00
scaled_mm_c3x_sm120.cu
[NVIDIA] Support Cutlass w8a8 FP8 for Blackwell Geforce GPUs (sm120) (#17280)
2025-07-02 06:47:19 -06:00
scaled_mm_entry.cu
[feat]: add SM100 support for cutlass FP8 groupGEMM (#20447)
2025-07-22 07:27:12 -07:00
Powered by Gitea Version: 1.23.1 Page: 6869ms Template: 5ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API