This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2025-12-18 05:35:01 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
csrc
/
quantization
/
cutlass_w8a8
/
moe
History
Wentao Ye
e8cb0d0495
[Bug] Fix Compressed Tensor NVFP4
cutlass_fp4_group_mm
illegal memory access (
#21465
)
...
Signed-off-by: yewentao256 <zhyanwentao@126.com>
2025-07-24 08:13:24 -07:00
..
blockwise_scaled_group_mm_sm100.cu
[fix]: disable cutlass block scaled group gemm for EP (
#20781
)
2025-07-11 02:39:18 +00:00
get_group_starts.cuh
[Kernel] CUTLASS grouped gemm fp8 MoE kernel (
#13972
)
2025-03-27 00:54:44 +00:00
grouped_mm_c3x_sm90.cu
[feat]: add SM100 support for cutlass FP8 groupGEMM (
#20447
)
2025-07-22 07:27:12 -07:00
grouped_mm_c3x_sm100.cu
[feat]: add SM100 support for cutlass FP8 groupGEMM (
#20447
)
2025-07-22 07:27:12 -07:00
grouped_mm_c3x.cuh
[feat]: add SM100 support for cutlass FP8 groupGEMM (
#20447
)
2025-07-22 07:27:12 -07:00
moe_data.cu
[Bug] Fix Compressed Tensor NVFP4
cutlass_fp4_group_mm
illegal memory access (
#21465
)
2025-07-24 08:13:24 -07:00