This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-01-01 01:41:53 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
csrc
/
quantization
/
cutlass_w8a8
/
moe
History
Ming Yang
c438183e99
[Bugfix] Fix topk_ids indices_type for CUTLASS w8a8 FP8 MoE (
#20166
)
...
Signed-off-by: Ming Yang <yming@meta.com>
2025-07-08 23:10:57 +00:00
..
blockwise_scaled_group_mm_sm100.cu
[BugFix] Fix: ImportError when building on hopper systems (
#20513
)
2025-07-06 12:17:30 +08:00
get_group_starts.cuh
[Kernel] CUTLASS grouped gemm fp8 MoE kernel (
#13972
)
2025-03-27 00:54:44 +00:00
grouped_mm_c3x.cu
[Kernel] Integrate CUTLASS MoE kernel with PPLX (
#18762
)
2025-06-06 18:26:11 -07:00
grouped_mm_c3x.cuh
[Kernel] Integrate CUTLASS MoE kernel with PPLX (
#18762
)
2025-06-06 18:26:11 -07:00
moe_data.cu
[Bugfix] Fix topk_ids indices_type for CUTLASS w8a8 FP8 MoE (
#20166
)
2025-07-08 23:10:57 +00:00