support flashinfer_fp4 moe for 5090 gpu (#26669)

Signed-off-by: XiaobingSuper <xiaobingzhangupc@gmail.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
This commit is contained in:
XiaobingZhang 2025-10-16 03:06:47 +08:00 committed by GitHub
parent 1f491aa0c8
commit 0b99f5d302
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

View File

@ -32,7 +32,7 @@ def is_flashinfer_fp4_cutlass_moe_available() -> bool:
envs.VLLM_USE_FLASHINFER_MOE_FP4
and has_flashinfer_cutlass_fused_moe()
and current_platform.is_cuda()
and current_platform.is_device_capability(100)
and current_platform.has_device_capability(100)
)