[BugFix] Workspace allocation during profile run : DeepEPHighThroughput + DeepGEMM (#30899)

Varun Sundar Rabindranath 2025-12-17 18:00:59 -05:00 committed by GitHub
parent e06d0bf0aa
commit e3fc374a9a


@@ -795,7 +795,10 @@ class FusedMoEModularKernel(torch.nn.Module):
 top_k,
 global_num_experts,
 local_num_experts,
-expert_tokens_meta,
+# expert_tokens_meta help in allocating optimal/minimal
+# amount of workspace. Mark it None, so we allocate for
+# the worst-case scenario.
+expert_tokens_meta=None,
 )
 )
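
The comment in the hunk above says that per-expert token metadata lets the kernel size its workspace minimally, and that passing None forces worst-case sizing. The following is a minimal sketch of that trade-off, not the actual vLLM implementation: `workspace_rows` and its parameters are hypothetical names, and the sizing rule (one row per routed token, capped at `num_tokens * top_k`) is an assumption made for illustration.

```python
from typing import Optional, Sequence


def workspace_rows(
    num_tokens: int,
    top_k: int,
    expert_num_tokens: Optional[Sequence[int]] = None,
) -> int:
    """Return how many workspace rows to reserve (hypothetical helper)."""
    # Upper bound: every token is routed to top_k experts.
    worst_case = num_tokens * top_k
    if expert_num_tokens is None:
        # No per-expert counts available: size for the worst case.
        return worst_case
    # Per-expert counts known: reserve only what this batch needs.
    return min(sum(expert_num_tokens), worst_case)


# With metadata, the reservation follows the observed routing.
rows_with_meta = workspace_rows(num_tokens=8, top_k=2, expert_num_tokens=[1, 0, 2, 1])
# With metadata withheld, the reservation covers any possible routing.
rows_without_meta = workspace_rows(num_tokens=8, top_k=2, expert_num_tokens=None)
```

In this sketch, a profiling pass that sized memory from `rows_with_meta` could reserve less than a later batch with the same shapes needs, which is presumably why the commit drops the metadata at this call site.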