split_decodes_and_prefills(..., require_uniform=True)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by: Benjamin Chislett <chislett.ben@gmail.com>
TRITON_MLA
vllm.utils
RendererConfig
ModelConfig
Optional[x]
x | None
Union[x, y]
x | y
multimodal_cpu_fields