[Misc] Fix Qwen3-VL video_grid_thw typing (#25646)

Signed-off-by: Roger Wang <hey@rogerw.io>
2026-01-26 12:24:28 +08:00 · 2025-09-25 03:16:45 -07:00
commit 7be9ffcd9f
parent 393de22d2e
1 changed file with 1 addition and 1 deletion
--- a/vllm/model_executor/models/qwen3_vl.py
+++ b/vllm/model_executor/models/qwen3_vl.py
@@ -1249,7 +1249,7 @@ class Qwen3VLForConditionalGeneration(nn.Module, SupportsMultiModal,
                                                         rope_type="rope_3d")
            else:
                video_embeds = self.visual(pixel_values_videos,
-                                           grid_thw=grid_thw)
+                                           grid_thw=grid_thw_list)

        # Split concatenated embeddings for each video item.
        # Using prod on grid_thw_list instead of grid_thw.prod avoids CUDA sync
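
The trailing comment says that taking the product over the Python list `grid_thw_list`, rather than calling `.prod` on the `grid_thw` tensor, avoids a CUDA sync. A minimal pure-Python sketch of that idea follows. It assumes each entry is a `[t, h, w]` triple and that patches are merged in `merge_size` by `merge_size` blocks. The names `split_sizes`, `split_flat`, and `merge_size` are illustrative and not taken from this commit.

```python
import math


def split_sizes(grid_thw_list, merge_size=2):
    """Per-video embedding counts from host-side [t, h, w] triples.

    Hypothetical helper: merge_size is an assumed spatial merge factor.
    """
    # math.prod works on plain Python ints, so no GPU tensor is read back.
    return [math.prod(thw) // (merge_size * merge_size) for thw in grid_thw_list]


def split_flat(items, sizes):
    """Split a flat sequence into consecutive chunks of the given sizes."""
    chunks, start = [], 0
    for size in sizes:
        chunks.append(items[start:start + size])
        start += size
    return chunks


# Two videos: 1x4x4 and 2x4x6 patches.
sizes = split_sizes([[1, 4, 4], [2, 4, 6]])
# Stand-in for the concatenated embeddings: one integer per embedding row.
chunks = split_flat(list(range(sum(sizes))), sizes)
```

Because the sizes come from a list that already lives on the host, the split points are known without copying any values back from the device.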