[Misc] Fix Qwen3-VL video_grid_thw typing (#25646)

Signed-off-by: Roger Wang <hey@rogerw.io>
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Roger Wang 2025-09-25 03:16:45 -07:00 committed by yewentao256
parent 5e16b8c552
commit fd28c58825

@@ -1249,7 +1249,7 @@ class Qwen3VLForConditionalGeneration(nn.Module, SupportsMultiModal,
                                               rope_type="rope_3d")
             else:
                 video_embeds = self.visual(pixel_values_videos,
-                                           grid_thw=grid_thw)
+                                           grid_thw=grid_thw_list)
             # Split concatenated embeddings for each video item.
             # Using prod on grid_thw_list instead of grid_thw.prod avoids CUDA sync
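
The trailing comment in the hunk refers to computing per-video split sizes from a plain Python list of (t, h, w) triples rather than from a device tensor, so that no device-to-host copy is needed. A minimal sketch of that idea follows. It is illustrative only and not the code from this commit: the helper names `split_sizes` and `split_by_sizes` are hypothetical, `merge_size=2` is an assumed spatial merge factor, and a plain list stands in for the embedding tensor.

```python
import math


def split_sizes(grid_thw_list, merge_size=2):
    """Embedding rows per video item, computed entirely on the host.

    grid_thw_list is a plain Python list of [t, h, w] triples, so taking
    the product needs no device-to-host transfer. merge_size is an
    assumed spatial merge factor (hypothetical default).
    """
    return [
        math.prod(thw) // (merge_size * merge_size)
        for thw in grid_thw_list
    ]


def split_by_sizes(rows, sizes):
    """Split a concatenated sequence of rows into one chunk per item."""
    chunks, start = [], 0
    for n in sizes:
        chunks.append(rows[start:start + n])
        start += n
    return chunks


grid_thw_list = [[2, 4, 4], [1, 8, 8]]
sizes = split_sizes(grid_thw_list)          # [8, 16]
rows = list(range(sum(sizes)))              # stand-in for embedding rows
per_video = split_by_sizes(rows, sizes)     # two chunks, one per video
```

With tensors, calling `.prod(-1).tolist()` on a GPU-resident `grid_thw` would force a synchronization to bring the values back to the host; working from the list form avoids that.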