[Model] Qwen2.5-Omni Cleanup (#16872)

This commit is contained in:
Roger Wang 2025-04-19 02:37:02 -07:00 committed by GitHub
parent 83f3c3bd91
commit 5124f5bf51
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194
2 changed files with 2 additions and 5 deletions

View File

@ -1117,8 +1117,8 @@ Our PaliGemma implementations have the same problem as Gemma 3 (see above) for b
:::
:::{note}
To use Qwen2.5-Omni, you have to install a fork of Hugging Face Transformers library from source via
`pip install git+https://github.com/BakerBunker/transformers.git@qwen25omni`.
To use Qwen2.5-Omni, you have to install Hugging Face Transformers library from source via
`pip install git+https://github.com/huggingface/transformers.git`.
Read audio from video pre-processing is currently supported on V0 (but not V1), because overlapping modalities is not yet supported in V1.
`--mm-processor-kwargs '{"use_audio_in_video": True}'`.

View File

@ -518,9 +518,6 @@ class Qwen2_5OmniThinkerMultiModalProcessor(
"""
Qwen2.5-Omni reimplements this function to handle text only.
"""
print(prompt)
print(hf_processor_mm_kwargs)
print(mm_items)
if isinstance(prompt, str):
if enable_hf_prompt_update:
return self._apply_hf_processor_text_mm(