mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced 2026-05-24 02:17:53 +08:00
[Doc] Remove Molmo warning (#27527)
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
This commit is contained in:
parent
315b860abe
commit
be7b55a83d
@ -736,37 +736,6 @@ Some models are supported only via the [Transformers backend](#transformers). Th
|
|||||||
!!! note
|
!!! note
|
||||||
To use `TIGER-Lab/Mantis-8B-siglip-llama3`, you have to pass `--hf_overrides '{"architectures": ["MantisForConditionalGeneration"]}'` when running vLLM.
|
To use `TIGER-Lab/Mantis-8B-siglip-llama3`, you have to pass `--hf_overrides '{"architectures": ["MantisForConditionalGeneration"]}'` when running vLLM.
|
||||||
|
|
||||||
!!! warning
|
|
||||||
The output quality of `AllenAI/Molmo-7B-D-0924` (especially in object localization tasks) has deteriorated in recent updates.
|
|
||||||
|
|
||||||
For the best results, we recommend using the following dependency versions (tested on A10 and L40):
|
|
||||||
|
|
||||||
??? code "Dependency versions"
|
|
||||||
|
|
||||||
```text
|
|
||||||
# Core vLLM-compatible dependencies with Molmo accuracy setup (tested on L40)
|
|
||||||
torch==2.5.1
|
|
||||||
torchvision==0.20.1
|
|
||||||
transformers==4.48.1
|
|
||||||
tokenizers==0.21.0
|
|
||||||
tiktoken==0.7.0
|
|
||||||
vllm==0.7.0
|
|
||||||
|
|
||||||
# Optional but recommended for improved performance and stability
|
|
||||||
triton==3.1.0
|
|
||||||
xformers==0.0.28.post3
|
|
||||||
uvloop==0.21.0
|
|
||||||
protobuf==5.29.3
|
|
||||||
openai==1.60.2
|
|
||||||
opencv-python-headless==4.11.0.86
|
|
||||||
pillow==10.4.0
|
|
||||||
|
|
||||||
# Installed FlashAttention (for float16 only)
|
|
||||||
flash-attn>=2.5.6 # Not used in float32, but should be documented
|
|
||||||
```
|
|
||||||
|
|
||||||
**Note:** Make sure you understand the security implications of using outdated packages.
|
|
||||||
|
|
||||||
!!! note
|
!!! note
|
||||||
The official `openbmb/MiniCPM-V-2` doesn't work yet, so we need to use a fork (`HwwwH/MiniCPM-V-2`) for now.
|
The official `openbmb/MiniCPM-V-2` doesn't work yet, so we need to use a fork (`HwwwH/MiniCPM-V-2`) for now.
|
||||||
For more details, please see: <https://github.com/vllm-project/vllm/pull/4087#issuecomment-2250397630>
|
For more details, please see: <https://github.com/vllm-project/vllm/pull/4087#issuecomment-2250397630>
|
||||||
|
|||||||
Loading…
x
Reference in New Issue
Block a user