Mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2026-01-26 21:34:39 +08:00
Add CPU support model (#28697)
Signed-off-by: Tsai, Louie <louie.tsai@intel.com>
parent 7ed27f3cb5
commit ae4821a108
docs/models/hardware_supported_models/cpu.md (normal file, 26 lines added)
@@ -0,0 +1,26 @@
# CPU - Intel® Xeon®

## Supported Models

### Text-only Language Models
| Model | Architecture | Supported |
|--------------------------------------|-------------------------------------------|-----------|
| meta-llama/Llama-3.1 / 3.3 | LlamaForCausalLM | ✅ |
| meta-llama/Llama-4-Scout | Llama4ForConditionalGeneration | ✅ |
| meta-llama/Llama-4-Maverick | Llama4ForConditionalGeneration | ✅ |
| ibm-granite/granite (Granite-MOE) | GraniteMoeForCausalLM | ✅ |
| Qwen/Qwen3 | Qwen3ForCausalLM | ✅ |
| zai-org/GLM-4.5 | GLMForCausalLM | ✅ |
| google/gemma | GemmaForCausalLM | ✅ |

### Multimodal Language Models
| Model | Architecture | Supported |
|--------------------------------------|-------------------------------------------|-----------|
| Qwen/Qwen2.5-VL | Qwen2VLForConditionalGeneration | ✅ |
| openai/whisper | WhisperForConditionalGeneration | ✅ |

✅ Runs and optimized.
🟨 Runs and correct but not optimized to green yet.
❌ Does not pass accuracy test or does not run.