diff --git a/docs/models/hardware_supported_models/cpu.md b/docs/models/hardware_supported_models/cpu.md
new file mode 100644
index 0000000000000..0832755f8fbe2
--- /dev/null
+++ b/docs/models/hardware_supported_models/cpu.md
@@ -0,0 +1,26 @@
+# CPU - Intel® Xeon®
+
+## Supported Models
+
+### Text-only Language Models
+
+| Model                                | Architecture                              | Supported |
+|--------------------------------------|-------------------------------------------|-----------|
+| meta-llama/Llama-3.1 / 3.3           | LlamaForCausalLM                          | ✅        |
+| meta-llama/Llama-4-Scout             | Llama4ForConditionalGeneration            | ✅        |
+| meta-llama/Llama-4-Maverick          | Llama4ForConditionalGeneration            | ✅        |
+| ibm-granite/granite (Granite-MOE)    | GraniteMoeForCausalLM                     | ✅        |
+| Qwen/Qwen3                           | Qwen3ForCausalLM                          | ✅        |
+| zai-org/GLM-4.5                      | GLMForCausalLM                            | ✅        |
+| google/gemma                         | GemmaForCausalLM                          | ✅        |
+
+### Multimodal Language Models
+
+| Model                                | Architecture                              | Supported |
+|--------------------------------------|-------------------------------------------|-----------|
+| Qwen/Qwen2.5-VL                      | Qwen2VLForConditionalGeneration           | ✅        |
+| openai/whisper                       | WhisperForConditionalGeneration           | ✅        |
+
+✅ Runs and is optimized.
+🟨 Runs and is correct, but is not yet optimized.
+❌ Does not pass the accuracy test or does not run.