mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2025-12-13 07:15:00 +08:00

Signed-off-by: Tsai, Louie <louie.tsai@intel.com>

2025-11-27 11:30:50 +08:00

CPU - Intel® Xeon®

Validated Hardware

Hardware
Intel® Xeon® 6 Processors
Intel® Xeon® 5 Processors

Model	Architecture	Supported
meta-llama/Llama-3.1-8B-Instruct	LlamaForCausalLM	✅
meta-llama/Llama-3.2-3B-Instruct	LlamaForCausalLM	✅
ibm-granite/granite-3.2-2b-instruct	GraniteForCausalLM	✅
Qwen/Qwen3-1.7B	Qwen3ForCausalLM	✅
Qwen/Qwen3-4B	Qwen3ForCausalLM	✅
Qwen/Qwen3-8B	Qwen3ForCausalLM	✅
zai-org/glm-4-9b-hf	GLMForCausalLM	✅
google/gemma-7b	GemmaForCausalLM	✅

Model	Architecture	Supported
Qwen/Qwen2.5-VL-7B-Instruct	Qwen2VLForConditionalGeneration	✅
openai/whisper-large-v3	WhisperForConditionalGeneration	✅

✅ Runs and optimized.
🟨 Runs and correct but not optimized to green yet.
❌ Does not pass accuracy test or does not run.