mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced 2025-12-10 04:34:57 +08:00
[Doc] Update description of vLLM support for CPUs (#6003)
This commit is contained in:
parent
99ded1e1c4
commit
439c84581a
@ -59,7 +59,7 @@ vLLM is flexible and easy to use with:
|
|||||||
- Tensor parallelism support for distributed inference
|
- Tensor parallelism support for distributed inference
|
||||||
- Streaming outputs
|
- Streaming outputs
|
||||||
- OpenAI-compatible API server
|
- OpenAI-compatible API server
|
||||||
- Support NVIDIA GPUs, AMD GPUs, Intel CPUs and GPUs
|
- Support NVIDIA GPUs, AMD CPUs and GPUs, Intel CPUs and GPUs, PowerPC CPUs
|
||||||
- (Experimental) Prefix caching support
|
- (Experimental) Prefix caching support
|
||||||
- (Experimental) Multi-lora support
|
- (Experimental) Multi-lora support
|
||||||
|
|
||||||
|
|||||||
@ -20,7 +20,7 @@ Requirements
|
|||||||
|
|
||||||
* OS: Linux
|
* OS: Linux
|
||||||
* Compiler: gcc/g++>=12.3.0 (optional, recommended)
|
* Compiler: gcc/g++>=12.3.0 (optional, recommended)
|
||||||
* Instruction set architecture (ISA) requirement: AVX512 is required.
|
* Instruction set architecture (ISA) requirement: AVX512 (optional, recommended)
|
||||||
|
|
||||||
.. _cpu_backend_quick_start_dockerfile:
|
.. _cpu_backend_quick_start_dockerfile:
|
||||||
|
|
||||||
|
|||||||
Loading…
x
Reference in New Issue
Block a user