vllm/README.md

188 B

CacheFlow

Installation

pip install cmake torch transformers
pip install flash-attn # This may take up to 10 mins.
pip install -e .

Run

python server.py