[Doc] add ccache guide in doc (#5012)

Co-authored-by: Michael Goin <michael@neuralmagic.com>
This commit is contained in:
youkaichao 2024-05-23 16:21:54 -07:00 committed by GitHub
parent e3470f8753
commit 6a50f4cafa
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

View File

@ -56,6 +56,10 @@ You can also build and install vLLM from source:
$ # export VLLM_INSTALL_PUNICA_KERNELS=1 # optionally build for multi-LoRA capability $ # export VLLM_INSTALL_PUNICA_KERNELS=1 # optionally build for multi-LoRA capability
$ pip install -e . # This may take 5-10 minutes. $ pip install -e . # This may take 5-10 minutes.
.. tip::
Building from source requires quite a lot compilation. If you are building from source for multiple times, it is beneficial to cache the compilation results. For example, you can install `ccache <https://github.com/ccache/ccache>`_ via either `conda install ccache` or `apt install ccache` . As long as `which ccache` command can find the `ccache` binary, it will be used automatically by the build system. After the first build, the subsequent builds will be much faster.
.. tip:: .. tip::
To avoid your system being overloaded, you can limit the number of compilation jobs To avoid your system being overloaded, you can limit the number of compilation jobs
to be run simultaneously, via the environment variable `MAX_JOBS`. For example: to be run simultaneously, via the environment variable `MAX_JOBS`. For example: