mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced 2025-12-10 04:54:56 +08:00
docs: update fastsafetensors usage instructions (#22891)
Signed-off-by: Nir Levy <bhr166@gmail.com>
This commit is contained in:
parent
33c63e9547
commit
637093ae26
@ -2,4 +2,5 @@ Loading Model weights with fastsafetensors
|
|||||||
===================================================================
|
===================================================================
|
||||||
|
|
||||||
Using fastsafetensors library enables loading model weights to GPU memory by leveraging GPU direct storage. See [their GitHub repository](https://github.com/foundation-model-stack/fastsafetensors) for more details.
|
Using fastsafetensors library enables loading model weights to GPU memory by leveraging GPU direct storage. See [their GitHub repository](https://github.com/foundation-model-stack/fastsafetensors) for more details.
|
||||||
For enabling this feature, set the environment variable ``USE_FASTSAFETENSOR`` to ``true``
|
|
||||||
|
To enable this feature, use the ``--load-format fastsafetensors`` command-line argument
|
||||||
|
|||||||
Loading…
x
Reference in New Issue
Block a user