mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2025-12-12 09:35:34 +08:00

Signed-off-by: Russell Bryant <rbryant@redhat.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>

2025-04-28 15:12:17 +00:00

2.6 KiB

Raw Blame History

Security Guide

Inter-Node Communication

All communications between nodes in a multi-node vLLM deployment are insecure by default and must be protected by placing the nodes on an isolated network. This includes:

PyTorch Distributed communications
KV cache transfer communications
Tensor, Pipeline, and Data parallel communications

Configuration Options for Inter-Node Communications

The following options control inter-node communications in vLLM:

Environment Variables:
- VLLM_HOST_IP: Sets the IP address for vLLM processes to communicate on
KV Cache Transfer Configuration:
- --kv-ip: The IP address for KV cache transfer communications (default: 127.0.0.1)
- --kv-port: The port for KV cache transfer communications (default: 14579)
Data Parallel Configuration:
- data_parallel_master_ip: IP of the data parallel master (default: 127.0.0.1)
- data_parallel_master_port: Port of the data parallel master (default: 29500)

Notes on PyTorch Distributed

vLLM uses PyTorch's distributed features for some inter-node communication. For detailed information about PyTorch Distributed security considerations, please refer to the PyTorch Security Guide.

Key points from the PyTorch security guide:

PyTorch Distributed features are intended for internal communication only
They are not built for use in untrusted environments or networks
No authorization protocol is included for performance reasons
Messages are sent unencrypted
Connections are accepted from anywhere without checks

Security Recommendations

Network Isolation:
- Deploy vLLM nodes on a dedicated, isolated network
- Use network segmentation to prevent unauthorized access
- Implement appropriate firewall rules
Configuration Best Practices:
- Always set VLLM_HOST_IP to a specific IP address rather than using defaults
- Configure firewalls to only allow necessary ports between nodes
Access Control:
- Restrict physical and network access to the deployment environment
- Implement proper authentication and authorization for management interfaces
- Follow the principle of least privilege for all system components

Reporting Security Vulnerabilities

If you believe you have found a security vulnerability in vLLM, please report it following the project's security policy. For more information on how to report security issues and the project's security policy, please see the vLLM Security Policy.

2.6 KiB Raw Blame History