mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced 2025-12-20 03:45:01 +08:00
60 lines
1.7 KiB
Markdown
60 lines
1.7 KiB
Markdown
# Grafana Dashboards for vLLM Monitoring
|
|
|
|
This directory contains Grafana dashboard configurations (as JSON) designed to monitor
|
|
vLLM performance and metrics.
|
|
|
|
## Requirements
|
|
|
|
- Grafana 8.0+
|
|
- Prometheus data source configured in Grafana
|
|
- vLLM deployment with Prometheus metrics enabled
|
|
|
|
## Dashboard Descriptions
|
|
|
|
- **performance_statistics.json**: Tracks performance metrics including latency and
|
|
throughput for your vLLM service.
|
|
- **query_statistics.json**: Tracks query performance, request volume, and key
|
|
performance indicators for your vLLM service.
|
|
|
|
## Deployment Options
|
|
|
|
### Manual Import (Recommended)
|
|
|
|
The easiest way to use these dashboards is to manually import the JSON configurations
|
|
directly into your Grafana instance:
|
|
|
|
1. Navigate to your Grafana instance
|
|
2. Click the '+' icon in the sidebar
|
|
3. Select 'Import'
|
|
4. Copy and paste the JSON content from the dashboard files, or upload the JSON files
|
|
directly
|
|
|
|
### Grafana Operator
|
|
|
|
If you're using the [Grafana Operator](https://github.com/grafana-operator/grafana-operator)
|
|
in Kubernetes, you can wrap these JSON configurations in a `GrafanaDashboard` custom
|
|
resource:
|
|
|
|
```yaml
|
|
# Note: Adjust the instanceSelector to match your Grafana instance's labels
|
|
# You can check with: kubectl get grafana -o yaml
|
|
apiVersion: grafana.integreatly.org/v1beta1
|
|
kind: GrafanaDashboard
|
|
metadata:
|
|
name: vllm-performance-dashboard
|
|
spec:
|
|
instanceSelector:
|
|
matchLabels:
|
|
dashboards: grafana # Adjust to match your Grafana instance labels
|
|
folder: "vLLM Monitoring"
|
|
json: |
|
|
# Replace this comment with the complete JSON content from
|
|
# performance_statistics.json - The JSON should start with { and end with }
|
|
```
|
|
|
|
Then apply to your cluster:
|
|
|
|
```bash
|
|
kubectl apply -f your-dashboard.yaml -n <namespace>
|
|
```
|