
Perses Dashboards for vLLM Monitoring

This directory contains Perses dashboard configurations designed to monitor vLLM performance and metrics.

Requirements

  • Perses instance (standalone or via operator)
  • Prometheus data source configured in Perses
  • vLLM deployment with Prometheus metrics enabled
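
Before importing the dashboards, it can help to confirm that the last requirement is met. The following is a minimal sketch, not part of the dashboards themselves: `list_vllm_metrics` is a hypothetical helper name, and the usage line assumes the vLLM server is reachable at `http://localhost:8000` (adjust the host and port for your deployment). It prints the distinct metric names that start with the `vllm:` prefix.

```shell
# Hypothetical helper: print the distinct vLLM metric names found in
# Prometheus text-format input read from stdin.
list_vllm_metrics() {
  # Sample lines start with the metric name; "# HELP" and "# TYPE"
  # comment lines are skipped by the start-of-line anchor.
  grep -o '^vllm:[A-Za-z0-9_:]*' | sort -u
}

# Usage (assumes vLLM is serving on localhost:8000; adjust as needed):
#   curl -s http://localhost:8000/metrics | list_vllm_metrics
```

An empty result suggests that the endpoint is not the vLLM metrics endpoint or that metrics are not enabled.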

Dashboard Format

The dashboards are provided in the native Perses YAML format, which works across all deployment methods:

  • Files: *.yaml (native Perses dashboard specifications)
  • Format: Pure dashboard specifications, independent of the deployment method
  • Usage: Works with standalone Perses, API imports, the CLI, and file provisioning
  • Kubernetes: Directly compatible with the Perses Operator

Dashboard Descriptions

  • performance_statistics.yaml: Performance metrics with aggregated latency statistics
  • query_statistics.yaml: Query performance and deployment metrics

Deployment Options

Direct Import to Perses

Import the dashboard specifications via the Perses API or the percli CLI:

percli apply -f performance_statistics.yaml
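
To import both dashboards in one step, the command above can be wrapped in a small loop. This is a sketch under stated assumptions: `apply_dashboards` is a hypothetical helper name, and it assumes percli is already authenticated against your Perses instance with a target project selected.

```shell
# Hypothetical helper: apply every dashboard file passed as an argument,
# stopping at the first failure.
apply_dashboards() {
  for f in "$@"; do
    percli apply -f "$f" || return 1
  done
}

# Usage (assumes percli is logged in and a project is selected):
#   apply_dashboards performance_statistics.yaml query_statistics.yaml
```

Stopping at the first failure keeps a bad file from being hidden by later successful imports.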

Perses Operator (Kubernetes)

The native YAML format works directly with the Perses Operator:

kubectl apply -f performance_statistics.yaml -n <namespace>

File Provisioning

Place the YAML files in a Perses provisioning folder for automatic loading.
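
Copying the files into place can be sketched as follows. `stage_dashboards` is a hypothetical helper name, and the destination path in the usage line is only an assumption: use whichever folder your Perses configuration lists for provisioning.

```shell
# Hypothetical helper: copy all dashboard YAML files from a source
# directory into a provisioning folder, creating the folder if needed.
stage_dashboards() {
  src="$1"
  dest="$2"
  mkdir -p "$dest" && cp "$src"/*.yaml "$dest"/
}

# Usage (the destination path is an assumption; use the folder your
# Perses configuration lists for provisioning):
#   stage_dashboards . /etc/perses/provisioning
```

Only files ending in `.yaml` are copied, so this README and other files in the directory stay out of the provisioning folder.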