1 - Deployment Performance & Health
185,459

Created: 10/27/2021
Updated: 6/22/2023
Revision: 20
Grafana Version: >=9.5.3
Datasources: Prometheus

Description

This dashboard monitors the health and performance of deployed applications end to end, combining request-level observability with pod and node resource metrics. It highlights latency distributions, success and error rates, and resource saturation, which helps diagnose performance regressions and capacity constraints quickly. Key metrics include istio_request_duration_milliseconds_bucket for latency broken down by response code, istio_requests_total for the percentage of responses by response code (the success and error mix), and container_cpu_cfs_throttled_seconds_total, alongside memory and pod replica data, to surface CPU throttling and resource pressure.
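To make the metrics named above more concrete, the following Python sketch builds PromQL expressions of the kind such panels typically use. It is an illustration only: the helper names, the 5-minute rate window, the 0.95 quantile, and the `pod` grouping are assumptions and are not taken from the dashboard's actual panel queries.

```python
# Hypothetical helpers that build PromQL strings for the three metrics
# named in the description. These are NOT the dashboard's own queries.

def latency_quantile_query(quantile: float = 0.95, window: str = "5m") -> str:
    """Estimated latency quantile per response code from histogram buckets."""
    return (
        f"histogram_quantile({quantile}, "
        f"sum by (le, response_code) "
        f"(rate(istio_request_duration_milliseconds_bucket[{window}])))"
    )


def response_share_query(window: str = "5m") -> str:
    """Percentage of responses per response code over the window."""
    return (
        f"100 * sum by (response_code) (rate(istio_requests_total[{window}])) "
        f"/ scalar(sum(rate(istio_requests_total[{window}])))"
    )


def cpu_throttling_query(window: str = "5m") -> str:
    """Seconds per second that containers spent throttled, grouped by pod."""
    return (
        f"sum by (pod) "
        f"(rate(container_cpu_cfs_throttled_seconds_total[{window}]))"
    )


if __name__ == "__main__":
    for build in (latency_quantile_query, response_share_query, cpu_throttling_query):
        print(build())
```

Each string can be pasted into a Grafana panel backed by the Prometheus datasource or into the Prometheus expression browser; the window and grouping labels would likely need adjusting to match a given cluster.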

Source Grafana.com

Used Metrics 24