CloudNativePG
448,310

Created 2/3/2024
Updated 4/14/2025
Revision 4
Categories
Databases
Grafana Version >=10.3.3
Datasources
Prometheus

Description

This dashboard monitors PostgreSQL replication health and resource usage for cloud-native deployments. It highlights replication status and lag with metrics like cnpg_pg_replication_lag, cnpg_pg_stat_replication_write_lag_seconds, and cnpg_pg_stat_replication_flush_lag_seconds, while also presenting node and volume metrics such as kubelet_volume_stats_available_bytes and kubelet_volume_stats_capacity_bytes to track storage capacity and availability. Key features include last failover, tps and latency metrics, and base backup/archived WAL status to provide a comprehensive view of both database health and underlying infrastructure.

Screenshots

Source Grafana.com

Used Metrics 40

  • cnpg_backends_max_tx_duration_seconds

  • cnpg_backends_total

  • cnpg_backends_waiting_total

  • cnpg_collector_first_recoverability_point

  • cnpg_collector_last_available_backup_timestamp

  • cnpg_collector_postgres_version

  • cnpg_pg_database_size_bytes

  • cnpg_pg_database_xid_age

  • cnpg_pg_postmaster_start_time

  • cnpg_pg_replication_in_recovery

  • cnpg_pg_replication_is_wal_receiver_up

  • cnpg_pg_replication_lag

  • cnpg_pg_replication_streaming_replicas

  • cnpg_pg_settings_setting

  • cnpg_pg_stat_archiver_seconds_since_last_archival

  • cnpg_pg_stat_database_deadlocks

  • cnpg_pg_stat_database_xact_commit

  • cnpg_pg_stat_database_xact_rollback

  • cnpg_pg_stat_replication_flush_lag_seconds

  • cnpg_pg_stat_replication_replay_lag_seconds

  • cnpg_pg_stat_replication_write_lag_seconds

  • container_memory_working_set_bytes

  • controller_runtime_reconcile_total

  • kube_node_labels

  • kube_pod_container_resource_requests

  • kube_pod_container_status_ready

  • kube_pod_info

  • kube_pod_spec_volumes_persistentvolumeclaims_info

  • kube_pod_status_ready

  • kubelet_volume_stats_available_bytes

  • kubelet_volume_stats_capacity_bytes

  • kubelet_volume_stats_inodes

  • kubelet_volume_stats_inodes_used

  • kubelet_volume_stats_used_bytes

  • label_topology_kubernetes_io_zone

  • node_namespace_pod_container:container_cpu_usage_seconds_total:sum_irate

  • tbs

  • timestamp

  • volume

  • wal

Get Dashboard
Download
Copy to Clipboard