Kubernetes / Kubelet
4,702,803

Created 4/22/2020
Updated 4/22/2020
Revision 1
Grafana Version >=6.6.2

Description

This dashboard provides an at-a-glance view of a node’s kubelet health and performance, correlating system readiness with pod, container, and volume activity. It emphasizes operation latency and reliability, with key metrics such as up, kubelet_runtime_operations_duration_seconds_bucket, and kubelet_pod_start_duration_seconds_count to surface both overall availability and latency/throughput of core kubelet tasks. Other notable areas include pod and container counts, volume state, and configuration error monitoring, enabling rapid diagnosis of scheduling and runtime issues.

Source Grafana.com

Used Metrics 24

  • go_goroutines

  • kubelet_cgroup_manager_duration_seconds_bucket

  • kubelet_cgroup_manager_duration_seconds_count

  • kubelet_node_config_error

  • kubelet_pleg_relist_duration_seconds_bucket

  • kubelet_pleg_relist_duration_seconds_count

  • kubelet_pleg_relist_interval_seconds_bucket

  • kubelet_pod_start_duration_seconds_count

  • kubelet_pod_worker_duration_seconds_bucket

  • kubelet_pod_worker_duration_seconds_count

  • kubelet_running_container_count

  • kubelet_running_pod_count

  • kubelet_runtime_operations_duration_seconds_bucket

  • kubelet_runtime_operations_errors_total

  • kubelet_runtime_operations_total

  • process_cpu_seconds_total

  • process_resident_memory_bytes

  • rest_client_request_latency_seconds_bucket

  • rest_client_requests_total

  • storage_operation_duration_seconds_bucket

  • storage_operation_duration_seconds_count

  • storage_operation_errors_total

  • up

  • volume_manager_total_volumes

Get Dashboard
Download
Copy to Clipboard