Health Dashboard

The Health Dashboard provides an at-a-glance view of the overall health of your Kubernetes cluster, with indicators for node health, workload status, and other key metrics.

Overview

The Health Dashboard includes:

Node Health: Status of nodes in the cluster
Workload Health: Deployment and service health checks
Resource Availability: Current utilization of CPU, memory, and storage
Alerting Dashboard: Active alerts and notifications

Key Health Metrics

Node Health Indicators

Node Status: Ready, NotReady, Unknown
Capacity and Allocatable Resources: CPU and memory
Node Conditions: MemoryPressure, DiskPressure, etc.

Workload Health Indicators

Deployment Status: Pod status and availability
Failed Pods: Recent failures and restarts
Service Reachability: Network and service checks

Alerting and Notifications

Current Alerts: Active cluster-wide alerts
History of Alerts: Past alerts and resolutions
Critical Notifications: Node and workload alerts

Usage

To view the health dashboard, use:

k8s-analyzer dashboard --view health

Integration

Monitoring Tools

Prometheus for gathering health metrics
Grafana for visual representation
Alertmanager for alerting

Maintenance Recommendations

Regular Monitoring: Always keep an eye on the dashboard for real-time indicators.
Alert Response: Address alerts promptly to mitigate issues.
Capacity Planning: Use health metrics for long-term planning.