KubeLab boots a three-node Kubernetes cluster and runs seven failure simulations. It deploys Node.js, Postgres, Prometheus, and Grafana. Then it deletes pods, forces OOMKill, throttles CPU, drains nodes, and scales a StatefulSet to zero. Each scenario surfaces fixes: readiness probes, PodDisruptionBudget, anti-affinity, resource tuning, and Prometheus alerts. No toy chaos. Real nodes. Real pain.










