blumeops/argocd/manifests
Erich Blume 737371ab59 Add pod state observability to minikube dashboard (#83)
## Summary
- Add "Unhealthy Pods" stat panel showing count of pods in error states (ImagePullBackOff, CrashLoopBackOff, etc.) with red background when > 0
- Add "Pods by Waiting Reason" time series chart showing container waiting states over time
- Provides visibility into stuck pods that ArgoCD doesn't track (since it manages CronJobs, not the Jobs/Pods they spawn)

## Context
This addresses the issue where a `zim-watcher` cronjob pod was stuck in `ImagePullBackOff` for 11 days without any alerting. ArgoCD showed the CronJob as "Synced, Healthy" because it only manages the CronJob resource, not its spawned Jobs/Pods.

## Deployment and Testing
- [ ] Sync grafana-config app to test branch
- [ ] Verify dashboard renders correctly
- [ ] Confirm "Unhealthy Pods" shows 0 (green) when no issues
- [ ] Reset to main after merge

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Reviewed-on: https://forge.ops.eblu.me/eblume/blumeops/pulls/83
2026-02-03 07:20:05 -08:00
..
1password-connect Add External Secrets Operator with 1Password Connect (#66) (#66) 2026-01-28 19:30:10 -08:00
alloy-k8s Remove logfmt parsing stage from Alloy k8s config 2026-01-22 18:06:34 -08:00
argocd Homepage dashboard improvements (#76) 2026-01-30 15:05:02 -08:00
cloudnative-pg K8s Migration Phase 1: Infrastructure Setup (#29) 2026-01-19 09:49:52 -08:00
databases Switch all ExternalSecrets to creationPolicy: Owner 2026-01-28 20:27:16 -08:00
devpi Replace hajimari with gethomepage (#75) 2026-01-30 13:21:12 -08:00
external-secrets Add External Secrets Operator with 1Password Connect (#66) (#66) 2026-01-28 19:30:10 -08:00
forgejo-runner Update forgejo-runner to v2.2.0 (adds skopeo) 2026-01-30 11:13:54 -08:00
grafana Homepage dashboard improvements (#76) 2026-01-30 15:05:02 -08:00
grafana-config Add pod state observability to minikube dashboard (#83) 2026-02-03 07:20:05 -08:00
homepage Expand homepage widgets and info panels (#81) 2026-02-02 16:11:20 -08:00
immich Expand homepage widgets and info panels (#81) 2026-02-02 16:11:20 -08:00
kiwix Add pod state observability to minikube dashboard (#83) 2026-02-03 07:20:05 -08:00
kube-state-metrics Observability cleanup and k8s service monitoring (#43) (#43) 2026-01-22 13:51:01 -08:00
loki Replace hajimari with gethomepage (#75) 2026-01-30 13:21:12 -08:00
miniflux Expand homepage widgets and info panels (#81) 2026-02-02 16:11:20 -08:00
navidrome Expand homepage widgets and info panels (#81) 2026-02-02 16:11:20 -08:00
prometheus Replace hajimari with gethomepage (#75) 2026-01-30 13:21:12 -08:00
tailscale-operator Switch all ExternalSecrets to creationPolicy: Owner 2026-01-28 20:27:16 -08:00
teslamate Replace hajimari with gethomepage (#75) 2026-01-30 13:21:12 -08:00
torrent Expand homepage widgets and info panels (#81) 2026-02-02 16:11:20 -08:00