From 2c1652604becfb356c270590ad46a4d24a90bf3b Mon Sep 17 00:00:00 2001
From: Erich Blume
Date: Thu, 26 Mar 2026 19:48:37 -0700
Subject: [PATCH] Reduce PodNotReady alert lookback from 5m to 60s

The 5-minute lookback window kept stale data from terminated pods
visible during rollouts, causing the alert to sit in Pending for ~5
minutes after every routine deployment. 60s still covers two scrape
cycles (30s interval) while clearing stale data much faster.

Co-Authored-By: Claude Opus 4.6 (1M context)
---
 argocd/manifests/grafana/alerting.yaml          | 2 +-
 docs/changelog.d/+podnotready-lookback.infra.md | 1 +
 2 files changed, 2 insertions(+), 1 deletion(-)
 create mode 100644 docs/changelog.d/+podnotready-lookback.infra.md

diff --git a/argocd/manifests/grafana/alerting.yaml b/argocd/manifests/grafana/alerting.yaml
index 69dbec5..b220044 100644
--- a/argocd/manifests/grafana/alerting.yaml
+++ b/argocd/manifests/grafana/alerting.yaml
@@ -277,7 +277,7 @@ groups:
         - refId: A
           datasourceUid: prometheus
           relativeTimeRange:
-            from: 300
+            from: 60
             to: 0
           model:
             expr: >-
diff --git a/docs/changelog.d/+podnotready-lookback.infra.md b/docs/changelog.d/+podnotready-lookback.infra.md
new file mode 100644
index 0000000..fec02df
--- /dev/null
+++ b/docs/changelog.d/+podnotready-lookback.infra.md
@@ -0,0 +1 @@
+Reduce PodNotReady alert lookback window from 5m to 60s to clear faster after rollouts.