From 2c1652604becfb356c270590ad46a4d24a90bf3b Mon Sep 17 00:00:00 2001
From: Erich Blume <blume.erich@gmail.com>
Date: Thu, 26 Mar 2026 19:48:37 -0700
Subject: [PATCH] Reduce PodNotReady alert lookback from 5m to 60s

The 5-minute lookback window kept stale data from terminated pods
visible during rollouts, causing the alert to sit in Pending for
~5 minutes after every routine deployment. 60s still covers two
scrape cycles (30s interval) while clearing stale data much faster.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---
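Reviewer note (not part of the commit): a minimal sketch of the
arithmetic behind the "two scrape cycles" claim, assuming the 30s
scrape interval stated in the message. The helper name is hypothetical
and does not exist in this repo.

```python
# Illustrative only: full_scrape_cycles is a hypothetical helper.
def full_scrape_cycles(lookback_s: int, scrape_interval_s: int) -> int:
    """Return how many whole scrape intervals fit in the lookback window."""
    if scrape_interval_s <= 0:
        raise ValueError("scrape interval must be positive")
    return lookback_s // scrape_interval_s


SCRAPE_INTERVAL_S = 30  # taken from the commit message

# Old and new values of relativeTimeRange.from, in seconds.
for lookback_s in (300, 60):
    cycles = full_scrape_cycles(lookback_s, SCRAPE_INTERVAL_S)
    print(f"from: {lookback_s}s covers {cycles} scrape cycles")
```

If the scrape interval were raised above 30s, the 60s window would no
longer hold two cycles, so the two values likely need to change
together.
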
 argocd/manifests/grafana/alerting.yaml          | 2 +-
 docs/changelog.d/+podnotready-lookback.infra.md | 1 +
 2 files changed, 2 insertions(+), 1 deletion(-)
 create mode 100644 docs/changelog.d/+podnotready-lookback.infra.md

diff --git a/argocd/manifests/grafana/alerting.yaml b/argocd/manifests/grafana/alerting.yaml
index 69dbec5..b220044 100644
--- a/argocd/manifests/grafana/alerting.yaml
+++ b/argocd/manifests/grafana/alerting.yaml
@@ -277,7 +277,7 @@ groups:
           - refId: A
             datasourceUid: prometheus
             relativeTimeRange:
-              from: 300
+              from: 60
               to: 0
             model:
               expr: >-
diff --git a/docs/changelog.d/+podnotready-lookback.infra.md b/docs/changelog.d/+podnotready-lookback.infra.md
new file mode 100644
index 0000000..fec02df
--- /dev/null
+++ b/docs/changelog.d/+podnotready-lookback.infra.md
@@ -0,0 +1 @@
+Reduce PodNotReady alert lookback window from 5m to 60s to clear faster after rollouts.