From 2e2a33d7ca30e38a380638f93dffe39a4a6743d1 Mon Sep 17 00:00:00 2001
From: Erich Blume
Date: Sun, 22 Mar 2026 14:23:42 -0700
Subject: [PATCH] C2(deploy-infra-alerting): close port-services-check-alerts

7 alert rules covering services-check probes:
- ServiceProbeFailure (11 HTTP probes via Alloy blackbox)
- PodNotReady (kube-state-metrics, both clusters)
- PostgresClusterUnhealthy (CNPG collector)
- TextfileStale (node_textfile_mtime_seconds)
- FrigateCameraDown (frigate_camera_fps)
- ArgoCDAppOutOfSync (argocd_app_info)

7 runbooks in docs/how-to/alerts/.

Remaining uncovered: local indri services (brew/launchctl), ringtail
SSH/tailscale, public Fly.io endpoints, k8s API health, frigate storage.
These are effectively covered by downstream alerts.

Co-Authored-By: Claude Opus 4.6 (1M context)
---
 docs/how-to/alerts/port-services-check-alerts.md | 1 -
 1 file changed, 1 deletion(-)

diff --git a/docs/how-to/alerts/port-services-check-alerts.md b/docs/how-to/alerts/port-services-check-alerts.md
index 807c340..c2ea6ad 100644
--- a/docs/how-to/alerts/port-services-check-alerts.md
+++ b/docs/how-to/alerts/port-services-check-alerts.md
@@ -1,7 +1,6 @@
 ---
 title: Port services-check Alerts to Grafana
 modified: 2026-03-22
-status: active
 requires:
   - first-alert-and-runbook
 tags: