## Summary Mikado chain to replace `mise run services-check` with Grafana Unified Alerting backed by ntfy push notifications. **Design:** - Grafana Unified Alerting evaluates rules against Prometheus/Loki - ntfy webhook contact point delivers iOS notifications - Anti-noise policy: page once per 24h per alert group - Every alert links to a runbook in `docs/how-to/alerts/` - services-check eventually queries the alerting API instead of doing its own probes **Chain (bottom-up):** 1. `configure-grafana-alerting-pipeline` — enable alerting, ntfy contact point, notification policy 2. `first-alert-and-runbook` — end-to-end proof of concept with blackbox probe failure 3. `port-services-check-alerts` — migrate all services-check probes to alert rules + runbooks 4. `refactor-services-check-to-query-alerts` — rewrite services-check to query Grafana API 5. `deploy-infra-alerting` — goal card 🤖 Generated with [Claude Code](https://claude.com/claude-code) Reviewed-on: #303
1.2 KiB
1.2 KiB
| title | modified | tags | |||
|---|---|---|---|---|---|
| Runbook: Frigate Camera Down | 2026-03-22 |
|
Runbook: Frigate Camera Down
Alert name: FrigateCameraDown
A Frigate camera has reported 0 FPS for 5+ minutes, meaning the camera feed is not being received.
Diagnostic Steps
- Check Frigate UI — https://nvr.ops.eblu.me — look at the camera thumbnail and status
- Check Frigate API stats:
curl -s https://nvr.ops.eblu.me/api/stats | python3 -m json.tool - Check Frigate pod logs on ringtail:
kubectl logs -n frigate -l app=frigate --context=k3s-ringtail --tail=30 - Check the camera itself — verify it's powered on and network-connected. Try accessing the RTSP stream directly.
Common Causes
- Camera offline — power outage, network issue, or camera crash
- NFS mount lost — Frigate storage on sifaka; if the NFS mount drops, recording stops and FPS may drop
- Frigate pod restart — during restart, camera FPS briefly drops to 0
- RTSP stream timeout — camera firmware issue; power cycle the camera
Related
- frigate — Frigate NVR reference
- deploy-infra-alerting — Alerting pipeline overview