Fix 502 errors during Fly.io proxy deploys (#131)
All checks were successful
Deploy Fly.io Proxy / deploy (push) Successful in 1m20s
All checks were successful
Deploy Fly.io Proxy / deploy (push) Successful in 1m20s
## Summary - Health check (`/healthz`) now returns 503 until Tailscale is connected - `start.sh` creates `/tmp/tailscale-ready` sentinel after `tailscale up` succeeds - Fly.io keeps the old machine serving traffic during the ~7s startup window Previously, nginx passed the health check immediately, Fly.io routed traffic to the new machine, but MagicDNS wasn't available yet — causing upstream DNS timeouts and 502s on every request until Tailscale connected. ## Deployment and Testing - [ ] Merge and `fly deploy` from `fly/` directory - [ ] Verify deploy completes with zero 502s (check Grafana docs-apm dashboard) - [ ] Confirm health check transitions from 503 → 200 in `fly logs` Reviewed-on: https://forge.ops.eblu.me/eblume/blumeops/pulls/131
This commit is contained in:
parent
3415cad38c
commit
bd61da4f85
3 changed files with 9 additions and 4 deletions
1
docs/changelog.d/fix-deploy-healthcheck-race.bugfix.md
Normal file
1
docs/changelog.d/fix-deploy-healthcheck-race.bugfix.md
Normal file
|
|
@ -0,0 +1 @@
|
|||
Fix 502 errors during Fly.io proxy deploys by deferring health check until Tailscale is connected.
|
||||
Loading…
Add table
Add a link
Reference in a new issue