blumeops/docs/changelog.d
Erich Blume 4bbe4e7c20 Zero-downtime Fly.io deploys: bluegreen + startup reorder
Three changes to eliminate 502s during proxy deploys:

1. Start nginx after Tailscale connects (not before) so MagicDNS is
   always available when the first request arrives. This is the
   community-recommended pattern for Tailscale sidecars on Fly.io.

2. Switch deploy strategy to bluegreen — the old machine keeps serving
   traffic until the new one passes health checks, then Fly.io cuts
   over. Rolling deploys with a single machine always cause downtime.

3. Replace top-level [checks] with [[http_service.checks]]. Top-level
   checks only monitor; they don't gate traffic routing. Service-level
   checks tell the Fly Proxy to hold traffic until the app is ready.

The sentinel file (/tmp/tailscale-ready) and nginx if-check are removed
since nginx no longer starts before Tailscale.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-09 11:31:52 -08:00
..
.gitkeep Add towncrier changelog system (#86) 2026-02-03 11:48:13 -08:00
fix-deploy-healthcheck-race.bugfix.md Fix 502 errors during Fly.io proxy deploys (#131) 2026-02-09 11:07:36 -08:00
fix-real-client-ip-logging.bugfix.md Log real client IPs via Fly-Client-IP header (#130) 2026-02-09 11:02:06 -08:00
fix-zero-downtime-deploy.infra.md Zero-downtime Fly.io deploys: bluegreen + startup reorder 2026-02-09 11:31:52 -08:00