blumeops/docs/reference/infrastructure/routing.md
Erich Blume 85e36cd807 Operations and observability for sifaka NAS (#135)
## Summary
- Add `smartctl_exporter` Docker container to sifaka for SMART disk health monitoring
- Formalize existing `node_exporter` container under Ansible management
- Route both exporters through Caddy L4 TCP proxy (`nas.ops.eblu.me:9100`, `nas.ops.eblu.me:9633`), replacing the hardcoded LAN IP in Prometheus
- Create "Sifaka Disk Health" Grafana dashboard (health status, temperature, wear indicators, lifetime)
- Introduce `ansible/playbooks/sifaka.yml` and `mise run provision-sifaka` — first Ansible playbook for the NAS
- Shared exporter port variables in `group_vars/all.yml` to avoid duplication between Caddy and sifaka roles

## Prerequisites before deploy
- [ ] Enable SSH on sifaka (DSM Control Panel > Terminal & SNMP)
- [ ] Verify `ssh eblume@sifaka 'docker ps'` works
- [ ] Run `mise run provision-sifaka` to deploy containers
- [ ] Run `mise run provision-indri -- --tags caddy` to add L4 routes
- [ ] `argocd app sync prometheus` + `argocd app sync grafana-config`

## Test plan
- [ ] Verify smartctl_exporter metrics: `curl http://nas.ops.eblu.me:9633/metrics`
- [ ] Verify Prometheus targets page shows both sifaka jobs as UP
- [ ] Verify Grafana "Sifaka Disk Health" dashboard loads with data

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Reviewed-on: https://forge.ops.eblu.me/eblume/blumeops/pulls/135
2026-02-09 17:44:05 -08:00

3.1 KiB

title tags
Routing
infrastructure
networking

Service Routing

Services are accessible via three DNS domains with different reachability.

DNS Domains

Domain Proxy Reachable From
*.eblu.me flyio-proxy (Fly.io → Tailscale tunnel) Public internet
*.ops.eblu.me Caddy on indri k8s pods, docker containers, tailnet clients
*.tail8d86e.ts.net Tailscale MagicDNS Tailnet clients only

Use *.ops.eblu.me for services that need pod-to-service communication. Use *.eblu.me for services exposed publicly via Fly.io.

Caddy Services (*.ops.eblu.me)

DNS points to indri's Tailscale IP. TLS via Let's Encrypt (ACME DNS-01 with Gandi).

Service URL Description
Homepage https://go.ops.eblu.me Service dashboard
forgejo https://forge.ops.eblu.me Git hosting (SSH: 2222)
zot https://registry.ops.eblu.me Container registry
grafana https://grafana.ops.eblu.me Dashboards
argocd https://argocd.ops.eblu.me GitOps CD
prometheus https://prometheus.ops.eblu.me Metrics
loki https://loki.ops.eblu.me Logs
miniflux https://feed.ops.eblu.me RSS reader
kiwix https://kiwix.ops.eblu.me Offline Wikipedia
transmission https://torrent.ops.eblu.me BitTorrent
teslamate https://tesla.ops.eblu.me Tesla logger
navidrome https://dj.ops.eblu.me Music streaming
jellyfin https://jellyfin.ops.eblu.me Media server
postgresql pg.ops.eblu.me:5432 Database
[[sifaka Sifaka]] https://nas.ops.eblu.me

Public Services (*.eblu.me)

DNS CNAMEs point to blumeops-proxy.fly.dev. TLS via Fly.io-managed Let's Encrypt. Traffic tunnels back to the homelab over Tailscale. Only services tagged tag:flyio-target are reachable by the proxy — see flyio-proxy for details.

Service URL Description
docs https://docs.eblu.me Documentation site

Tailscale-Only Services

Service URL Description
Kubernetes https://k8s.tail8d86e.ts.net Minikube API

Port Map (Indri)

Port Service Protocol Binding Notes
443 Caddy HTTPS 0.0.0.0 Reverse proxy
2222 Caddy L4 TCP 0.0.0.0 SSH proxy to Forgejo
5432 Caddy L4 TCP 0.0.0.0 PostgreSQL proxy
9100 Caddy L4 TCP 0.0.0.0 Sifaka node_exporter proxy
9633 Caddy L4 TCP 0.0.0.0 Sifaka smartctl_exporter proxy
2200 Forgejo SSH TCP localhost Built-in SSH server
3001 Forgejo HTTP localhost Web UI
5050 Zot HTTP localhost Registry API
8096 Jellyfin HTTP localhost Media server
44491 K8s API HTTPS 0.0.0.0 Minikube API server