Localize the Tailscale operator stack (k8s-operator + indri ProxyClass) (#374)
Weekly non-local-container task: localize the Tailscale operator stack on **both clusters**. ## What - **`containers/tailscale-operator/`** (new) — builds `cmd/k8s-operator` v1.94.2 from the forge mirror, mirroring upstream's mkctr recipe (`/usr/local/bin/operator`, `ts_kube,ts_package_container` go tags, version stamps). `container.py` (dagger) for indri/arm64; `default.nix` for ringtail/amd64. - **`containers/tailscale/container.py`** (new) — dagger/arm64 build of the proxy image (containerboot), mirroring the upstream Dockerfile (iptables-legacy symlinks, `/tailscale/run.sh` compat). Ringtail already consumes the existing nix build; this completes parity for indri. - **Version pinned at v1.94.2** (same as currently deployed) — this PR is a pure supply-chain swap, no version change. v1.96.x is avoided deliberately (MagicDNS-in-containers regression). - Docs-first: tailscale-operator card gains **Local Images** and **Rollout Safety** sections. ## Rollout plan (after image builds) 1. Manifest commit: per-overlay `images:` override for the operator + ProxyClass strategic-merge patch on indri (kustomize `images:` can't touch CR fields). 2. `argocd app set tailscale-operator --revision <branch> && argocd app sync` — indri first, verify, then ringtail. 3. **Shadow-device safety**: device identity lives in the tailscale state Secrets; an image swap re-uses existing node keys, so no `-1` clones. State Secrets are not touched. Post-sync verification: pod health, device names unchanged, `mise run services-check`. ## Follow-ups (not this PR) - `dnsconfig` nameserver image (`tailscale/k8s-nameserver:stable`) still upstream. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Reviewed-on: #374
This commit is contained in:
parent
b24fd147ac
commit
d03ed337a9
8 changed files with 307 additions and 12 deletions
|
|
@ -1,7 +1,7 @@
|
|||
---
|
||||
title: Tailscale Operator
|
||||
modified: 2026-06-08
|
||||
last-reviewed: 2026-06-08
|
||||
modified: 2026-06-09
|
||||
last-reviewed: 2026-06-09
|
||||
tags:
|
||||
- kubernetes
|
||||
- tailscale
|
||||
|
|
@ -22,10 +22,42 @@ The Tailscale operator enables Kubernetes services to be exposed directly on the
|
|||
The operator runs on **both** clusters — indri's minikube and ringtail's k3s.
|
||||
Both apps layer on the shared `tailscale-operator-base` kustomize directory
|
||||
(operator manifest, `ProxyClass`, `dnsconfig`); each cluster supplies its own
|
||||
`ProxyGroup` (indri: 2 replicas, ringtail: 1) and OAuth `ExternalSecret`. The
|
||||
ringtail overlay additionally rewrites the proxy image to a locally nix-built
|
||||
mirror. See [[ringtail]] and [[migrate-wave1-ringtail]] for the ongoing
|
||||
migration of k8s workloads onto ringtail.
|
||||
`ProxyGroup` (indri: 2 replicas, ringtail: 1) and OAuth `ExternalSecret`. See
|
||||
[[ringtail]] and [[migrate-wave1-ringtail]] for the ongoing migration of k8s
|
||||
workloads onto ringtail.
|
||||
|
||||
## Local Images
|
||||
|
||||
Both the operator and the proxy run locally-built images from the forge
|
||||
mirror (`mirrors/tailscale`), not Docker Hub:
|
||||
|
||||
| Image | Build | Used by |
|
||||
|-------|-------|---------|
|
||||
| `blumeops/tailscale-operator` | `containers/tailscale-operator/` (`container.py` for indri/arm64, `default.nix` `-nix` tag for ringtail/amd64) | operator Deployment, via each overlay's `images:` override |
|
||||
| `blumeops/tailscale` | `containers/tailscale/` (same dual build) | `ProxyClass` proxy pods, via a strategic-merge patch in each overlay |
|
||||
|
||||
The ProxyClass image must be set with a **patch**, not kustomize's `images:`
|
||||
directive — that directive only rewrites standard container fields, not
|
||||
custom-resource fields like `ProxyClass.spec.statefulSet.pod.tailscaleContainer.image`.
|
||||
|
||||
The `dnsconfig` nameserver image (`tailscale/k8s-nameserver:stable`) is still
|
||||
upstream — a known follow-up.
|
||||
|
||||
## Rollout Safety (device identity)
|
||||
|
||||
Proxy and operator tailnet identity lives in Kubernetes state Secrets in the
|
||||
`tailscale` namespace, not in pods or images. An image swap rolls the
|
||||
Deployment/StatefulSets but pods re-authenticate with their existing node
|
||||
keys — devices keep their names. Shadow devices (`foo-1` suffixes) appear only
|
||||
when a pod registers *fresh* while a stale device record still holds the name
|
||||
(deleted state Secrets, cluster rebuilds). When rolling out image changes:
|
||||
|
||||
1. Never delete the `tailscale` namespace state Secrets.
|
||||
2. Verify after sync: pods healthy, device names unchanged in the admin
|
||||
console, `mise run services-check` green.
|
||||
3. If a collision does occur: delete the stale device in the admin console
|
||||
AND the affected state Secret, then restart the pod (see
|
||||
[[rebuild-minikube-cluster]]).
|
||||
|
||||
## How It Works
|
||||
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue