blumeops/argocd/manifests
Erich Blume c281fb5403 Add OpenTelemetry distributed tracing (Tempo + Beyla eBPF) (#286)
## Summary

Adds the third observability pillar — **distributed tracing** — alongside existing metrics (Prometheus) and logs (Loki).

- **Grafana Tempo 2.10.1** on minikube-indri for trace storage with 7d retention, OTLP receivers, and a `metrics_generator` that remote-writes span metrics (RED: rate, errors, duration) to Prometheus
- **Beyla eBPF auto-instrumentation** via a privileged Alloy DaemonSet on ringtail — instruments HTTP services (Frigate, ntfy, Ollama, Immich) without code changes
- **Grafana integration** — Tempo datasource with trace↔log and trace↔metrics correlation, plus Loki derivedFields for trace ID linking
- **Prometheus** scrapes Tempo operational metrics
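
As a rough illustration of the Tempo bullet above, a minimal config sketch might look like the following. It is not the file shipped in `argocd/manifests/tempo/`. The storage paths, the local backend, and the Prometheus remote-write URL are assumptions made for the example. Only the 7d retention, the OTLP receivers, and the span-metrics generator come from the summary.

```yaml
# Hypothetical tempo.yaml sketch; paths and URLs are illustrative assumptions.
server:
  http_listen_port: 3200            # same port used by the /ready check in the test plan

distributor:
  receivers:
    otlp:
      protocols:
        grpc:
          endpoint: 0.0.0.0:4317    # standard OTLP gRPC port
        http:
          endpoint: 0.0.0.0:4318    # standard OTLP HTTP port

compactor:
  compaction:
    block_retention: 168h           # 7d retention

storage:
  trace:
    backend: local                  # assumption: local disk rather than object storage
    local:
      path: /var/tempo/blocks
    wal:
      path: /var/tempo/wal

metrics_generator:
  storage:
    path: /var/tempo/generator/wal
    remote_write:
      - url: http://prometheus.monitoring.svc:9090/api/v1/write   # assumed service address

overrides:
  defaults:
    metrics_generator:
      processors: [span-metrics]    # produces the RED metrics
```

For the remote write to land, Prometheus has to accept it, typically by running with `--web.enable-remote-write-receiver`.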

### Architecture

```
ringtail (k3s)                                indri (minikube)
┌──────────────────────┐                      ┌─────────────────────┐
│ Alloy+Beyla (eBPF)   │──OTLP HTTP────────→ │ Tempo               │
│  ↳ Frigate, ntfy,    │  via tailnet         │  ↳ trace storage    │
│    Ollama, Immich     │                      │  ↳ RED → Prometheus │
└──────────────────────┘                      │                     │
                                              │ Grafana             │
                                              │  ↳ Tempo datasource │
                                              └─────────────────────┘
```

### New files (12)
- `docs/reference/services/tempo.md` — reference doc
- `docs/changelog.d/feature-otel-tracing.feature.md`
- `argocd/apps/tempo.yaml` + `argocd/manifests/tempo/` (6 files)
- `argocd/apps/alloy-tracing-ringtail.yaml` + `argocd/manifests/alloy-tracing-ringtail/` (4 files)
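
For orientation, the privileged Alloy DaemonSet could be shaped roughly like the sketch below. The resource names, namespace, labels, ConfigMap name, and unpinned image are all hypothetical, and `hostPID: true` is an assumption based on what eBPF instrumentation of host processes generally needs. The actual manifests in `argocd/manifests/alloy-tracing-ringtail/` may differ.

```yaml
# Hypothetical DaemonSet sketch; every name here is illustrative.
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: alloy-tracing
  namespace: monitoring
spec:
  selector:
    matchLabels:
      app: alloy-tracing
  template:
    metadata:
      labels:
        app: alloy-tracing
    spec:
      hostPID: true                  # assumption: lets Beyla see processes in other pods
      containers:
        - name: alloy
          image: grafana/alloy       # tag omitted; pin a version in a real manifest
          args:
            - run
            - /etc/alloy/config.alloy
          securityContext:
            privileged: true         # needed to load eBPF programs
          volumeMounts:
            - name: config
              mountPath: /etc/alloy
      volumes:
        - name: config
          configMap:
            name: alloy-tracing-config   # hypothetical ConfigMap holding config.alloy
```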

### Modified files (6)
- `argocd/manifests/grafana/datasources.yaml` — Tempo datasource + Loki derivedFields
- `argocd/manifests/prometheus/prometheus.yml` — Tempo scrape target
- `service-versions.yaml` — tempo + alloy-tracing-ringtail entries
- `docs/reference/services/grafana.md` — Tempo in datasources table
- `docs/reference/reference.md` — Tempo in services index
- `docs/reference/operations/observability.md` — Tempo in components list
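
The datasource change can be pictured with a provisioning sketch along these lines. The `uid` values, the URLs, and the `trace_id=` log pattern are assumptions for illustration. The real `datasources.yaml` may use different names and matchers.

```yaml
# Hypothetical Grafana provisioning sketch; uids, URLs, and the regex are assumptions.
apiVersion: 1
datasources:
  - name: Tempo
    type: tempo
    uid: tempo
    access: proxy
    url: http://tempo.monitoring.svc:3200
    jsonData:
      tracesToLogsV2:
        datasourceUid: loki          # trace -> log correlation
      tracesToMetrics:
        datasourceUid: prometheus    # trace -> metrics correlation
  - name: Loki
    type: loki
    uid: loki
    access: proxy
    url: http://loki.monitoring.svc:3100
    jsonData:
      derivedFields:
        - name: TraceID
          matcherRegex: 'trace_id=(\w+)'   # assumed log format
          datasourceUid: tempo             # internal link to the Tempo datasource
          url: '$${__value.raw}'           # "$$" escapes "$" in provisioning files
```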

## Deployment and Testing

- [ ] Sync the `apps` app to pick up the new Application definitions
- [ ] `argocd app set tempo --revision feature/otel-tracing && argocd app sync tempo`
- [ ] Verify Tempo pod: `kubectl --context=minikube-indri get pods -n monitoring -l app=tempo`
- [ ] Verify Tempo is ready: port-forward port 3200 and run `curl localhost:3200/ready`
- [ ] Verify Tailscale ingresses: `kubectl --context=minikube-indri get ingress -n monitoring`
- [ ] `argocd app set alloy-tracing-ringtail --revision feature/otel-tracing && argocd app sync alloy-tracing-ringtail`
- [ ] Check Beyla discovery in alloy-tracing logs on ringtail
- [ ] Sync grafana-config for updated datasources
- [ ] Sync prometheus for updated scrape config
- [ ] Test Grafana Tempo datasource connection
- [ ] Generate test traffic and search traces in Grafana Explore → Tempo
- [ ] After merge: reset all ArgoCD app revisions back to main

Reviewed-on: #286
2026-03-05 10:51:07 -08:00

| Name | Last commit | Date |
|---|---|---|
| 1password-connect | Fix 1Password Connect credentials for chart 2.3.0 | 2026-02-13 17:30:45 -08:00 |
| alloy-k8s | Add multi-cluster observability with ringtail metrics and dashboards (#270) | 2026-02-25 22:01:00 -08:00 |
| alloy-ringtail | Add multi-cluster observability with ringtail metrics and dashboards (#270) | 2026-02-25 22:01:00 -08:00 |
| alloy-tracing-ringtail | Add OpenTelemetry distributed tracing (Tempo + Beyla eBPF) (#286) | 2026-03-05 10:51:07 -08:00 |
| argocd | Add ExternalSecret default fields to prevent ArgoCD drift | 2026-03-05 09:11:23 -08:00 |
| authentik | Add Authentik OIDC login for ArgoCD (#284) | 2026-03-05 09:07:25 -08:00 |
| cloudnative-pg | Port CloudNative-PG off Helm to direct release manifest (#268) | 2026-02-25 17:37:53 -08:00 |
| cv | Expose Forgejo publicly at forge.eblu.me (#278) | 2026-03-03 08:40:41 -08:00 |
| databases | Pin blumeops-pg to PostgreSQL 18.3 | 2026-02-27 16:25:32 -08:00 |
| devpi | Add kustomize images: and configMapGenerator: across services (#264) | 2026-02-24 14:25:19 -08:00 |
| docs | Expose Forgejo publicly at forge.eblu.me (#278) | 2026-03-03 08:40:41 -08:00 |
| external-secrets | Add External Secrets Operator with 1Password Connect (#66) (#66) | 2026-01-28 19:30:10 -08:00 |
| forgejo-runner | Route runner polling through internal forge.ops.eblu.me | 2026-03-03 10:33:40 -08:00 |
| frigate | Fix Frigate database path to use persistent volume | 2026-03-03 15:18:16 -08:00 |
| grafana | Add OpenTelemetry distributed tracing (Tempo + Beyla eBPF) (#286) | 2026-03-05 10:51:07 -08:00 |
| grafana-config | Add OpenTelemetry distributed tracing (Tempo + Beyla eBPF) (#286) | 2026-03-05 10:51:07 -08:00 |
| homepage | Add Gandi bookmark to homepage dashboard | 2026-03-03 13:05:50 -08:00 |
| immich | Fix mirror org refs in ArgoCD apps and widen credential template (#266) | 2026-02-25 06:55:53 -08:00 |
| kiwix | Pin kiwix-serve image tag to v3.8.2-f6f0f79 | 2026-03-05 08:17:40 -08:00 |
| kube-state-metrics | Add kustomize images: and configMapGenerator: across services (#264) | 2026-02-24 14:25:19 -08:00 |
| kube-state-metrics-ringtail | Add multi-cluster observability with ringtail metrics and dashboards (#270) | 2026-02-25 22:01:00 -08:00 |
| loki | Update Loki to v3.6.5-3dc4ed7 container image | 2026-03-03 13:01:49 -08:00 |
| miniflux | Update homepage, navidrome, ntfy, miniflux image tags after mirror migration | 2026-02-24 21:06:08 -08:00 |
| mosquitto | Add kustomize images: and configMapGenerator: across services (#264) | 2026-02-24 14:25:19 -08:00 |
| navidrome | Update homepage, navidrome, ntfy, miniflux image tags after mirror migration | 2026-02-24 21:06:08 -08:00 |
| ntfy | Update homepage, navidrome, ntfy, miniflux image tags after mirror migration | 2026-02-24 21:06:08 -08:00 |
| nvidia-device-plugin | Deploy Ollama LLM server on ringtail (#277) | 2026-03-02 20:39:51 -08:00 |
| ollama | Add qwen3.5:9b to Ollama model list | 2026-03-04 19:49:39 -08:00 |
| prometheus | Add OpenTelemetry distributed tracing (Tempo + Beyla eBPF) (#286) | 2026-03-05 10:51:07 -08:00 |
| tailscale-operator | Expose Forgejo publicly at forge.eblu.me (#278) | 2026-03-03 08:40:41 -08:00 |
| tailscale-operator-base | Add kustomize images: and configMapGenerator: across services (#264) | 2026-02-24 14:25:19 -08:00 |
| tailscale-operator-ringtail | Deploy Tailscale operator on ringtail k3s cluster (#215) | 2026-02-19 09:33:05 -08:00 |
| tempo | Add OpenTelemetry distributed tracing (Tempo + Beyla eBPF) (#286) | 2026-03-05 10:51:07 -08:00 |
| teslamate | Update teslamate to v3.0.0-eb9bc57 container image | 2026-03-03 12:02:26 -08:00 |
| torrent | Pin transmission-exporter image tag to v1.0.1-c93448f | 2026-03-05 08:05:17 -08:00 |