Recurring review sweep: 4 doc cards + nvidia-device-plugin v0.19.2 #366

Merged
eblume merged 1 commit from reviews-jun4 into main 2026-06-04 13:37:03 -07:00
8 changed files with 21 additions and 13 deletions

View file

@ -10,4 +10,4 @@ resources:
images:
- name: nvcr.io/nvidia/k8s-device-plugin
newTag: v0.19.0
newTag: v0.19.2

View file

@ -0,0 +1 @@
Reviewed four never-reviewed reference cards (`cluster`, `ntfy`, `tempo`, `alloy`) and corrected drift: minikube is now Kubernetes v1.35.0; ntfy, tempo, and alloy-k8s images are now locally-built `registry.ops.eblu.me/blumeops/*` nix containers (v2.19.2, v2.10.3, v1.16.0) rather than upstream Docker Hub; the Fly.io alloy binary is v1.16.1; and the ringtail workload list reflects the in-progress minikube→k3s migration.

View file

@ -0,0 +1 @@
Upgraded the nvidia-device-plugin on ringtail from v0.19.0 to v0.19.2 (upstream patch release: CDI/Tegra fixes and dependency bumps, no breaking changes for our manifest-based CDI + RuntimeClass setup).

View file

@ -1,6 +1,7 @@
---
title: Cluster
modified: 2026-02-19
modified: 2026-06-04
last-reviewed: 2026-06-04
tags:
- kubernetes
---
@ -15,7 +16,7 @@ BlumeOps runs two Kubernetes clusters: a Minikube cluster on [[indri]] (most ser
|----------|-------|
| **Driver** | docker |
| **Container Runtime** | docker |
| **Kubernetes Version** | v1.34.0 |
| **Kubernetes Version** | v1.35.0 |
| **CPUs** | 6 |
| **Memory** | 11GB |
| **Disk** | 200GB |
@ -41,7 +42,9 @@ Single-node k3s cluster for workloads requiring amd64 or GPU access. See [[ringt
|----------|-------|
| **Context** | `k3s-ringtail` |
| **API Server** | `https://ringtail.tail8d86e.ts.net:6443` |
| **Workloads** | Frigate (GPU), ntfy, frigate-notify, nvidia-device-plugin |
| **Workloads** | GPU workloads (Frigate, Ollama), notifications (ntfy, frigate-notify), [[authentik]], and services migrated off indri minikube (Immich, Mealie, Paperless, TeslaMate). See [[ringtail]] for the authoritative list. |
Services are being progressively migrated from indri's minikube to ringtail's k3s; the split above reflects an in-progress state, not a fixed boundary.
## Related

View file

@ -1,6 +1,7 @@
---
title: Alloy
modified: 2026-03-13
modified: 2026-06-04
last-reviewed: 2026-06-04
tags:
- service
- observability
@ -20,10 +21,10 @@ Unified observability collector for metrics and logs with three deployments:
| **Indri Binary** | `~/.local/bin/alloy` |
| **Indri Config** | `~/.config/grafana-alloy/config.alloy` |
| **K8s Namespace** | `alloy` |
| **K8s Image** | `grafana/alloy:v1.14.0` |
| **K8s Image** | `registry.ops.eblu.me/blumeops/alloy:v1.16.0-9564435` (locally built) |
| **ArgoCD App** | `alloy-k8s` |
| **Fly.io Config** | `fly/alloy.river` |
| **Fly.io Image** | `grafana/alloy:v1.5.1` (binary copied into nginx container) |
| **Fly.io Image** | `grafana/alloy:v1.16.1` (binary copied into nginx container, sha-pinned) |
## Metrics Collected

View file

@ -1,6 +1,7 @@
---
title: Ntfy
modified: 2026-02-17
modified: 2026-06-04
last-reviewed: 2026-06-04
tags:
- service
- notifications
@ -17,7 +18,7 @@ Self-hosted push notification service. Ntfy receives HTTP POST messages and deli
| **URL** | https://ntfy.ops.eblu.me |
| **Tailscale URL** | https://ntfy.tail8d86e.ts.net |
| **Namespace** | `ntfy` |
| **Image** | `binwiederhier/ntfy:v2.17.0` |
| **Image** | `registry.ops.eblu.me/blumeops/ntfy:v2.19.2-fd0bebb-nix` (locally built) |
| **Upstream** | https://github.com/binwiederhier/ntfy |
| **Manifests** | `argocd/manifests/ntfy/` |

View file

@ -1,6 +1,7 @@
---
title: Tempo
modified: 2026-03-05
modified: 2026-06-04
last-reviewed: 2026-06-04
tags:
- service
- observability
@ -18,7 +19,7 @@ Distributed tracing backend for BlumeOps infrastructure. Receives traces via OTL
| **Tailscale URL** | https://tempo.tail8d86e.ts.net |
| **OTLP Endpoint** | https://tempo-otlp.tail8d86e.ts.net |
| **Namespace** | `monitoring` |
| **Image** | `grafana/tempo:2.10.1` |
| **Image** | `registry.ops.eblu.me/blumeops/tempo:v2.10.3-75f9ba4` (locally built) |
| **Storage** | 10Gi PVC (local filesystem) |
| **Retention** | 7 days |

View file

@ -56,8 +56,8 @@ services:
- name: nvidia-device-plugin
type: argocd
last-reviewed: 2026-03-27
current-version: "v0.19.0"
last-reviewed: 2026-06-04
current-version: "v0.19.2"
upstream-source: https://github.com/NVIDIA/k8s-device-plugin/releases
notes: DaemonSet + RuntimeClass on ringtail for GPU workloads