blumeops/argocd/manifests/nvidia-device-plugin/kustomization.yaml at 07376cc970b723ff3fb79e04ef4b014d4d3a30cc - eblume/blumeops - Forgejo: Beyond coding. We Forge.

eblume/blumeops

Erich Blume 0d1c2eb81a Deploy Ollama LLM server on ringtail with GPU time-slicing

Add Ollama as a new ArgoCD-managed service on ringtail's k3s cluster:
- Deployment with main ollama container and model-sync sidecar
- Declarative model list (qwen2.5:14b, deepseek-r1:14b, phi4:14b, gemma3:12b)
- hostPath PV on /mnt/storage1/ollama for fast local model storage
- Tailscale ingress at ollama.ops.eblu.me
- Enable GPU time-slicing (replicas: 2) on nvidia-device-plugin so
  Frigate and Ollama can share the RTX 4080

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

2026-03-02 20:01:45 -08:00

13 lines

252 B

YAML

Raw Blame History

 apiVersion: kustomize.config.k8s.io/v1beta1
 kind: Kustomization
 namespace: nvidia-device-plugin
 resources:
   - daemonset.yaml
   - runtime-class.yaml
   - time-slicing-config.yaml
 images:
   - name: nvcr.io/nvidia/k8s-device-plugin
     newTag: v0.18.2