blumeops/argocd/manifests/nvidia-device-plugin/kustomization.yaml
Erich Blume 0d1c2eb81a Deploy Ollama LLM server on ringtail with GPU time-slicing
Add Ollama as a new ArgoCD-managed service on ringtail's k3s cluster:
- Deployment with main ollama container and model-sync sidecar
- Declarative model list (qwen2.5:14b, deepseek-r1:14b, phi4:14b, gemma3:12b)
- hostPath PV on /mnt/storage1/ollama for fast local model storage
- Tailscale ingress at ollama.ops.eblu.me
- Enable GPU time-slicing (replicas: 2) on nvidia-device-plugin so
  Frigate and Ollama can share the RTX 4080

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-02 20:01:45 -08:00

13 lines
252 B
YAML

apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization
namespace: nvidia-device-plugin
resources:
- daemonset.yaml
- runtime-class.yaml
- time-slicing-config.yaml
images:
- name: nvcr.io/nvidia/k8s-device-plugin
newTag: v0.18.2