Add Ollama as a new ArgoCD-managed service on ringtail's k3s cluster: - Deployment with main ollama container and model-sync sidecar - Declarative model list (qwen2.5:14b, deepseek-r1:14b, phi4:14b, gemma3:12b) - hostPath PV on /mnt/storage1/ollama for fast local model storage - Tailscale ingress at ollama.ops.eblu.me - Enable GPU time-slicing (replicas: 2) on nvidia-device-plugin so Frigate and Ollama can share the RTX 4080 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
13 lines
252 B
YAML
13 lines
252 B
YAML
apiVersion: kustomize.config.k8s.io/v1beta1
|
|
kind: Kustomization
|
|
|
|
namespace: nvidia-device-plugin
|
|
|
|
resources:
|
|
- daemonset.yaml
|
|
- runtime-class.yaml
|
|
- time-slicing-config.yaml
|
|
|
|
images:
|
|
- name: nvcr.io/nvidia/k8s-device-plugin
|
|
newTag: v0.18.2
|