--- title: NVIDIA Device Plugin modified: 2026-03-27 tags: - service - gpu --- # NVIDIA Device Plugin Kubernetes device plugin that exposes NVIDIA GPUs to pods on [[ringtail]]. Required for GPU workloads like [[frigate]] (object detection) and [[ollama]] (LLM inference). ## Quick Reference | Property | Value | |----------|-------| | **Namespace** | `nvidia-device-plugin` | | **Image** | `nvcr.io/nvidia/k8s-device-plugin` | | **Upstream** | https://github.com/NVIDIA/k8s-device-plugin | | **Manifests** | [argocd/manifests/nvidia-device-plugin/](https://forge.eblu.me/eblume/blumeops/src/branch/main/argocd/manifests/nvidia-device-plugin) | ## Architecture Runs as a DaemonSet with `privileged` security context, mounting the host's device-plugins socket, CDI specs, and NVIDIA driver libraries. A `RuntimeClass` named `nvidia` is defined for pods that need GPU access. Time-slicing is configured with 2 replicas per GPU, allowing two pods to share a single physical GPU.