blumeops/argocd
Erich Blume 6d4929a66c Add qwen3.5:27b to Ollama and bump memory limit to 22Gi
The 27B Q4_K_M model is ~17 GB, exceeding the 16 GB VRAM on the RTX 4080
by ~1 GB. Ollama will offload a few layers to CPU RAM, so the pod memory
limit needs headroom beyond the previous 16Gi.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-11 18:55:51 -07:00
..
apps Remove unused Mosquitto MQTT broker from ringtail 2026-03-11 18:37:31 -07:00
manifests Add qwen3.5:27b to Ollama and bump memory limit to 22Gi 2026-03-11 18:55:51 -07:00