Deploy Ollama LLM server on ringtail with GPU acceleration and declarative model management