Fix ansible handler timeouts for alloy and loki restarts #12

Merged
eblume merged 4 commits from fix/ansible-handler-timeouts into main 2026-01-15 13:56:12 -08:00

4 commits

Author SHA1 Message Date
c3b8362d8b Fix Loki dashboard metrics
- Change "Chunks in Memory" to "Total Stored" using loki_ingester_chunk_stored_bytes_total
  (loki_ingester_memory_chunks_bytes doesn't exist)
- Update "Memory Usage Over Time" to "Storage Over Time" with correct metric

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-15 13:23:33 -08:00
d1f5315c4b Remove node_exporter stop task from alloy role
Migration complete - no longer need to stop node_exporter on each run.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-15 13:13:09 -08:00
ad50a08008 Optimize kiwix torrent status checks
- Fetch transmission list once and parse locally instead of calling
  transmission-remote for each archive (43 calls -> 1 call)
- Only recheck torrent status if new torrents were actually added
- Use set_fact to merge initial/recheck status for downstream tasks

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-15 13:11:31 -08:00
acff29f554 Fix ansible handler timeouts for alloy and loki restarts
Use async with poll: 0 to fire-and-forget service restarts.
These services have graceful shutdown periods that can exceed
ansible's default command timeout.

Also update transmission_metrics and devpi_metrics to depend on
alloy instead of node_exporter, since alloy now handles the
textfile collector.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-15 13:06:17 -08:00