Add operations reference cards
- observability - links to prometheus, loki, alloy, grafana - backup - links to borgmatic, sifaka, backup policy - disaster-recovery - TBD, links to borgmatic, 1password, forgejo Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
parent
6f8fb33122
commit
10b062051d
4 changed files with 58 additions and 0 deletions
|
|
@ -56,3 +56,11 @@ Network storage and backup configuration.
|
|||
- [[storage/sifaka|Sifaka]] - Synology NAS configuration
|
||||
- [[storage/postgresql|PostgreSQL]] - Database cluster
|
||||
- [[storage/backups|Backups]] - Backup policy and schedule
|
||||
|
||||
## Operations
|
||||
|
||||
Operational concerns and their components.
|
||||
|
||||
- [[operations/observability|Observability]] - Metrics, logs, dashboards
|
||||
- [[operations/backup|Backup]] - Data protection
|
||||
- [[operations/disaster-recovery|Disaster Recovery]] - Recovery procedures (TBD)
|
||||
|
|
|
|||
15
docs/reference/operations/backup.md
Normal file
15
docs/reference/operations/backup.md
Normal file
|
|
@ -0,0 +1,15 @@
|
|||
---
|
||||
title: Backup
|
||||
tags:
|
||||
- operations
|
||||
---
|
||||
|
||||
# Backup
|
||||
|
||||
Daily automated backups of BlumeOps data.
|
||||
|
||||
## Components
|
||||
|
||||
- [[services/borgmatic|Borgmatic]] - Backup orchestration
|
||||
- [[storage/sifaka|Sifaka]] - Backup target (NAS)
|
||||
- [[storage/backups|Backup Policy]] - What gets backed up and retention
|
||||
19
docs/reference/operations/disaster-recovery.md
Normal file
19
docs/reference/operations/disaster-recovery.md
Normal file
|
|
@ -0,0 +1,19 @@
|
|||
---
|
||||
title: Disaster Recovery
|
||||
tags:
|
||||
- operations
|
||||
---
|
||||
|
||||
# Disaster Recovery
|
||||
|
||||
TBD. Current state:
|
||||
|
||||
- [[services/borgmatic|Borgmatic]] provides daily backups to [[storage/sifaka|Sifaka]]
|
||||
- Infrastructure can be rebootstrapped using the blumeops repo
|
||||
- Detailed DR procedures not yet documented
|
||||
|
||||
## Components
|
||||
|
||||
- [[services/borgmatic|Borgmatic]] - Backup restoration
|
||||
- [[services/1password|1Password]] - Credential recovery
|
||||
- [[services/forgejo|Forgejo]] - Source of truth for infrastructure code
|
||||
16
docs/reference/operations/observability.md
Normal file
16
docs/reference/operations/observability.md
Normal file
|
|
@ -0,0 +1,16 @@
|
|||
---
|
||||
title: Observability
|
||||
tags:
|
||||
- operations
|
||||
---
|
||||
|
||||
# Observability
|
||||
|
||||
Metrics, logs, and dashboards for BlumeOps infrastructure.
|
||||
|
||||
## Components
|
||||
|
||||
- [[services/prometheus|Prometheus]] - Metrics storage and querying
|
||||
- [[services/loki|Loki]] - Log aggregation
|
||||
- [[services/alloy|Alloy]] - Metrics and log collection
|
||||
- [[services/grafana|Grafana]] - Dashboards and visualization
|
||||
Loading…
Add table
Add a link
Reference in a new issue