diff --git a/docs/reference/index.md b/docs/reference/index.md
index 57ee816..a4b211b 100644
--- a/docs/reference/index.md
+++ b/docs/reference/index.md
@@ -56,3 +56,11 @@ Network storage and backup configuration.
 - [[storage/sifaka|Sifaka]] - Synology NAS configuration
 - [[storage/postgresql|PostgreSQL]] - Database cluster
 - [[storage/backups|Backups]] - Backup policy and schedule
+
+## Operations
+
+Operational concerns and their components.
+
+- [[operations/observability|Observability]] - Metrics, logs, dashboards
+- [[operations/backup|Backup]] - Data protection
+- [[operations/disaster-recovery|Disaster Recovery]] - Recovery procedures (TBD)
diff --git a/docs/reference/operations/backup.md b/docs/reference/operations/backup.md
new file mode 100644
index 0000000..da79caa
--- /dev/null
+++ b/docs/reference/operations/backup.md
@@ -0,0 +1,15 @@
+---
+title: Backup
+tags:
+ - operations
+---
+
+# Backup
+
+Daily automated backups of BlumeOps data.
+
+## Components
+
+- [[services/borgmatic|Borgmatic]] - Backup orchestration
+- [[storage/sifaka|Sifaka]] - Backup target (NAS)
+- [[storage/backups|Backup Policy]] - What gets backed up and retention
diff --git a/docs/reference/operations/disaster-recovery.md b/docs/reference/operations/disaster-recovery.md
new file mode 100644
index 0000000..e110a30
--- /dev/null
+++ b/docs/reference/operations/disaster-recovery.md
@@ -0,0 +1,19 @@
+---
+title: Disaster Recovery
+tags:
+ - operations
+---
+
+# Disaster Recovery
+
+TBD. Current state:
+
+- [[services/borgmatic|Borgmatic]] provides daily backups to [[storage/sifaka|Sifaka]]
+- Infrastructure can be rebootstrapped using the blumeops repo
+- Detailed DR procedures not yet documented
+
+## Components
+
+- [[services/borgmatic|Borgmatic]] - Backup restoration
+- [[services/1password|1Password]] - Credential recovery
+- [[services/forgejo|Forgejo]] - Source of truth for infrastructure code
diff --git a/docs/reference/operations/observability.md b/docs/reference/operations/observability.md
new file mode 100644
index 0000000..388ae47
--- /dev/null
+++ b/docs/reference/operations/observability.md
@@ -0,0 +1,16 @@
+---
+title: Observability
+tags:
+ - operations
+---
+
+# Observability
+
+Metrics, logs, and dashboards for BlumeOps infrastructure.
+
+## Components
+
+- [[services/prometheus|Prometheus]] - Metrics storage and querying
+- [[services/loki|Loki]] - Log aggregation
+- [[services/alloy|Alloy]] - Metrics and log collection
+- [[services/grafana|Grafana]] - Dashboards and visualization