orion/monitoring/prometheus.yml
Samir Boulahtit 4bce16fb73
feat(infra): add alerting, network segmentation, and ops docs (Steps 19-24)
- Prometheus alert rules (host, container, API, Celery, target-down)
- Alertmanager with email routing (critical 1h, warning 4h repeat)
- Docker network segmentation (frontend/backend/monitoring)
- Incident response runbook with 8 copy-paste runbooks
- Environment variables reference (55+ vars documented)
- Hetzner setup docs updated with Steps 19-24
- Launch readiness updated with Feb 2026 infrastructure status

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-15 22:06:54 +01:00


# Prometheus configuration for Orion platform
# Docs: https://prometheus.io/docs/prometheus/latest/configuration/configuration/
global:
  scrape_interval: 15s
  evaluation_interval: 15s

# ─── Alerting ────────────────────────────────────────────────────────────
alerting:
  alertmanagers:
    - static_configs:
        - targets: ["alertmanager:9093"]

rule_files:
  - /etc/prometheus/alert.rules.yml

# ─── Scrape Configs ─────────────────────────────────────────────────────
scrape_configs:
  # Orion API — /metrics endpoint (prometheus_client)
  - job_name: "orion-api"
    metrics_path: /metrics
    static_configs:
      - targets: ["api:8000"]
        labels:
          service: "orion-api"

  # Node Exporter — host-level CPU, RAM, disk metrics
  - job_name: "node-exporter"
    static_configs:
      - targets: ["node-exporter:9100"]
        labels:
          service: "node-exporter"

  # cAdvisor — per-container resource metrics
  - job_name: "cadvisor"
    static_configs:
      - targets: ["cadvisor:8080"]
        labels:
          service: "cadvisor"

  # Prometheus self-monitoring
  - job_name: "prometheus"
    static_configs:
      - targets: ["localhost:9090"]
        labels:
          service: "prometheus"

  # Alertmanager
  - job_name: "alertmanager"
    static_configs:
      - targets: ["alertmanager:9093"]
        labels:
          service: "alertmanager"
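
# Illustrative sketch only (not part of this file): the contents of
# /etc/prometheus/alert.rules.yml are not shown here. Assuming the
# "target-down" rule named in the commit message is built on the `up`
# metric that Prometheus records for every scrape target, a minimal rule
# might look like the commented example below. The group name, alert name,
# `for` duration, severity value, and annotation text are all hypothetical.
#
# groups:
#   - name: target-down
#     rules:
#       - alert: TargetDown
#         expr: up == 0
#         for: 2m
#         labels:
#           severity: critical
#         annotations:
#           summary: "{{ $labels.job }} target {{ $labels.instance }} is down"
#
# A `severity` label like this is what Alertmanager routes would typically
# match on to apply different repeat intervals for critical and warning alerts.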