znetsixe 4117ec6063 feat(cloud): single-shot deploy.sh + FROST stack + healthchecks
Stage 5 — make the cloud composition spin up in one command and add
the SensorThings (FROST) stack as a fully segregated tenant.

cloud/deploy.sh — idempotent, 7-step bring-up:
  preflight → validate → up + wait → cert state → issue/renew →
  service status → endpoint smoke test. Reissues LE cert only when
  current issuer no longer matches ACME_CA_URI. Move-aside-then-
  restore-on-failure so the bootstrap cert survives a failed certbot.

stacks/frost — new stack, segregated from shared sql/rabbitmq:
  - dedicated postgis container (frost-db)
  - dedicated internal mosquitto bus (frost-mosquitto)
  - frost-http + frost-mqtt on a private frost-internal network,
    joined to cloud-app only for nginx ingress at frost.wbd-rd.nl
  - shared mosquitto stack deleted; rabbitmq remains the only public
    MQTT broker (mqtt.wbd-rd.nl:8883 via stream proxy)

stacks/sql — pg_isready healthcheck so keycloak/gitea/mlflow can gate
on service_healthy via cloud-level depends_on overrides.

stacks/nginx-proxy:
  - nginx-init service generates a self-signed bootstrap cert on
    fresh deploy so nginx starts before certbot has issued a real one
  - frost.wbd-rd.nl vhost (/FROST-Server → frost-http:8080,
    /mqtt → frost-mqtt:9876 WebSocket)

stacks/mlflow — custom Dockerfile (upstream + psycopg2-binary) so the
official image can speak to the shared sql backend.

stacks/jupyterhub — DummyAuthenticator stub gated by
JUPYTERHUB_ADMIN_PASSWORD; TODO comments point at OIDC + DockerSpawner.

stacks/rabbitmq — config/{enabled_plugins,rabbitmq.conf} stubs
(management + mqtt plugins, MQTT auth required).

stacks/portainer — ports unpublished; nginx now the only ingress.

stacks/node-red — pin to 4.1 (the floating "4" tag does not exist).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-21 16:37:58 +02:00

sql

Central configuration database — the "single point of config" backing Keycloak, Gitea, MLflow, and any stack that needs a relational store. Cloud-only.

  • Engine: postgres 16-alpine
  • Network: data only (no internet egress)
  • Volume: sql-data (PGDATA)
  • Init scripts: config/init.d/*.sh run once, on first container start

Per-app databases

On first start, config/init.d/01-databases.sh provisions:

Database    Owner role    Used by
gitea       gitea         gitea stack
keycloak    keycloak      keycloak stack
mlflow      mlflow        mlflow stack

Passwords come from the env vars GITEA_DB_PASSWORD, KEYCLOAK_DB_PASSWORD, and MLFLOW_DB_PASSWORD, which must be set in the cloud .env before first start.
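Because the init script only runs once, a missing password is easiest to catch before the first start. The following is a minimal sketch of such a check, not something this stack ships: the function name missing_db_passwords is hypothetical, and only the three variable names are taken from this README.

```shell
# Hypothetical preflight helper (not part of this repo).
# Prints the names of the DB password variables that are unset or empty,
# and returns non-zero if any are missing.
missing_db_passwords() {
  missing=""
  for name in GITEA_DB_PASSWORD KEYCLOAK_DB_PASSWORD MLFLOW_DB_PASSWORD; do
    # Look up the value of the variable whose name is stored in $name.
    # eval is safe here because the names come from the fixed list above.
    eval "value=\${$name:-}"
    if [ -z "$value" ]; then
      missing="${missing:+$missing }$name"
    fi
  done
  printf '%s\n' "$missing"
  [ -z "$missing" ]
}

# Example use, after the variables have been loaded into the shell:
#   if ! m=$(missing_db_passwords); then
#     echo "Set these in .env before first start: $m" >&2
#     exit 1
#   fi
```

How the variables reach the shell is left open here. Sourcing the cloud .env directly only works if that file happens to be valid shell syntax, which is an assumption.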

Important: init scripts only run when sql-data is empty. Changing the script after first start has no effect until the volume is wiped. To add a new app DB later, connect with psql and create it manually, then update this script for fresh deploys.
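Adding a database by hand could look roughly like the sketch below. It follows the convention from the table above, where the database and its owner role share one name. The helper name new_app_db_sql, the example app grafana, the compose service name sql, and the superuser postgres are all assumptions for illustration and may not match this stack.

```shell
# Hypothetical helper (not part of this repo).
# Prints the SQL to create a login role and a database owned by that role.
# Usage: new_app_db_sql <name> <password>
new_app_db_sql() {
  name=$1
  # Accept only simple lowercase identifiers so quoting stays trivial.
  case "$name" in
    ""|*[!a-z0-9_]*)
      echo "invalid database name: $name" >&2
      return 1
      ;;
  esac
  # Double any single quotes so the password is a valid SQL string literal.
  password=$(printf '%s' "$2" | sed "s/'/''/g")
  printf "CREATE ROLE \"%s\" LOGIN PASSWORD '%s';\n" "$name" "$password"
  printf "CREATE DATABASE \"%s\" OWNER \"%s\";\n" "$name" "$name"
}

# Example use (service name "sql" and superuser "postgres" are assumptions):
#   new_app_db_sql grafana "$GRAFANA_DB_PASSWORD" \
#     | docker compose exec -T sql psql -U postgres -v ON_ERROR_STOP=1
```

Printing the SQL rather than executing it makes it possible to review the statements first, and the same two statements can then be copied into config/init.d/01-databases.sh for fresh deploys.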

Reset / re-init

docker compose down
docker volume rm cloud_sql-data       # ⚠ destroys all data
docker compose up -d
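Since removing the volume is irreversible, one option is to put a typed confirmation in front of it. This is only a sketch: the function name confirm_wipe is hypothetical, and the volume name cloud_sql-data is the one used in the commands above, which depends on the compose project name.

```shell
# Hypothetical guard (not part of this repo).
# Succeeds only if the operator types the exact volume name on stdin.
# Usage: confirm_wipe <volume-name>
confirm_wipe() {
  volume=$1
  printf 'Type the volume name (%s) to confirm the wipe: ' "$volume" >&2
  # Fail if stdin is closed or empty instead of treating it as a yes.
  IFS= read -r answer || return 1
  [ "$answer" = "$volume" ]
}

# Example use:
#   confirm_wipe cloud_sql-data \
#     && docker compose down \
#     && docker volume rm cloud_sql-data \
#     && docker compose up -d
```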

TODO

  • Backup strategy (pg_dump cron sidecar vs streaming replica vs WAL archiving to MinIO)
  • Per-app least-privilege grants (currently each role owns its DB only — fine for now)
  • Monitoring (postgres_exporter for Prometheus when observability stack lands)