Postgres Failover
Procedure
- Confirm primary is unavailable for 60+ seconds.
- Promote healthiest replica and freeze schema migrations.
- Update connection routing and recycle API pools.
Post-checks
- Write throughput restored
- Replica lag below 3 seconds
Operator field manual: playbooks, runbooks, checklists, and postmortems.