Multi-service health matrix

Realtime visibility across all production services

Filter by severity and region to understand how localized incidents impact your tenants. Subscribe to get alerted when service levels change.

Updated
Live data refreshed 2 minutes ago
Filters syncing with webhook subscriptions and custom alerts.
Service
US-EastVirginia & Ohio
EU-WestIreland & Frankfurt
APACSingapore

API Gateway

Ingress routing layer for REST and GraphQL requests.

mission-critical

Owner: SRE Platform

Operational

Serving 52k req/s

Updated 5m ago
Rate limited

Burst limits applied while we rebalance traffic.

Updated 2m ago
Operational

Latency ±4% of baseline

Updated 5m ago

Billing Service

Handles invoicing, subscriptions, and metered usage.

financial

Owner: FinOps

Delayed

Usage aggregation running 18 minutes behind schedule.

Updated 7m ago
Delayed

Invoice rendering paused while we patch the worker pool.

Updated 7m ago
Operational
Updated 7m ago

Edge CDN

Static asset and media delivery network.

Owner: Edge Reliability

Operational
Updated 3m ago
Operational
Updated 3m ago
Operational
Updated 3m ago

Workflow Automations

Background job engine powering playbooks and data syncs.

queueingasync

Owner: Automation Core

Outage

Jobs paused while we remediate queue corruption.

Updated 1m ago
Read-only

New automations paused; existing jobs draining.

Updated 1m ago
Lagging

Processing backlog at 72% throughput.

Updated 1m ago
Filter regions
Incident feed
Latest updates from the global operations team.
Mitigation09:12 UTC

Automation queues paused in US-East

Engineers replaying failed jobs after isolating corrupted payloads. Expect progress update in 15 minutes.

Investigation08:46 UTC

Billing usage aggregation running behind

Worker pool scaled up 3x while we evaluate anomalous invoices.

Mitigation08:30 UTC

API gateway traffic shifted from EU-West to US-East

Temporary routing adjustments while we replace faulty load balancer in EU-West.

Upcoming maintenance
Mar 08 — 02:00 UTCPlanned cache shard expansion (EU-West)Mar 10 — 05:00 UTCWorkflow automation failover test (APAC)
Request proactive outreach
Enterprise account team will follow up within the hour.
Call my account teamEscalate urgent issues with direct SRE support.Request executive briefingSchedule a stakeholder-ready summary deck.

Subscribe to matrix changes

Receive email summaries whenever severity shifts for selected services.

Subscribe to updates