Teleport Cloud outage us-west-2

Incident Report for Teleport Cloud

Postmortem

On January 6, 2026 at 18:24 UTC, approximately 24% of Teleport Cloud customers experienced an interruption to Teleport Auth and/or Proxy services for up to 24 minutes.

The incident was caused by two concurrent infrastructure events that compounded each other. A resource exhaustion issue on multiple nodes temporarily disrupted scheduling. At the same time, an unrelated scaling event interrupted a backend service responsible for refreshing cloud credentials used by Teleport Auth, which led to authentication failures.

Once the affected infrastructure was removed, service recovery began immediately. Backend services were restored, credentials were refreshed, and impacted Teleport services gradually returned to a healthy state.

We are making changes to improve isolation between critical services, enforce stricter resource limits, and ensure that a single infrastructure disruption cannot prevent credential refresh or delay service recovery in the future.

Posted Jan 12, 2026 - 16:57 UTC

Resolved

The platform has recovered and is fully operational.
Posted Jan 06, 2026 - 19:01 UTC

Monitoring

We've identified the cause and have remediated the issue. We're monitoring the system for full recovery.
Posted Jan 06, 2026 - 18:42 UTC

Investigating

We're currently investigating an increase in errors in us-west-2.
Posted Jan 06, 2026 - 18:36 UTC
This incident affected: Cloud Service.