Outage in dbt Cloud

Autocancelation of out-of-memory job runs causing future job run failures

Resolved Minor
December 24, 2024 - Started 10 months ago - Lasted about 7 hours

Outage Details

We're investigating an issue with out-of-memory run pods that is causing intermittent run failures. It has affected run pods in dbt Cloud multi-tenant and multi-cell instances since December 17. The team is working on a resolution, and we will provide updates as soon as new information becomes available.
Latest Updates (most recent first)
RESOLVED 10 months ago - at 12/25/2024 12:40AM

The issue has been resolved, and all affected systems have been functioning normally since 21:30 UTC. Please contact Support by email at support@getdbt.com if you continue to experience delays and are unsure of the root cause.

MONITORING 10 months ago - at 12/24/2024 11:32PM

We have deployed a fix for the issue with canceled run pods that was causing intermittent job run failures. The job scheduler has returned to its normal state, and we are continuing to monitor.

IDENTIFIED 10 months ago - at 12/24/2024 06:59PM

We have identified an issue with canceled run pods that caused job runs to error. A fix is being implemented, and we will provide an update shortly.

INVESTIGATING 10 months ago - at 12/24/2024 06:31PM

We are continuing to investigate this issue. Please note that it affects only AWS environments; Azure instances are not affected.

INVESTIGATING 10 months ago - at 12/24/2024 06:08PM

We're investigating an issue with out-of-memory run pods that is causing intermittent run failures. It has affected run pods in dbt Cloud multi-tenant and multi-cell instances since December 17. The team is working on a resolution, and we will provide updates as soon as new information becomes available.
