Outage in AWS Databricks

ES-1613032

Resolved Minor
October 14, 2025 - Started 3 days ago - Lasted about 10 hours

Incident Report

Starting at approximately 21:00 UTC October 13, 2025, customers may experience delays in starting new job runs or terminating existing runs. Running jobs may not be completed on time.

We are currently investigating. Further updates will be provided in 1 hour, or as events warrant.
Components affected
AWS Databricks Compute Service

Latest Updates (sorted oldest to newest)
3 days ago - at 10/14/2025 06:26AM

Starting at approximately 21:00 UTC October 13, 2025, customers may experience delays in starting new job runs or terminating existing runs. Running jobs may not be completed on time.

We are currently investigating. Further updates will be provided in 1 hour, or as events warrant.

3 days ago - at 10/14/2025 08:28AM

Starting at approximately 21:00 UTC on October 13, 2025, customers may experience failures or delays in starting new job runs or terminating existing runs due to library installation failures on compute resources using Graviton-based instance types. As a result, some running jobs may not complete on time.

Workaround:
Where possible, customers can avoid using Graviton instance types until the issue is fully resolved.

Our engineering team is actively investigating the root cause and working to restore service. Further updates will be provided within 1 hour or as events warrant.
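To apply the workaround, it helps to know which clusters currently use Graviton instance types. The following is a minimal sketch, not an official Databricks tool. It relies on the AWS naming convention in which a "g" among the attribute letters after the generation digit marks a Graviton processor (for example m6gd, c7g, r6gd), with a1 as a special case. The helper names and the sample cluster dictionaries are hypothetical; the node_type_id and driver_node_type_id keys are assumed to match the fields of a cluster specification.

```python
import re

# Heuristic (assumption): AWS instance type names follow
# <family><generation><attributes>.<size>, and a "g" among the attributes
# marks a Graviton processor. GPU families such as g4dn start with "g"
# as the family letter, which this pattern deliberately does not match.
_INSTANCE_RE = re.compile(r"^([a-z]+)(\d+)([a-z-]*)\.[a-z0-9-]+$")


def is_graviton(node_type_id: str) -> bool:
    """Return True if the instance type name appears to be Graviton-based."""
    match = _INSTANCE_RE.match(node_type_id.strip().lower())
    if match is None:
        return False  # unrecognized name; treated as non-Graviton
    family, generation, attributes = match.groups()
    if family == "a" and generation == "1":
        return True  # a1 is the first-generation Graviton family
    return "g" in attributes


def graviton_clusters(clusters):
    """Return names of clusters whose worker or driver type looks like Graviton.

    `clusters` is a list of dicts with hypothetical sample data; in practice
    it would come from whatever cluster inventory is available.
    """
    flagged = []
    for cluster in clusters:
        node_types = (
            cluster.get("node_type_id", ""),
            cluster.get("driver_node_type_id", ""),
        )
        if any(is_graviton(t) for t in node_types if t):
            flagged.append(cluster.get("cluster_name", "<unnamed>"))
    return flagged


if __name__ == "__main__":
    sample = [
        {"cluster_name": "etl-nightly", "node_type_id": "m6gd.xlarge"},
        {"cluster_name": "ml-training", "node_type_id": "g4dn.xlarge"},
        {"cluster_name": "adhoc", "node_type_id": "i3.xlarge"},
    ]
    for name in graviton_clusters(sample):
        print(f"Graviton-based cluster: {name}")
```

Clusters flagged this way are candidates for a temporary switch to a non-Graviton instance type until the incident is resolved. Because the check is a name-based heuristic, confirm the result against the AWS instance type documentation before changing production configurations.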

3 days ago - at 10/14/2025 10:05AM

Starting at approximately 21:00 UTC on October 13, 2025, customers may experience failures or delays in starting new job runs or terminating existing runs due to library installation failures on compute resources using Graviton-based instance types. As a result, some running jobs may not complete on time.

Workaround:
Where possible, customers can avoid using Graviton instance types until the issue is fully resolved.

Our engineering team is actively investigating the root cause and working to restore service. Further updates will be provided within 1 hour or as events warrant.

3 days ago - at 10/14/2025 11:09AM

Starting at approximately 21:00 UTC on October 13, 2025, customers may experience failures or delays in starting new job runs or terminating existing runs due to library installation failures on compute resources using Graviton-based instance types. As a result, some running jobs may not complete on time.

Workaround:
Where possible, customers can avoid using Graviton instance types until the issue is fully resolved.

Our engineering team is actively investigating the root cause and working to restore service. Further updates will be provided within 1 hour or as events warrant.

3 days ago - at 10/14/2025 12:42PM

Starting at approximately 21:00 UTC on October 13, 2025, customers may experience failures or delays in starting new job runs or terminating existing runs due to library installation failures on compute resources using Graviton-based instance types. As a result, some running jobs may not complete on time.

Our engineering team is actively investigating and working to restore normal operation. If you are affected, please contact Databricks Support at help@databricks.com for potential workarounds.

Further updates will be provided within 2 hours or as events warrant.

3 days ago - at 10/14/2025 02:58PM

Starting at approximately 21:00 UTC on October 13, 2025, customers may experience failures or delays in starting new job runs or terminating existing runs due to library installation failures on compute resources using Graviton-based instance types. As a result, some running jobs may not complete on time.

We have identified that the issue is caused by a third-party library dependency (pydantic). Our engineering team is actively working with the third party to restore normal operation.
If you are affected, please contact Databricks Support at help@databricks.com for potential workarounds.

Further updates will be provided within 2 hours or as events warrant.
