Outage in GCP Databricks

ES-1022960

Resolved Major
January 30, 2024 - Started almost 2 years ago - Lasted about 2 hours

Incident Report

We are investigating an issue with one of the Databricks services.

Incident Details:
- Workspace authentication requests may fail or timeout.
- Cluster start/resize/termination requests may fail or time out.
- Jobs relying on cluster start/resize/termination may not execute.
- Jobs submitted through APIs/Schedulers may not execute.
- UI and Databricks SQL queries may time out.
- Users may experience failures launching Databricks Serverless SQL Warehouses.
- Users may not be able to access UC APIs.

Incident Start Time: 19:02 UTC January 30 2024

We will provide an update in the next hour, or as soon as the issue has been identified.

Need to monitor GCP Databricks outages?

  • Monitor all your external dependencies in one place
  • Get instant alerts when outages are detected
  • Be the first to know if service is down
  • Show real-time status on private or public status page
  • Keep your team informed
Latest Updates ( sorted recent to last )
almost 2 years ago - at 01/30/2024 07:32PM

We are investigating an issue with one of the Databricks services.

Incident Details:
- Workspace authentication requests may fail or timeout.
- Cluster start/resize/termination requests may fail or time out.
- Jobs relying on cluster start/resize/termination may not execute.
- Jobs submitted through APIs/Schedulers may not execute.
- UI and Databricks SQL queries may time out.
- Users may experience failures launching Databricks Serverless SQL Warehouses.
- Users may not be able to access UC APIs.

Incident Start Time: 19:02 UTC January 30 2024

We will provide an update in the next hour, or as soon as the issue has been identified.

almost 2 years ago - at 01/30/2024 07:59PM

We have identified the problem with the Databricks service. Our team is continuing to work on a mitigation.

Incident Details:
- Workspace authentication requests may fail or timeout.
- Cluster start/resize/termination requests may fail or time out.
- Jobs relying on cluster start/resize/termination may not execute.
- Jobs submitted through APIs/Schedulers may not execute.
- UI and Databricks SQL queries may time out.
- Users may experience failures launching Databricks Serverless SQL Warehouses.
- Users may not be able to access UC APIs.

Incident Start Time: 19:02 UTC January 30 2024

We will provide an update in the next hour, or as soon as the issue has been mitigated.

almost 2 years ago - at 01/30/2024 09:08PM

We have identified the problem with the Databricks service. Our team is continuing to work on a mitigation.

Incident Details:
- Workspace authentication requests may fail or timeout.
- Cluster start/resize/termination requests may fail or time out.
- Jobs relying on cluster start/resize/termination may not execute.
- Jobs submitted through APIs/Schedulers may not execute.
- UI and Databricks SQL queries may time out.
- Users may experience failures launching Databricks Serverless SQL Warehouses.
- Users may not be able to access UC APIs.

Incident Start Time: 19:02 UTC January 30 2024

We will provide an update in the next hour, or as soon as the issue has been mitigated.

Latest GCP Databricks outages

ES-1657639 - about 2 months ago
ES-1656060 - about 2 months ago
ES-1633306 - 3 months ago
ES-1570676 - 4 months ago
ES-1545723 - 5 months ago

The Status Page Aggregator with Early Outage Detection

With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.

Start free trial

No credit card required · Cancel anytime · 5420 services available

Integrations with Slack Microsoft Teams Google Chat Datadog PagerDuty Zapier Discord Webhook