DataRobot Outage History

Every past DataRobot outage tracked by IsDown, with detection times, duration, and resolution details.

There have been 146 DataRobot outages since February 2020. The 30 outages from the last 12 months are summarized below, with incident details, duration, and resolution information.

Minor April 10, 2026

April 2026: Delay in processing actual messages

Detected Apr 10, 2026 6:24 AM EDT · Resolved Apr 10, 2026 7:01 AM EDT · Duration 37 minutes

DataRobot experienced a 37-minute delay in processing messages on the JP MTS due to an autoscaling malfunction. Engineering resolved the issue by manually scaling up the deployment and applying infrastructure configuration changes. The service returned to normal operation with continued monitoring to ensure cluster stability.

Minor March 30, 2026

March 2026: Degraded Performance on DataRobot MTS due to Quay outage

Detected Mar 30, 2026 4:43 PM EDT · Resolved Mar 31, 2026 4:53 AM EDT · Duration about 12 hours

A Quay.io outage caused degraded performance across the DataRobot platform for 12.2 hours, affecting the website, AI Catalog, Data Ingest, AI Apps, AutoML, Predictions, and API services. The engineering team monitored the external Quay outage and worked on mitigation efforts. The incident was resolved when Quay.io functionality was restored and all DataRobot environments were fully stabilized.

Minor March 13, 2026

March 2026: Performance Degradation on Managed AI Cloud

Detected Mar 13, 2026 1:49 PM EDT · Resolved Mar 13, 2026 2:47 PM EDT · Duration about 1 hour

DataRobot's Managed AI Cloud experienced performance degradation affecting the API for 58 minutes. A fix was implemented and the incident was resolved after monitoring confirmed the solution was effective.

Minor March 11, 2026

March 2026: Intermittent UI disruptions on Managed AI Cloud

Detected Mar 11, 2026 4:05 PM EDT · Resolved Mar 17, 2026 2:36 PM EDT · Duration 6 days

DataRobot experienced intermittent UI disruptions on its Managed AI Cloud platform that affected both the website and the API for approximately 6 days. The incident was classified as minor severity. A fix was implemented, and the issue was resolved after monitoring confirmed stability.

Major March 11, 2026

March 2026: Network issue related to Kubernetes in US cluster

Detected Mar 11, 2026 10:36 AM EDT · Resolved Mar 11, 2026 12:44 PM EDT · Duration about 2 hours

DataRobot experienced a network issue related to Kubernetes in its US cluster that impacted model deployment and predictions functionality. The incident lasted 2.1 hours, during which engineering identified the root cause and implemented a mitigation. The issue was fully resolved after monitoring confirmed the environment had completely recovered.