May 2026: Widespread intermittent service issues for new workloads in US Production
DataRobot experienced intermittent service issues in US Production for 20.1 hours that prevented users from launching new workloads for Notebooks, Custom Models, and Custom Applications, while existing workloads remained unaffected. The disruption was caused by an ongoing AWS Availability Zone outage that led to resource allocation failures. Engineering resolved the underlying workload scheduling issue and confirmed all services were restored.