Technolutions experienced intermittent availability of certain databases in the US region for 4.6 hours, primarily affecting databases on the LIMA cluster and the Slate component. The issue was caused by a routine third-party database engine update that introduced a behavioral change leading to worker thread exhaustion, which triggered an automated failover to secondary infrastructure where some databases were slow to recover due to similar thread exhaustion. The incident was resolved by disabling the problematic functionality as advised by the vendor and increasing the worker thread ceiling to prevent recurrence.
There have been no continued impacts.
The incident, which primarily affected databases on the LIMA cluster, was caused by an automated failover to secondary infrastructure. A routine servicing update to the third-party database engine introduced a behavioral change that, under specific conditions, exhausted worker threads. After the failover, some databases were slower to recover because of a related exhaustion of worker threads on the secondary node. As advised by the vendor, we have disabled the functionality responsible for this behavioral change, and we have raised the ceiling for worker threads to reduce the potential for recurrence.
Everything remains stable at this time, and we will continue to monitor for any further issues.
We are continuing to monitor for any further issues.
All affected databases recovered approximately half an hour ago. We're continuing to address some internal connectivity issues, but we're not seeing any observable impacts at this time. There may be brief connection interruptions when failbacks complete later.
We are investigating the intermittent availability of certain databases in the US region. Some databases are still recovering following a failover to secondary infrastructure and may be unavailable during the completion of the failover. We will provide updates shortly.