Our Engineering team has confirmed the full resolution of this issue, which impacted our Cloud Control Panel, API, Managed Kubernetes clusters, and Block Storage Volumes.
Between 16:47 - 17:18 UTC, users experienced 500 errors and/or latency when attempting to reach the Cloud Control Panel and API.
Between 16:47 - 18:49 UTC, users were unable to successfully attach/detach Block Storage Volumes.
Between 16:47 - 20:50 UTC, users were unable to create new Managed Kubernetes clusters, as well as a subset of users were unable to destroy or upgrade existing clusters.
If you continue to experience problems, please open a ticket with our support team from within your Cloud Control Panel.
Our Engineering team has put further remediations in place and reconciliation efforts for Managed Kubernetes clusters have completed. At this time, creation for new clusters is enabled again and all cluster operations should be succeeding.
With the root cause identified and remediations in place, we are no longer observing any impact to users. We'll continue to monitor the situation and post an update once we confirm the issue is fully resolved. Thank you for your patience.
Our Engineering team continues work to remediate the root cause of this incident. Further changes have been made and Managed Kubernetes cluster creation will now fail outright should users attempt to provision a cluster. Attach and detach operations on Block Storage Volumes (except for Managed Kubernetes) are now available and are succeeding as expected.
Our Engineering team has identified additional impact related to the root cause of this incident. At this time, creation, deletion, and upgrading of Managed Kubernetes clusters may result in failure due to a failing dependency of control planes. Attach and detach operations on Block Storage Volumes also remain unavailable.
As a result of necessary remediation steps, attach and detach operations on Block Storage Volumes in all regions are currently unavailable. Our Engineering team is continuing work to fully remediate the root cause. The Cloud Control Panel and API remain accessible.
The implemented actions have been successful in remediating the impact of this incident and our Engineering team has confirmed error rates are returning to normal thresholds. At this time, the Cloud Control Panel and API are accessible again.
Our Engineering team has identified the cause of the issue with the Cloud Control Panel and API and is now taking steps to mitigate impact. We're seeing some recovery, but users may still be unable to reach the Control Panel or API. We'll post an update once we confirm the implemented actions have remediated impact.
We are investigating reports of issues reaching the DigitalOcean Cloud Control Panel and API. At this time, users may see 500 errors or latency when attempting to reach both the Control Panel and API. We'll provide an update shortly.
IsDown is an uptime monitoring solution for your critical business dependencies. Keep tabs on your SaaS and cloud providers in real-time and never miss another outage again. Get instant alerts and stay informed when an incident impacts your operations.Start free trial
No credit card required · Cancel anytime · 2362 services available
Quickly identify external outages that impact your business. We are monitoring more than 2300 services in real time.
Your team on top of problems
IsDown aggregates the information from the status pages of all your services, making it easy to monitor the health of all your services in one place. Say goodbye to managing each status page individually - our service simplifies the process.
No more wasting time. Uptime monitoring in real time
Say goodbye to wasting time trying to diagnose issues with your services - our 24/7 monitoring service does the work for you. We'll notify you if there is an incident, so you can focus on other tasks.
Receive alerts in your preferred channels
Our outage monitoring keeps you informed, no matter where you are. Get instant notifications in your email, Slack, Teams, or Discord when an outage is detected, so you can take action quickly.
Easily integrate with your current tools and workflows
Enhance your processes with more information using our integration of Zapier, Webhooks, PagerDuty, and Datadog. Stay notified and in control. Upgrade your operations today.
Avoid notifications clutter
Maximize your control with customizable notifications from each service. Filter by components and severity to only receive the most important updates. Streamline your processes and stay informed with our advanced notification features.
Multiple dashboards, shareable with the world
Create one dashboard for each of your teams/clients/projects and monitor only the services that each uses. Have a dedicated dashboard with custom notification settings. Easily make your dashboard public and share it with the world.
Prepare for scheduled maintenances
Never again be caught off guard by unexpected maintenance from your services. A feed of the next scheduled maintenances is available.
Weekly Digest of the services' outages
Every Monday, you'll receive a weekly summary of what happened the previous week as well as the maintenance schedule for the following week.
DevOps & On-Call Teams
You already monitor your internal systems. What about the external services? Monitor the services your business depends on. Don't waste time looking elsewhere when external outages are the cause of issues.
IT Support Teams
Detect external outages before your clients tell you. Anticipate possible issues and make the necessary arrangements. Having proactive communication, builds trust over clients and prevents flow of support tickets.
5 minute setup,
instant value for your team
Start with a trial account that will allow you to try and monitor up to 40 services for 14 days.
There are 2362 services to choose from and you can start monitoring, and we're adding more every week.
You can get notifications by email, Slack, and Discord. You can also use Zapier or Webhooks to build your workflows.
You'll start getting alerts when we detect outages in your external dependencies! No more wasting time looking in the wrong place!
Try it out! How much time you'll save your team, by having the outages information close to them?