Outage in AWS

Increased Error Rates for Cluster Upgrades - N. Virginia

Resolved Minor
September 29, 2023 - Started 7 months ago - Lasted about 2 hours

Need to monitor AWS outages?
Stay on top of outages with IsDown. Monitor the official status pages of all your vendors, SaaS, and tools, including AWS, and never miss an outage again.
Start Free Trial

Outage Details

We can confirm that customers are experiencing elevated error rates for cluster upgrade operations in the US-EAST-1 Region. This is a result of actions we took to prevent additional customer impact following the EC2 network propagation event yesterday in a single Availability Zone (use1-az2). For EKS customers, we have purposely disabled cluster upgrade operations to prevent any impact to existing clusters and running applications. We have identified the root cause for the underlying issue and are working towards mitigating the current impact. We will enable cluster upgrade operations once the issue is resolved. We will provide an update within the next 60 minutes, or sooner if we have additional information to share.
Components affected
Amazon EKS (us-east-1)
Latest Updates ( sorted recent to last )
UPDATE 7 months ago - at 09/29/2023 08:08PM

We are making progress towards addressing the elevated error rates for cluster upgrade operations in the US-EAST-1 Region. In an abundance of caution, we left EKS traffic shifted out of the previously affected Availability Zone (use1-az2) while additional mitigations for the previous event were put in place. This meant traffic was shifted away from the affected zone, which would prevent workloads from running in the affected Availability Zone. During this period, customers would be unable to perform cluster upgrades and configuration updates in the Region. We are nearly complete with shifting traffic back in for EKS dependencies, and we will then enable cluster upgrade operations for EKS. We expect to see recovery within the next 30 minutes.

UPDATE 7 months ago - at 09/29/2023 06:56PM

We can confirm that customers are experiencing elevated error rates for cluster upgrade operations in the US-EAST-1 Region. This is a result of actions we took to prevent additional customer impact following the EC2 network propagation event yesterday in a single Availability Zone (use1-az2). For EKS customers, we have purposely disabled cluster upgrade operations to prevent any impact to existing clusters and running applications. We have identified the root cause for the underlying issue and are working towards mitigating the current impact. We will enable cluster upgrade operations once the issue is resolved. We will provide an update within the next 60 minutes, or sooner if we have additional information to share.

Stay informed of vendor status updates

With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.

Start free trial

No credit card required · Cancel anytime · 3153 services available

Integrations with Slack Microsoft Teams Google Chat Datadog PagerDuty Zapier Discord Webhook

Setup in 5 minutes or less

How much time you'll save your team, by having the outages information close to them?

14-day free trial · No credit card required · Cancel anytime