Outage in Elastic Cloud

Issues with azure southeastasia-2 AZ

Resolved Minor
February 08, 2023 - Started over 1 year ago - Lasted 1 day
Official incident page

Need to monitor Elastic Cloud outages?
Stay on top of outages with IsDown. Monitor the official status pages of all your vendors, SaaS, and tools, including Elastic Cloud, and never miss an outage again.
Start Free Trial

Outage Details

Azure engineers are reporting about cooling event in southeastasia-2 AZ datacenter. We are observing degrade performance for clusters that having instances allocated in this AZ. Azure engineers are continue actively working to mitigate the temperature issues in the datacenter. Currently there is no ETA to share for restoration of the impacted scale units.
Latest Updates ( sorted recent to last )
RESOLVED over 1 year ago - at 02/09/2023 06:28AM

This incident has been resolved.

MONITORING over 1 year ago - at 02/09/2023 01:10AM

The migration of an impacted deployment hosts has been completed. The full restore for the replica shards on clusters where unhealthy instances were replaced may take additional time to complete. We are moving status of the incident to monitoring state and will provide updates as necessary.

IDENTIFIED over 1 year ago - at 02/08/2023 11:59PM

The new capacity has been provisioned. Our engineers have started migration of an impacted deployment hosts. Next update will be provided in an hour.

IDENTIFIED over 1 year ago - at 02/08/2023 09:43PM

Azure has provided status that all storage resources have been restored and monitors are healthy. With significant progress on compute resources with nearly all nodes restored back up. We will provide a status update in an hour or as we get more information. Please see https://azure.status.microsoft/en-us/status for more detailed information.

IDENTIFIED over 1 year ago - at 02/08/2023 08:27PM

Azure has provided status that all storage resources have been restored. Around 70% of compute resources have been restored and team is making good progress on remaining resources. Please see https://azure.status.microsoft/en-us/status for more detailed information. We will provide another update in an about an hour or when we receive more information.

IDENTIFIED over 1 year ago - at 02/08/2023 06:55PM

Azure has recovered most of the impacted storage resources and are performing post recovery checks. Compute resources recovery is making good progress. They are actively working on restoring the remaining resources and services. We will provide another update in an hour. Please see https://azure.status.microsoft/en-us/status for more details.

IDENTIFIED over 1 year ago - at 02/08/2023 04:57PM

Azure engineers are continuing thought the structured power-up process of stroage and compute resources. More details in https://azure.status.microsoft/en-us/status. Next update will be provided in 2 hours or as soon as we have more to share.

IDENTIFIED over 1 year ago - at 02/08/2023 02:41PM

Azure engineers successfully restored cooling systems in the impacted areas of the datacenter. They are commencing a structured power-up sequence for previously powered-down compute and storage resources. More details in https://azure.status.microsoft/en-us/status . Next update will be provided in 2 hours or as soon as we have more to share.

IDENTIFIED over 1 year ago - at 02/08/2023 01:13PM

Azure engineers are actively working in restoring cooling units to mitigate issues in the datacenter. Once operational threshold temperatures have stabilized, they will begin work on the restoration of Compute and Storage. More details in https://azure.status.microsoft/en-us/status . Next update will be provided in 2 hours or as soon as we have more to share.

IDENTIFIED over 1 year ago - at 02/08/2023 11:12AM

Azure engineers are actively working to mitigate issues in the datacenter to provide capacity for deployment hosts migration. Next update will be provided in 2 hours or as soon as we have more to share.

IDENTIFIED over 1 year ago - at 02/08/2023 09:10AM

Azure engineers are continue actively working to mitigate issues in the datacenter to provide capacity for deployment hosts migration. Next update will be provided in 2 hours or as soon as we have more to share.

IDENTIFIED over 1 year ago - at 02/08/2023 08:00AM

Azure engineers are continue actively working to mitigate issues in the datacenter to provide capacity for deployment hosts migration. Next update will be provided in an hour.

IDENTIFIED over 1 year ago - at 02/08/2023 06:55AM

Azure engineers are continue actively working to mitigate issues in the datacenter to provide capacity for deployment hosts migration. Next update will be provided in an hour.

IDENTIFIED over 1 year ago - at 02/08/2023 05:45AM

Azure engineers are continue actively working to mitigate issues in the datacenter to provide capacity for deployment hosts migration. Next update will be provided in an hour.

IDENTIFIED over 1 year ago - at 02/08/2023 04:41AM

Azure engineers are continue actively working to mitigate issues in the datacenter. We have moved non-HA Elasticsearch, Kibana, APM, Enterprise search clusters out of southeastasia-2 availability zone, restoring from latest snapshot, to recover availability of those clusters. Next update will be provided in an hour.

IDENTIFIED over 1 year ago - at 02/08/2023 03:09AM

Azure engineers are reporting about cooling event in southeastasia-2 AZ datacenter. We are observing degrade performance for clusters that having instances allocated in this AZ. Azure engineers are continue actively working to mitigate the temperature issues in the datacenter. Currently there is no ETA to share for restoration of the impacted scale units.

Never miss when a third-party service is down

With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.

Start free trial

No credit card required · Cancel anytime · 3170 services available

Integrations with Slack Microsoft Teams Google Chat Datadog PagerDuty Zapier Discord Webhook

Setup in 5 minutes or less

How much time you'll save your team, by having the outages information close to them?

14-day free trial · No credit card required · Cancel anytime