Number of Incidents
1 outage
Since last incident
23 days
Stay on top of outages with IsDown's real-time notifications and overview dashboards. Monitor the official status pages of all your vendors, SaaS, and tools, including Azure.
Minor · 23 days ago · lasted 1 day
Multiple services recovering after power/cooling issue - Australia East
Impact Statement: Starting at approximately 08:30 UTC on 30 August 2023, a utility power surge in the Australia East region tripped a subset of the cooling units offline in one datacenter, within one of the Availability Zones. While working to restore cooling, temperatures in the datacenter increased so we proactively powered down a small subset of selected compute and storage scale units, to avoid damage to hardware. Multiple downstream services were impacted, with targeted communications being distributed via Azure Service Health. Current Status: Storage infrastructure has recovered. A subset of services still experiencing residual impact are on the path to mitigation. Mitigation: We worked on recovering the failed cooling units and reducing the overall temperature within the impacted area. Once temperature levels were within operational thresholds, we began to restore power to the affected infrastructure and started a phased process to bring this infrastructure back online. Once st...
Minor · 3 months ago · lasted about 10 hours
Azure Monitor - Logs Data Access Issues - Mitigated
Summary of Impact: Between 23:15 UTC on 06 July 2023 and 09:00 UTC on 07 July 2023, customers using Azure Monitor may have experienced issues accessing logs, data gaps, and missed / latent alerts. Preliminary Root Cause: We determined a configuration issue triggered a bug, causing a significant unexpected increase in traffic, which caused one of our backend services responsible for processing and storing logs to reach throughput limits. This caused the service's control plane to become unresponsive, leading to services becoming unable to retrieve authentication tokens. The resulting impact spread to multiple regions. Mitigation: We throttled heavy consumers and spun up additional infrastructure to process the incoming load, as well as the backlog of requests. Next Steps: We will follow up in 3 days with a preliminary Post Incident Report (PIR), which will cover the main root cause and repair items. We'll follow that up 14 days later with a final PIR where we will share a deep dive into the...
Minor · 8 months ago · lasted about 1 hour
Datacenter Cooling Event - Southeast Asia
Issue Summary: Starting around 20:19 UTC on 7 February 2023, a utility power surge in the Southeast Asia region tripped all of the chiller units for one datacenter offline. While working to restore the chiller units, temperatures in the datacenter increased so we had proactively powered down compute, storage and networking resources to avoid damage to hardware. All impacted infrastructure is in the same single datacenter, within one of the region’s three Availability Zones (AZs). We are continuing to work through our structured power-up process, initially targeting storage resources, followed by compute resources. We are actively monitoring as we work through this extended process, and storage restoration is progressing well. Downstream services that have been identified as impacted include Azure App Services, Azure Backup, Azure Cosmos DB, Azure Database for MySQL & flexible server, Azure Database for PostgreSQL & flexible server, Azure Log Analytics, Azure Red Hat OpenShif...
Minor · 8 months ago · lasted 1 day
Datacenter Cooling Event - Southeast Asia - Extended Mitigation
Impact Statement: Starting around 20:19 UTC on 7 February 2023, a utility power surge in the Southeast Asia region tripped a subset of cooling units offline in one of the Availability Zones. While working to restore the cooling units, temperatures in the datacenter increased and we have proactively powered down a small subset of compute and storage units to avoid damage to hardware and reduce cooling system load. All impacted storage and compute scale units are in the same datacenter, within one of the region’s three Availability Zones (AZs). Multiple downstream services have been identified as impacted. Current Status – 03:30 UTC: We are continuing to focus our efforts on mitigation for services which were impacted due to this incident. The core storage and compute services restoration have been completed successfully. We have several other key services fully recovered and for some of which we are still working on post recovery checks. We are closely monitoring the datacenter metrics fo...
Minor · 8 months ago · lasted about 3 hours
Azure Networking - Multiple regions - Validating Recovery
Between 07:05 UTC and 09:45 UTC on 25 January 2023, customers may have experienced issues with networking connectivity, manifesting as network latency and/or timeouts when attempting to connect to Azure resources in Public Azure regions, as well as other Microsoft services including M365, PowerBI. We've determined the network connectivity issue was occurring with devices across the Microsoft Wide Area Network (WAN). This impacted connectivity between clients on the internet to Azure, as well as connectivity between services in datacenters, as well as ExpressRoute connections. Current Status: We have identified a recent change to WAN as the underlying cause, and have taken steps to roll back this change. Our telemetry shows consistent signs of recovery from 09:45 UTC onwards across multiple regions and services. Most customers should now see full recovery as WAN networking has recovered fully. We are working to monitor and ensure full recovery for services that were impacted. The next u...
“If you are in SRE, IT, or Security and work in an environment with a lot of SaaS (which, let's face it, is all of them) - IsDown is your new best friend. Helpfully aggregates various Statuspages from services into a very clear dashboard. Worth every penny.”
“Your support is one of the best I have ever worked with. Thanks for having a great product AND great support.”
Get notified only when an outage impacts a certain component.
IsDown monitors Azure and its competitors, along with 2500 other cloud services. Check the current status of the most popular alternatives to Azure.
The data and notifications you need, in the tools you already use.
SaaS rules the world, and all teams depend on it to do their most productive work. IsDown helps you monitor all your cloud services, so you can focus on what matters.
Try it out! How much time will you save your team by having outage information close to them?