Outage in Astro

Degraded Worker Autoscaling on Hosted Shared Azure Cluster (US-East2)

Resolved Minor

March 04, 2026 - Started about 1 month ago - Lasted 1 day
Official incident page

Incident Report

We are currently investigating an issue affecting Airflow worker autoscaling for some deployments hosted on the Shared Azure Cluster in the US-East2 region. As a result of this issue, worker pods may not scale up as expected in response to workload demand. This can lead to tasks remaining in a queued state longer than usual and, in some cases, failing due to queued timeouts. Our engineering team is actively working to identify the root cause and restore normal autoscaling behaviour. We will provide further updates as more information becomes available. Impact: • Affected deployments may experience delayed task execution. • Worker pods may not scale up from zero or may not scale as expected under load. We will continue to share updates on this page as we make progress toward resolution.

Trusted by 1,000+ teams

The Status Page Aggregator with Early Outage Detection

Stop finding out about outages from your users. Monitor 6,320+ cloud services and get alerted the second something breaks.

Start Free Trial

No credit card
14-day trial
2-minute setup

Latest Updates ( sorted recent to last )

RESOLVED about 1 month ago - at 03/05/2026 11:05PM

This incident has been resolved.

MONITORING about 1 month ago - at 03/05/2026 04:49PM

Azure confirmed that the fix has been successfully deployed to East US 2 region. We are monitoring the results.

IDENTIFIED about 1 month ago - at 03/04/2026 02:31PM

We have identified the root cause of the autoscaling issue affecting certain deployments on the Shared Azure Cluster in the US-East2 region. The issue is related to underlying infrastructure behaviour within the cloud provider environment.

Azure has acknowledged the issue and is currently rolling out a hotfix across regions. We are actively monitoring the rollout and will provide further updates as they become available.

INVESTIGATING about 1 month ago - at 03/04/2026 02:10PM

We are currently investigating an issue affecting Airflow worker autoscaling for some deployments hosted on the Shared Azure Cluster in the US-East2 region.

As a result of this issue, worker pods may not scale up as expected in response to workload demand. This can lead to tasks remaining in a queued state longer than usual and, in some cases, failing due to queued timeouts.

Our engineering team is actively working to identify the root cause and restore normal autoscaling behaviour. We will provide further updates as more information becomes available.

Impact:
• Affected deployments may experience delayed task execution.
• Worker pods may not scale up from zero or may not scale as expected under load.

We will continue to share updates on this page as we make progress toward resolution.

Latest Astro outages

Runtime 3.2-1 Yanked - Incompatible with Env Manager - about 3 hours ago

Degraded service for some deployments in Azure West Europe - 3 days ago

Astro Alerts Degraded Performance - 4 days ago

Deployment changes fail due to an unrelated "hibernation" error - 15 days ago

Deployment CRUD Operations Experiencing Failures - about 1 month ago

The Status Page Aggregator with Early Outage Detection

With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.

Start free trial

No credit card required · Cancel anytime · 6320 services available

Integrations with Slack Microsoft Teams Google Chat Datadog PagerDuty Zapier Discord Webhook