Outage in Buildkite

Increased latency and error rates

Resolved Major
October 20, 2025 - Started 11 days ago - Lasted about 8 hours
Official incident page

Incident Report

We're observing increased latency and error rates due to an inability to scale up. We're currently investigating and will provide status updates as they become available.

Need to monitor Buildkite outages?

One place to monitor all your cloud vendors. Get instant alerts when an outage is detected.

Latest Updates ( sorted recent to last )
RESOLVED 11 days ago - at 10/20/2025 10:02PM

Our services have been fully recovered for the last hour, so we are marking this as resolved.

Our engineers will continue to monitor AWS and will keep services scaled up to prevent impact from any additional failures.

MONITORING 11 days ago - at 10/20/2025 09:04PM

Latency and error rates have all returned to baseline levels. We have seen full recovery of our services.

We continue to actively monitor our services and the AWS reports on us-east-1 impact to ensure stability is maintained.

MONITORING 11 days ago - at 10/20/2025 08:33PM

We're seeing signs of recovery across the board. Error rates have reduced to baseline levels. Latency is trending towards baseline.

We continue to actively monitor our services and the AWS reports on us-east-1 impact.

MONITORING 11 days ago - at 10/20/2025 07:29PM

We're seeing slow recovery of all our services. Latency and error rates are decreasing across the board. We are continuing to monitor the situation.

IDENTIFIED 11 days ago - at 10/20/2025 06:54PM

Our mitigations improved latency for the Agent API, although latency and error rates are still visible across other services. The us-east-1 issue is reporting some recovery and we are seeing further improvements in our services. We are actively monitoring the situation and implementing mitigations where possible.

IDENTIFIED 11 days ago - at 10/20/2025 05:36PM

We have implemented mitigations and see an improvement in latency for the Agent API. Latency and error rates continue to be elevated across Rest, GraphQL and Web service as well as notifications being delayed.

We are continuing to work through mitigations and will provide an update in 1 hour.

IDENTIFIED 11 days ago - at 10/20/2025 05:04PM

We're continuing to see increased latency across much of our sub-systems due to an on going AWS outage. We are unable to launch new tasks in us-east-1 and are investigating potential mitigations to restore service.

INVESTIGATING 11 days ago - at 10/20/2025 03:17PM

We're currently working on mitigations for scaling up, but at this stage service is degraded with increased latency across API, notifications, and builds starting.

INVESTIGATING 11 days ago - at 10/20/2025 02:18PM

We're observing increased latency and error rates due to an inability to scale up. We're currently investigating and will provide status updates as they become available.

Latest Buildkite outages

Ongoing AWS incident - 11 days ago
Increased latency - 16 days ago

The Status Page Aggregator Built for IT Teams

With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.

Start free trial

No credit card required · Cancel anytime · 4522 services available

Integrations with Slack Microsoft Teams Google Chat Datadog PagerDuty Zapier Discord Webhook