Need to monitor Buildkite outages?
Stay on top of outages with IsDown. Monitor the official status pages of all your vendors, SaaS, and tools, including Buildkite, and never miss an outage again.
Start Free Trial
The system has stabilised and we continue to work on more long term mitigations to latency issues we've been experiencing over the past few days.
Our systems are operating at normal levels, we continue to monitor performance.
We saw increased load create increased notification latency. Our mitigations continue to allow Job Dispatch and Web UI latency down to within acceptable levels.
We are continuing to work on ways to improve notification latency.
Notifications are delayed and we're investigating this.
Job dispatch and the web interface are back to normal. While we continue to see delays to outbound notifications of up to 6 minutes, latency is improving. We understand this has some impact on customers and we continue to work on longer term mitigations to this ongoing issue.
We have seen job dispatch stabilise down to within our SLA, but continue to have higher latency than SLA for notifications.
We continue to investigate how to improve notification latency
We have deployed a change to prioritize job dispatch over notifications (i.e. commit statuses). The impact of this is that customers will see commit statuses delayed by up to 30 minutes. This is a once-off impact and notifications latency is expected to return to normal after the initial backlog has been processed.
We continue to take steps to stabilize Job Dispatch and we hope to have those changes implemented in the next 90 minutes
Notifications (including commit statuses) will continue to be delayed and may get worse as we limit their load on the system in order to prioritize job dispatch. We continue to work on stabilizing system load and will provide an ETA when available.
We continue to investigate database load and work on ways to reduce the impact to Job Dispatch primarily.
We have multiple streams of work going on to improve Job Dispatch as our first priority
We are investigating latency spikes across many of our asynchronous processing queues which is causing slowness in notifications and job assignments. We have identified an issue with database load and we continue to investigate while taking steps to mitigate database load and keep the system stable.
We are investigating reports of sluggish UI and latency in assigning jobs to agents
With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.
Start free trialNo credit card required · Cancel anytime · 3153 services available
Integrations with
How much time you'll save your team, by having the outages information close to them?
14-day free trial · No credit card required · Cancel anytime