Use cases
Software Products E-commerce MSPs Schools Development & Marketing DevOps Agencies Help Desk
Company
Internet Status Blog Pricing Log in Get started free

Blacksmith Outage History

Every past Blacksmith outage tracked by IsDown, with detection times, duration, and resolution details.

There were 111 Blacksmith outages since April 2025. The 99 outages from the last 12 months are summarized below, with incident details, duration, and resolution information.

Minor May 20, 2026

May 2026: Delays in job adoption in eu-west

Detected May 20, 2026 12:34 PM EDT · Resolved May 20, 2026 3:15 PM EDT · Duration about 3 hours

Blacksmith experienced job processing delays in the eu-west region due to a spike in jobs that caused temporary capacity shortage. The service team worked on re-balancing capacity to reduce the delays, while also monitoring a related GitHub Actions incident. The issue was resolved after 2.7 hours through capacity re-balancing efforts.

Minor May 19, 2026

May 2026: GitHub webhooks degraded causing job queueing

Detected May 19, 2026 10:52 AM EDT · Resolved May 19, 2026 7:40 PM EDT · Duration about 9 hours

GitHub webhook delivery degradation caused jobs to queue up in the Blacksmith service for 8.8 hours. The upstream GitHub issue was resolved, but the system had to work through a substantial backlog of accumulated webhook events and queued tasks. Full recovery was achieved as the backlog was processed and runner pools returned to normal operation.

Minor May 15, 2026

May 2026: Github Actions degraded

Detected May 15, 2026 4:24 AM EDT · Resolved May 15, 2026 4:32 AM EDT · Duration 8 minutes

GitHub Actions experienced a degradation that lasted 8 minutes, affecting the Actions component of the GitHub service. The Blacksmith team identified the issue and monitored its impact while GitHub addressed the underlying problem. The incident was resolved after the brief 8-minute duration.

Minor May 14, 2026

May 2026: Job Queueing due to Capacity Event

Detected May 14, 2026 5:22 PM EDT · Resolved May 14, 2026 6:30 PM EDT · Duration about 1 hour

Blacksmith experienced job queueing issues due to a high volume of incoming jobs that exceeded system capacity. The incident caused delays in job adoption and processing. The service disruption lasted 1.1 hours and was classified as a minor incident.

Major May 12, 2026

May 2026: Webhook Service Outage

Detected May 12, 2026 4:44 PM EDT · Resolved May 12, 2026 7:49 PM EDT · Duration about 3 hours

The Blacksmith webhook service experienced a major outage lasting 3.1 hours. A fix was implemented within approximately 3 minutes of the initial investigation. The service was then monitored to ensure the resolution was effective.

Minor May 12, 2026

May 2026: Job adoption delays

Detected May 12, 2026 11:12 AM EDT · Resolved May 12, 2026 3:00 PM EDT · Duration about 4 hours

A load-related issue in Blacksmith's control plane caused delays in job adoption for 3.8 hours. The engineering team implemented multiple fixes to address the performance bottleneck. The service was restored and monitored to ensure stable job processing.

Major May 11, 2026

May 2026: Blacksmith control plane outage

Detected May 11, 2026 11:57 AM EDT · Resolved May 12, 2026 3:07 AM EDT · Duration about 15 hours

Blacksmith experienced a major control plane outage lasting 6.6 hours due to issues with their upstream database provider. The incident caused delays with job adoption and created a backlog of jobs that needed to be re-queued. A fix was implemented and jobs were re-queued, but slow job adoption issues persisted while the team worked with the database provider to resolve the root cause.

Minor May 9, 2026

May 2026: Elevated queue time for mac runners

Detected May 9, 2026 5:00 PM EDT · Resolved May 9, 2026 6:40 PM EDT · Duration about 2 hours

Blacksmith experienced elevated queue times for Mac runners during a 4-minute service incident. The issue was classified as minor and was under investigation. The incident has been resolved.

Major May 8, 2026

May 2026: Elevated tail latency in git checkout operations

Detected May 8, 2026 2:25 PM EDT · Resolved May 8, 2026 4:33 PM EDT · Duration about 2 hours

Blacksmith experienced elevated latency in git checkout operations in the US West region due to an upstream network provider degradation. Some git checkout operations took significantly longer than normal baseline performance. The issue was resolved after 2.1 hours when the upstream provider fixed the network degradation and checkout times returned to normal.

Minor May 7, 2026

May 2026: Job adoption delays

Detected May 7, 2026 12:00 PM EDT · Resolved May 7, 2026 1:21 PM EDT · Duration about 1 hour

Blacksmith experienced a load-related issue in the control plane that caused delays in GitHub workflow job adoption. The incident was classified as minor and lasted 1.4 hours. The team identified the root cause and worked on implementing a mitigation to resolve the delays.