Outage in Linode

Service Issue: RTX 4000 Ada GPU Errors Across Multiple Regions

Resolved Minor

March 04, 2026 - Started about 1 month ago - Lasted about 16 hours

Incident Report

Summary AI Generated

Linode experienced a critical service issue affecting NVIDIA RTX 4000 Ada GPU nodes in multiple regions (Osaka, Seattle, and Chicago). Affected nodes could enter an unrecoverable error state, causing failures in Vulkan initialization and GPU-accelerated workloads. Some LKE clusters in the Osaka region also had Control Plane connectivity issues, resulting in timed-out API requests and errors. A first fix was implemented and monitored, but the issue recurred across multiple regions before it was corrected, and the incident was marked resolved about 16.1 hours after the first update. The published updates do not state a confirmed root cause; engineering teams were investigating a potential regression in the underlying host hypervisor or GPU firmware.

We are investigating a critical service issue affecting NVIDIA RTX 4000 Ada GPU nodes across multiple regions, including Osaka (osa1), Seattle (sea1), and Chicago (ord1). Affected GPU nodes may report an unrecoverable error state leading to failures in Vulkan initialization and GPU-accelerated workloads. Additionally, some LKE clusters in the Osaka region are currently experiencing Control Plane connectivity issues, resulting in timed-out API requests and errors. Our engineering teams are currently investigating the root cause, focusing on a potential regression in the underlying host hypervisor or GPU firmware. We will provide more information as it becomes available.

Latest Updates (most recent first)

RESOLVED about 1 month ago - at 03/05/2026 06:11PM

We haven’t observed any additional issues with the service, and will now consider this incident resolved. If you continue to experience problems, please open a Support ticket for assistance.

MONITORING about 1 month ago - at 03/05/2026 05:01PM

At this time we have been able to correct the issues affecting the service. We will be monitoring this to ensure that it remains stable. If you continue to experience problems, please open a Support ticket for assistance.

IDENTIFIED about 1 month ago - at 03/05/2026 04:23PM

Our team has identified the issue affecting the service. We are working quickly to implement a fix, and we will provide an update as soon as the solution is in place.

INVESTIGATING about 1 month ago - at 03/05/2026 02:48PM

We are continuing to investigate and will provide the next update as progress is made.

INVESTIGATING about 1 month ago - at 03/05/2026 10:50AM

We are aware of a recurrence of this issue across multiple regions. We are continuing to investigate and will provide the next update as progress is made.

MONITORING about 1 month ago - at 03/05/2026 07:34AM

Our team has identified the issue affecting the service and implemented a fix. We will be monitoring this to ensure that it remains stable. If you continue to experience problems, please open a Support ticket for assistance.

INVESTIGATING about 1 month ago - at 03/05/2026 06:55AM

We are continuing to investigate the issue. We will provide the next update as progress is made.

INVESTIGATING about 1 month ago - at 03/05/2026 05:48AM

Our subject matter experts are actively investigating the issue. We will provide the next update as progress is made.

INVESTIGATING about 1 month ago - at 03/05/2026 02:07AM

We are investigating a critical service issue affecting NVIDIA RTX 4000 Ada GPU nodes across multiple regions, including Osaka (osa1), Seattle (sea1), and Chicago (ord1).
Affected GPU nodes may report an unrecoverable error state leading to failures in Vulkan initialization and GPU-accelerated workloads. Additionally, some LKE clusters in the Osaka region are currently experiencing Control Plane connectivity issues, resulting in timed-out API requests and errors.
Our engineering teams are currently investigating the root cause, focusing on a potential regression in the underlying host hypervisor or GPU firmware. We will provide more information as it becomes available.
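
The durations implied by the update log can be checked with a short script. The following is a minimal sketch, not part of the official report: the timestamps are copied from the updates above, the function names are illustrative, and it assumes all timestamps share one time zone (the page does not state which) and that the default English AM/PM locale is in use.

```python
from datetime import datetime

# Timestamp layout used in the update log, e.g. "03/05/2026 02:07AM".
FMT = "%m/%d/%Y %I:%M%p"

# (status, timestamp) pairs copied from the update log, oldest first.
UPDATES = [
    ("INVESTIGATING", "03/05/2026 02:07AM"),
    ("INVESTIGATING", "03/05/2026 05:48AM"),
    ("INVESTIGATING", "03/05/2026 06:55AM"),
    ("MONITORING", "03/05/2026 07:34AM"),
    ("INVESTIGATING", "03/05/2026 10:50AM"),
    ("INVESTIGATING", "03/05/2026 02:48PM"),
    ("IDENTIFIED", "03/05/2026 04:23PM"),
    ("MONITORING", "03/05/2026 05:01PM"),
    ("RESOLVED", "03/05/2026 06:11PM"),
]


def hours_between(start, end):
    """Return the elapsed hours between two log timestamps."""
    delta = datetime.strptime(end, FMT) - datetime.strptime(start, FMT)
    return delta.total_seconds() / 3600


def total_duration_hours(updates):
    """Hours from the first update to the last one."""
    return hours_between(updates[0][1], updates[-1][1])


def recurrence_gap_hours(updates):
    """Hours between the first MONITORING update and the next
    INVESTIGATING update, or None if the issue never recurred."""
    first_fix = None
    for status, stamp in updates:
        if first_fix is None and status == "MONITORING":
            first_fix = stamp
        elif first_fix is not None and status == "INVESTIGATING":
            return hours_between(first_fix, stamp)
    return None


if __name__ == "__main__":
    print(f"Total duration: {total_duration_hours(UPDATES):.1f} h")
    print(f"First fix held for: {recurrence_gap_hours(UPDATES):.2f} h")
```

Measured this way, the span from the first update to the RESOLVED update is 16 hours 4 minutes, which matches the 16.1 hours quoted in the summary, and the first fix held for a little over three hours before the recurrence was reported.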

Latest Linode outages

Service Issue - Dedicated CPU - Several Regions - about 17 hours ago

Service Issue - Networking in US-IAD (Washington) - 6 days ago

Emerging Service Issue - Network - London 2(gb-lon) - 8 days ago

Emerging Service Issue - Object Storage - us-east-1 - 8 days ago

Service Issue - New Linode Creation - 22 days ago
