DigitalOcean's Serverless Inference service experienced high error rates and elevated latency for the Qwen 3 32B model in the tor1 region for 3 hours starting at 10:46 UTC. The issue was caused by higher-than-expected request volume without sufficient resources to scale, resulting in capacity constraints and multiple workers in a pending state. The service was restored by expanding the node pool size to improve available capacity, along with implementing stability improvements to prevent similar issues.
Service has been fully restored, and the model is now operating normally. We have implemented improvements to enhance stability and reduce the likelihood of similar issues in the future.
We are currently investigating reports of elevated latency affecting requests to this model when using Serverless Inference and Agents.
Earlier observations indicated increased error rates for the open-source Qwen 3 32B model. The Ray dashboard also showed multiple workers in a pending state, suggesting capacity constraints.
Our analysis determined that the model was experiencing higher-than-expected request volume without sufficient resources to scale accordingly. To address this, the node pool size has been increased to improve available capacity. However, there are still insufficient nodes to fully support the desired number of model replicas.
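The capacity shortfall described above can be illustrated with simple back-of-the-envelope arithmetic. This is a hypothetical sketch only: the traffic rates, latency, concurrency, and GPU counts below are assumptions for illustration, not DigitalOcean's actual configuration for this model.

```python
import math

def replicas_needed(peak_rps: float, avg_latency_s: float,
                    concurrency_per_replica: int) -> int:
    """Replicas required to absorb peak load (Little's-law-style estimate)."""
    in_flight = peak_rps * avg_latency_s  # concurrent requests at peak
    return math.ceil(in_flight / concurrency_per_replica)

def nodes_needed(replicas: int, gpus_per_replica: int,
                 gpus_per_node: int) -> int:
    """Nodes required to schedule every replica.

    When this exceeds the node pool size, the surplus replicas cannot be
    scheduled and their workers sit in a Pending state.
    """
    return math.ceil(replicas * gpus_per_replica / gpus_per_node)

# Hypothetical example: traffic doubles to 40 req/s at 6 s average latency,
# with 8 concurrent requests per replica, 2 GPUs per replica, 4 GPUs per node.
replicas = replicas_needed(peak_rps=40, avg_latency_s=6, concurrency_per_replica=8)
nodes = nodes_needed(replicas, gpus_per_replica=2, gpus_per_node=4)
print(replicas, nodes)  # 30 replicas -> 15 nodes
```

Under these assumed numbers, a node pool sized for the pre-spike load would leave many replicas unschedulable, matching the pending workers observed on the Ray dashboard; expanding the pool raises the schedulable replica count.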
Following the node pool expansion, a new pod-related error has been identified. Our Engineering team is actively working to resolve this issue and restore full service performance.
Serverless inference for alibaba-qwen3-32b (Qwen 3 32B) in tor1 has been experiencing high error rates since 10:46 UTC.