Outage in DigitalOcean

Gradient AI model availability

Resolved Minor

March 17, 2026 - Started about 1 month ago - Lasted about 5 hours
Official incident page

Incident Report

Summary AI Generated

DigitalOcean's Gradient AI service experienced availability issues affecting multiple AI models including Llama3.1-8b, Qwen3-32b, several embedding models, and Guardrails functionality, while the Llama3.3-70b model suffered degraded performance. The incident lasted 4.8 hours and impacted users' ability to access models and run inference operations. Engineering teams implemented a fix that fully restored service and returned all affected models to healthy status.



Latest Updates (sorted most recent first)

RESOLVED about 1 month ago - at 03/17/2026 07:49PM

Our Engineering team has implemented a fix, and the issues impacting model availability and performance have been resolved. All models, including those previously degraded, are back up and healthy. Service has been fully restored.

INVESTIGATING about 1 month ago - at 03/17/2026 03:00PM

Our Engineering team is investigating reports of Gradient AI model availability issues impacting multiple models. Users may experience issues with model availability, including Llama3.1-8b and Qwen3-32b, as well as embedding models such as GTE Large (v1.5), All-MiniLM-L6-v2, Multi-QA-mpnet-base-dot-v1, and Qwen3 Embedding 0.6B.

Additionally, Guardrails are not available, affecting associated agents, and users attempting to run inference on the Llama3.3-70b model will see degraded performance.

We apologize for the inconvenience and will share an update once we have more information.
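
The 4.8-hour duration quoted in the summary can be checked against the two update timestamps above. The following is a minimal sketch in Python, assuming both timestamps are in the same time zone (the page does not state one) and that the first update marks the start of the incident. The function name `incident_duration_hours` is hypothetical and used only for illustration.

```python
from datetime import datetime

# Timestamp format used in the update log, e.g. "03/17/2026 07:49PM".
STAMP_FORMAT = "%m/%d/%Y %I:%M%p"


def incident_duration_hours(started: str, resolved: str) -> float:
    """Return the elapsed hours between two update-log timestamps."""
    start = datetime.strptime(started, STAMP_FORMAT)
    end = datetime.strptime(resolved, STAMP_FORMAT)
    return (end - start).total_seconds() / 3600


# Timestamps copied from the INVESTIGATING and RESOLVED updates.
hours = incident_duration_hours("03/17/2026 03:00PM", "03/17/2026 07:49PM")
print(round(hours, 1))  # prints 4.8
```

The gap between the two updates is 4 hours 49 minutes, which rounds to the 4.8 hours reported in the summary.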

Latest DigitalOcean outages

Elevated 5xx “context canceled” errors impacting serverless inference - 2 days ago

Serverless Inference - Intermittent Rate Limiting Affecting Some Customers Using Anthropic Models - 3 days ago

Intermittent errors impacting some Serverless Inference models in ATL1 - 6 days ago

App Platform Deployments - 7 days ago

Cloud UI for Managed Kubernetes - 7 days ago
