DigitalOcean's Gradient AI service experienced availability issues affecting multiple AI models including Llama3.1-8b, Qwen3-32b, several embedding models, and Guardrails functionality, while the Llama3.3-70b model suffered degraded performance. The incident lasted 4.8 hours and impacted users' ability to access models and run inference operations. Engineering teams implemented a fix that fully restored service and returned all affected models to healthy status.
Our Engineering team has implemented a fix, the issues impacting model availability and performance have been resolved. All models, including those previously degraded, are back up and healthy. Service has been fully restored.
Our Engineering team is investigating reports of Gradient AI model availability issues impacting multiple models. Users may experience issues with models availability, including Llama3.1-8b and Qwen3-32b, as well as embedding models such as GTE Large (v1.5), All-MiniLM-L6-v2, Multi-QA-mpnet-base-dot-v1, and Qwen3 Embedding 0.6B.
Additionally, Guardrails are not available, affecting associated agents, and users attempting to run inference on the Llama3.3-70b model will see degraded performance.
We apologize for the inconvenience and will share an update once we have more information.
With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.
Start free trialNo credit card required · Cancel anytime · 6020 services available
Integrations with