IONOS Cloud's AI Modelhub experienced performance degradation for 9.3 hours due to increased traffic volumes causing capacity constraints, particularly affecting the GPT-OSS 120B model. Users encountered longer response times and intermittent timeouts when accessing affected models. The issue was resolved by scaling capacity after the team identified the source of increased load.
Trusted by 1,000+ teams
Stop finding out about outages from your users. Monitor 6,320+ cloud services and get alerted the second something breaks.
Response times of the model have improved significantly. We are marking the incident as resolved.
Our AI Model hub Team has identified a likely culprit for the increased load. The team is working towards increasing capacity to ensure that GPT-OSS 120B stays available for all customers. Customers may still experience intermittent timeouts.
We are experiencing increased traffic volumes for specific models, including GPT-OSS 120B, which is currently causing capacity constraints. Users may encounter longer response times or intermittent timeouts. Our AI Modelhub Team is actively working to scale capacity and resolve these issues
With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.
Start free trialNo credit card required · Cancel anytime · 6320 services available
Integrations with