March 2026: AI Model Hub: Increased Error Rate
IONOS Cloud's AI Model Hub experienced increased error rates for the Llama 3.1 405B Instruct model. The cause was hardware degradation, followed by capacity constraints after a hardware failure. The incident affected the model's performance and reliability for 58.9 hours. Service was restored with capacity constraints still in place, and users who continued to experience issues were advised to use GPT-OSS 120B as a temporary alternative while optimizations were being deployed.
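For clients that want to automate the recommended switch to the alternative model, the following is a minimal sketch of a fallback wrapper, not an official IONOS client. The model identifier strings, the `send` callable, and the function name `complete_with_fallback` are all hypothetical placeholders; the real identifiers and request code should come from the AI Model Hub documentation.

```python
from typing import Callable, Optional, Sequence, Tuple

# Hypothetical identifiers: look up the real names in the model catalog.
PRIMARY_MODEL = "llama-3.1-405b-instruct"
FALLBACK_MODEL = "gpt-oss-120b"


def complete_with_fallback(
    prompt: str,
    send: Callable[[str, str], str],
    models: Sequence[str] = (PRIMARY_MODEL, FALLBACK_MODEL),
) -> Tuple[str, str]:
    """Try each model in order and return (model_used, response_text).

    `send(model, prompt)` is supplied by the caller and is assumed to
    raise an exception when the request fails.
    """
    last_error: Optional[Exception] = None
    for model in models:
        try:
            return model, send(model, prompt)
        except Exception as exc:  # real code should catch the client's specific errors
            last_error = exc
    raise RuntimeError("all models failed") from last_error


# Usage with a stand-in sender that simulates the primary model failing.
def flaky_send(model: str, prompt: str) -> str:
    if model == PRIMARY_MODEL:
        raise TimeoutError("simulated capacity error")
    return f"[{model}] echo: {prompt}"


model_used, text = complete_with_fallback("ping", flaky_send)
```

Passing the request function in as `send` keeps the sketch independent of any particular HTTP client, so the fallback order can be tested without network access.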