Grafana's IRM service experienced degraded performance for 9.9 hours due to elevated 500 API responses, specifically affecting label handling functionality across multiple regions including US-Central and EU-West. The issue impacted Alertmanager and Rules Configuration APIs but did not affect core IRM access or alert ingestion/notification/delivery capabilities. The incident was resolved after deploying a fix to the IRM application that restored service for affected customers experiencing label-related issues.
Trusted by 1,000+ teams
Stop finding out about outages from your users. Monitor 6,320+ cloud services and get alerted the second something breaks.
This incident has been resolved.
We've released a fix to the IRM app that should restore service for affected customers with issues related to labels. Thanks for your patience while investigating. We're continuing to monitor as we confirm the resolution in place.
We are continuing to work on a fix for this. To further clarify, this issue is not about accessing IRM or alert ingestion/notification/delivery, but rather with handling labels.
The degraded performance is about labels, and we have seen this degradation in more regions.
We are continuing to work on a fix for this issue.
The issue has been identified and a fix is being implemented.
We are continuing to investigate this issue.
We are experiencing access issues in IRM as there are elevated 500 API responses in prod-us-central-0.
With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.
Start free trialNo credit card required · Cancel anytime · 6320 services available
Integrations with