
Outage in Grafana

Prometheus writes, Logs, and Synthetic Monitoring in prod-eu-west-3 are degraded

Resolved Minor
March 24, 2026 - Started 3 days ago - Lasted 1 day

Incident Report

Summary (AI generated)

Grafana experienced degraded Prometheus writes in the prod-eu-west-3 region starting at 08:45Z on March 24, 2026. A fix was implemented that morning, but write path errors returned the next day and the impact expanded to Logs and Synthetic Monitoring. The incident affected ingestion, API, and public probes: Synthetic Monitoring users could see errors pushing check execution metrics and potentially missed alerts, and Logs users could see gaps in recording rules because remote writes to Mimir were delayed. The incident was marked resolved about 27.7 hours after the first status update.

We are currently experiencing degraded writes for mimir-prod-22 in prod-eu-west-3 since 08:45Z.

Latest Updates (most recent first)
RESOLVED 1 day ago - at 03/25/2026 12:52PM

This incident has been resolved.

INVESTIGATING 2 days ago - at 03/25/2026 07:43AM

This is also now impacting Logs and Synthetic Monitoring in prod-eu-west-3.

For Synthetic Monitoring, users might observe errors pushing check execution metrics, and this can eventually lead to missing data.
In addition, users might observe errors when evaluating provisioned Synthetic Monitoring alert rules, which can lead to missed alerts.

For Logs, there is no immediate impact on alerts. However, remote writes to Mimir are delayed, which means users may see gaps in their recording rules.
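
To check whether a recording rule was affected, one option is to scan its output series for missing evaluation points. The following is a minimal sketch in Python, assuming the sample timestamps (Unix seconds) for the series have already been exported and the rule's evaluation interval is known. The function name `find_gaps` and the sample data are hypothetical and are not part of any Grafana or Mimir API.

```python
from typing import List, Tuple


def find_gaps(
    timestamps: List[float], interval: float, tolerance: float = 0.5
) -> List[Tuple[float, float]]:
    """Return (previous, next) timestamp pairs spaced further apart than expected.

    A spacing counts as a gap when it exceeds interval * (1 + tolerance).
    """
    ordered = sorted(timestamps)
    threshold = interval * (1 + tolerance)
    return [(a, b) for a, b in zip(ordered, ordered[1:]) if b - a > threshold]


# Hypothetical series evaluated every 60 s, with samples missing between 180 and 420.
samples = [0, 60, 120, 180, 420, 480, 540]
gaps = find_gaps(samples, interval=60)
print(gaps)
```

The tolerance is there so that small scheduling jitter between evaluations is not reported as a gap; the right value depends on the rule group's interval.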

INVESTIGATING 2 days ago - at 03/25/2026 07:04AM

We are moving this back to 'Investigating' because we are now observing a substantial drop in successful ingestion, an increase in write path errors, and elevated rule evaluation latency and errors. Reads are mostly fine. Our Engineering team is actively investigating, and we will provide further updates as the investigation progresses.

MONITORING 2 days ago - at 03/24/2026 09:23PM

We have not observed any recent errors, but we will continue to monitor while we work with our CSP.

MONITORING 3 days ago - at 03/24/2026 09:19AM

A fix has been implemented and we are monitoring the results.

INVESTIGATING 3 days ago - at 03/24/2026 09:08AM

We are currently experiencing degraded writes for mimir-prod-22 in prod-eu-west-3 since 08:45Z.
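
The durations quoted on this page can be checked against the update timestamps above. The following is a minimal sketch in Python. It assumes all update timestamps share one time zone (the page does not state which), and the helper name `hours_between` is illustrative only.

```python
from datetime import datetime

# Timestamp format used in the update log, e.g. "03/25/2026 12:52PM".
STAMP_FORMAT = "%m/%d/%Y %I:%M%p"


def hours_between(start: str, end: str) -> float:
    """Return the elapsed hours between two update timestamps.

    Assumes both timestamps are in the same (unstated) time zone.
    """
    delta = datetime.strptime(end, STAMP_FORMAT) - datetime.strptime(start, STAMP_FORMAT)
    return delta.total_seconds() / 3600


first_update = "03/24/2026 09:08AM"  # first INVESTIGATING update
regression = "03/25/2026 07:04AM"    # moved back to INVESTIGATING
resolved = "03/25/2026 12:52PM"      # RESOLVED update

total_hours = hours_between(first_update, resolved)
regression_hours = hours_between(regression, resolved)
print(f"first update to resolution: {total_hours:.1f} h")
print(f"regression to resolution: {regression_hours:.1f} h")
```

Under that assumption the first figure comes out at about 27.7 hours, which matches the summary and suggests the summary measures from the first posted update rather than from the 08:45Z start of the degradation.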
