Outage in Grafana

Prometheus writes, Logs, and Synthetic Monitoring in prod-eu-west-3 are degraded

Resolved Minor

March 24, 2026 - Started 26 days ago - Lasted 1 day
Official incident page

Incident Report

Summary AI Generated

Grafana experienced degraded Prometheus writes in the prod-eu-west-3 region starting at 08:45Z, which later expanded to impact Logs and Synthetic Monitoring services. The incident affected ingestion, API, and public probes, causing errors in check execution metrics, potential missed alerts for Synthetic Monitoring, and gaps in recording rules for Logs due to delayed remote writes to Mimir. The issue was resolved after 27.7 hours with a fix implemented by the engineering team.

We are currently experiencing degraded writes for mimir-prod-22 in prod-eu-west-3 since 08:45Z.

Components affected

Grafana Azure Netherlands - prod-eu-west-3: Ingestion Grafana Azure Netherlands - prod-eu-west-3: Public Probes Grafana Azure Netherlands - prod-eu-west-3 Grafana Azure Netherlands - prod-eu-west-3: API

Trusted by 1,000+ teams

The Status Page Aggregator with Early Outage Detection

Stop finding out about outages from your users. Monitor 6,320+ cloud services and get alerted the second something breaks.

Start Free Trial

No credit card
14-day trial
2-minute setup

Latest Updates ( sorted recent to last )

RESOLVED 24 days ago - at 03/25/2026 12:52PM

This incident has been resolved.

INVESTIGATING 25 days ago - at 03/25/2026 07:43AM

This is also now impacting Logs and Synthetic Monitoring in prod-eu-west-3.

For Synthetic Monitoring, users might observe errors pushing check execution metrics, and this can eventually lead to missing data.
In addition, users might observe errors evaluating Synthetic Monitoring provisioned alert rule evaluations, and this can lead to missed alerts.

For Logs, there is no immediate impact on alerts, however, remote writes to Mimir is delayed which means users may see gaps in their recording rules.

INVESTIGATING 25 days ago - at 03/25/2026 07:04AM

We are moving this back to 'Investigating' as we are now observing a substantial drop in successful ingestion and increase in write path errors, and elevated rule evaluation latency and error. Reads are mostly fine. Our Engineering team is actively investigating this and we will provide further updates as our investigation progresses.

MONITORING 25 days ago - at 03/24/2026 09:23PM

We have not observed any recent errors, but we will continue to monitor while we work with our CSP.

MONITORING 26 days ago - at 03/24/2026 09:19AM

A fix has been implemented and we are monitoring the results.

INVESTIGATING 26 days ago - at 03/24/2026 09:08AM

We are currently experiencing degraded writes for mimir-prod-22 in prod-eu-west-3 since 08:45Z.

Latest Grafana outages

Query Caching - Degraded Performance - 1 day ago

Issues on Stack creation - 2 days ago

Degraded Ticket Visibility in Support System - 3 days ago

K6 Sporadic DNS Issues - 5 days ago

Grafana Cloud Logs - Write degradation in us-east-3 - 8 days ago

The Status Page Aggregator with Early Outage Detection

With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.

Start free trial

No credit card required · Cancel anytime · 6320 services available

Integrations with Slack Microsoft Teams Google Chat Datadog PagerDuty Zapier Discord Webhook