Grafana Outage History

Every past Grafana outage tracked by IsDown, with detection times, duration, and resolution details.

There have been 720 Grafana outages since July 2018. The 199 outages from the last 12 months are summarized below, with incident details, duration, and resolution information.

Minor May 8, 2026

May 2026: Cloud Metrics - High Write Latency and Errors in prod-us-central-7

Detected May 8, 2026 9:16 PM UTC · Resolved May 8, 2026 10:33 PM UTC · Duration about 1 hour

Grafana Cloud Metrics in the prod-us-central-7 region experienced high write latency and errors affecting data ingestion and rule evaluation from approximately 20:40-21:00 UTC. Users encountered slow performance and errors when writing metrics data during this 20-minute window. The team identified the root cause, implemented a fix, and confirmed stability after monitoring the system for an additional 1.5 hours.

Major May 7, 2026

May 2026: Metrics read errors in prod-ap-south-1 region

Detected May 7, 2026 7:18 AM UTC · Resolved May 7, 2026 7:59 AM UTC · Duration 41 minutes

Grafana experienced metrics read errors in the prod-ap-south-1 region starting at 06:24 UTC, causing users to encounter error messages when querying metrics. The issue affected querying in the AWS India region, and the incident was tracked for 41 minutes between detection and resolution. Engineering released a fix at 07:50 UTC, resolving the query errors and restoring normal service.

Minor May 6, 2026

May 2026: Datasource Query Performance Issues

Detected May 6, 2026 8:07 PM UTC · Resolved May 6, 2026 8:37 PM UTC · Duration 30 minutes

Grafana experienced datasource query performance issues in the prod-us-east-4 region, affecting Grafana Cloud Integrations. The incident lasted 30 minutes from detection to resolution and was classified as minor. The team investigated and resolved the performance degradation that was slowing query response times.

Minor May 5, 2026

May 2026: Elevated Error Rate of Browser Checks in PoP Oregon

Detected May 5, 2026 4:11 PM UTC · Resolved May 5, 2026 8:15 PM UTC · Duration about 4 hours

Grafana experienced elevated error rates affecting browser checks in the PoP Oregon region, impacting the AWS US East prod-us-east-0 Public Probes component. The team identified the root cause and implemented a fix after 4.1 hours, with services recovering during the monitoring phase before full resolution was confirmed.

Major May 4, 2026

May 2026: k6 Partial Outage

Detected May 4, 2026 10:58 PM UTC · Resolved May 5, 2026 2:13 AM UTC · Duration about 3 hours

Grafana experienced a major outage affecting k6 and Synthetic Monitoring services across all global regions, impacting APIs and public probes on AWS, Azure, and GCP platforms. The incident lasted 3.3 hours before a fix was implemented and services were restored. The team successfully resolved the issue after identifying the root cause and monitoring the recovery.

Major May 1, 2026

May 2026: Ingestion Errors for AWS Cloud Provider Observability Metric Streams in prod-us-central-7

Detected May 1, 2026 9:14 AM UTC · Resolved May 1, 2026 10:30 AM UTC · Duration about 1 hour

Grafana experienced ingestion errors for AWS Cloud Provider Observability Metric Streams in the prod-us-central-7 region starting around 06:30 UTC. Users in this region encountered metric ingestion failures when using AWS Metric Streams. The issue was resolved after 1.3 hours with a fix implemented by engineering.

Minor April 28, 2026

April 2026: Investigating Issues Saving SQL Datasource Credentials

Detected Apr 28, 2026 6:46 PM UTC · Resolved Apr 29, 2026 1:38 PM UTC · Duration about 19 hours

Grafana experienced an 18.9-hour incident in which users were unable to save credentials for SQL-based data sources, affecting a subset of customers across multiple regions. The issue impacted Grafana Cloud Integrations, preventing proper configuration of SQL datasource connections. The incident was resolved after the team identified the root cause and implemented a fix.

Minor April 28, 2026

April 2026: Gateway Slowness Detected in Prod (US-East-1)

Detected Apr 28, 2026 9:20 AM UTC · Resolved Apr 30, 2026 3:12 PM UTC · Duration 2 days

Grafana reported suspected gateway slowness in the US-East-1 production environment, with a drop in successful requests that could have prevented users from accessing their instances. The incident stayed open for 53.9 hours but was ultimately determined to be a false alarm that did not affect any users.

Major April 27, 2026

April 2026: InfluxDB Datasource - Intermittent Failures

Detected Apr 27, 2026 5:08 PM UTC · Resolved Apr 27, 2026 11:25 PM UTC · Duration about 6 hours

The Grafana InfluxDB datasource plugin experienced intermittent failures affecting Grafana Cloud Integrations for 6.3 hours. Users encountered connection issues when trying to access data through the InfluxDB plugin. The team identified the root cause, implemented a fix, and fully resolved the incident after monitoring to confirm service recovery.

Major April 23, 2026

April 2026: CloudWatch Datasource Outage

Detected Apr 23, 2026 2:26 PM UTC · Resolved Apr 23, 2026 8:02 PM UTC · Duration about 6 hours

Grafana experienced a major outage affecting CloudWatch datasources in the GCP US Central region for 5.6 hours. Users were unable to access or query data from their CloudWatch datasources during this period. The issue was resolved after the team implemented a fix and confirmed full service recovery.
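
The decimal-hour figures quoted in the summaries above (53.9 hours, 5.6 hours, and so on) appear to be the gap between each incident's Detected and Resolved timestamps. The Python sketch below reproduces that arithmetic under that assumption. The function name `duration_hours` and the format string are illustrative and not part of any IsDown or Grafana API, and the parsing assumes an English locale for month and AM/PM names.

```python
from datetime import datetime, timezone

# Timestamp layout as it appears in the listing, e.g. "Apr 28, 2026 9:20 AM".
# Assumption: all times are UTC, as the listing states.
FMT = "%b %d, %Y %I:%M %p"


def duration_hours(detected: str, resolved: str) -> float:
    """Return the hours between two listing timestamps, rounded to one decimal."""
    start = datetime.strptime(detected, FMT).replace(tzinfo=timezone.utc)
    end = datetime.strptime(resolved, FMT).replace(tzinfo=timezone.utc)
    return round((end - start).total_seconds() / 3600, 1)


# Gateway slowness incident: listed as "2 days" and 53.9 hours.
print(duration_hours("Apr 28, 2026 9:20 AM", "Apr 30, 2026 3:12 PM"))  # 53.9

# CloudWatch datasource outage: listed as "about 6 hours" and 5.6 hours.
print(duration_hours("Apr 23, 2026 2:26 PM", "Apr 23, 2026 8:02 PM"))  # 5.6
```

Note that this measures detection to resolution only. Several summaries mention an earlier start of impact (for example 06:24 UTC for an incident detected at 7:18 AM UTC), so the user-facing impact can be longer than the listed duration.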