PostHog experienced a 15.1-hour event ingestion delay caused by a shard that degraded during routine maintenance and by elevated part counts on ClickHouse, which led to insert rejections and Kafka consumer lag. Events took longer than usual to appear in PostHog apps and queries, but no data was lost. Once the root cause was identified and resolved, ingestion resumed and the backlog was processed in roughly 2 hours.
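Consumer lag of the kind described above is the gap between each partition's latest offset and the offset the consumer group has committed. A minimal sketch of how that gap can be measured with the kafka-python library follows; the broker address, topic, and group names are illustrative assumptions, not PostHog's actual configuration.

```python
# Minimal consumer-lag check with kafka-python.
# All names below (broker, group, topic) are assumed for illustration.
from kafka import KafkaConsumer, TopicPartition

GROUP = "events-ingestion"          # hypothetical consumer group
TOPIC = "events_plugin_ingestion"   # hypothetical topic

consumer = KafkaConsumer(
    bootstrap_servers="localhost:9092",
    group_id=GROUP,
    enable_auto_commit=False,
)

partitions = [TopicPartition(TOPIC, p) for p in consumer.partitions_for_topic(TOPIC)]
end_offsets = consumer.end_offsets(partitions)  # latest offset per partition

total_lag = 0
for tp in partitions:
    committed = consumer.committed(tp) or 0     # last offset the group committed
    lag = end_offsets[tp] - committed
    total_lag += lag
    print(f"partition {tp.partition}: lag={lag}")

print(f"total lag across {len(partitions)} partitions: {total_lag}")
consumer.close()
```

A steadily growing total is the signal reported during this incident: producers kept writing events while inserts into ClickHouse were being rejected, so committed offsets fell further behind the log head.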
We are still processing the ingestion queue; we should be fully caught up in about 2 hours.
We have identified the root cause of the ingestion lag and cluster overload, and have resolved the issue.
We have now resumed ingestion and are working through the event ingestion backlog.
During routine maintenance, a shard entered a degraded performance state, causing us to fall behind on ingesting data. We are working to remedy the issue and will report back as soon as a fix is in place.
EU event ingestion experienced delays due to elevated part counts on ClickHouse. The high part count caused some insert rejections, leading to Kafka consumer lag on event processing. Replication queues have been restarted and merge backlogs are draining. Part counts are returning to normal.
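The insert rejections mentioned here match standard MergeTree back-pressure in ClickHouse: when a partition accumulates more active parts than the parts_to_throw_insert setting allows, inserts fail with a "Too many parts" error until background merges drain the backlog. A small sketch for spotting partitions with elevated part counts, assuming the clickhouse-driver Python client and an illustrative host:

```python
# Sketch: find partitions with the most active parts in ClickHouse.
# Host and connection details are assumptions for illustration.
from clickhouse_driver import Client

client = Client(host="localhost")

rows = client.execute(
    """
    SELECT database, table, partition, count() AS active_parts
    FROM system.parts
    WHERE active
    GROUP BY database, table, partition
    ORDER BY active_parts DESC
    LIMIT 10
    """
)
for database, table, partition, active_parts in rows:
    print(f"{database}.{table} partition {partition}: {active_parts} active parts")
```

Watching this count fall back toward normal is what the update above describes as merge backlogs draining.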
We’ve identified processing delays in the event ingestion pipeline. Events may take longer than usual to appear in the product. Data is not lost but may not show in PostHog apps and queries until the processing delay is resolved.