Use cases
Software Products E-commerce MSPs Schools Development & Marketing DevOps Agencies Help Desk
Company
Internet Status Blog Pricing Log in Get started free

LiveKit Outage History

Every past LiveKit outage tracked by IsDown, with detection times, duration, and resolution details.

There were 80 LiveKit outages since February 2022. The 45 outages from the last 12 months are summarized below, with incident details, duration, and resolution information.

Major May 28, 2026

May 2026: Elevated Reports of Participant Connection Latency and Errors In US East Region

Detected May 28, 2026 10:53 AM EDT · Resolved May 28, 2026 4:15 PM EDT · Duration about 5 hours

LiveKit experienced elevated participant connection latency and errors in the US East region due to database connection timeouts, affecting all services in that region for 5.4 hours. The issue was resolved by draining traffic from US East and routing it to other regions, which eliminated the connection errors. Service returned to normal operation with traffic continuing to route through alternative regions while US East was being restored.

Major May 26, 2026

May 2026: Partial data outage on LiveKit Cloud Dashboard

Detected May 26, 2026 3:22 PM EDT · Resolved May 26, 2026 5:35 PM EDT · Duration about 2 hours

LiveKit Cloud Dashboard experienced a partial data outage affecting Agent Observability access for some sessions and causing ended sessions to incorrectly display as active in the 24-hour filter. The issue was identified as partial data unavailability, and a fix was implemented to prevent further data loss while missing data was backfilled. The incident was resolved after 2.2 hours with all missing data successfully restored.

Minor May 22, 2026

May 2026: Network instability in Frankfurt

Detected May 22, 2026 4:26 PM EDT · Resolved May 23, 2026 12:37 PM EDT · Duration about 20 hours

LiveKit experienced network instability in Frankfurt starting at 18:15 UTC that impacted cross-region traffic. Traffic was redirected away from Frankfurt by 19:55 UTC, causing participants who originally connected through that region to experience slightly increased latency. The Frankfurt cluster was re-enabled after 20.2 hours and monitored for 16 additional hours before the incident was fully resolved.

Minor May 15, 2026

May 2026: Intermittent connectivity disruptions in US regions

Detected May 15, 2026 12:57 PM EDT · Resolved May 15, 2026 10:50 PM EDT · Duration about 10 hours

LiveKit experienced intermittent connectivity disruptions affecting a subset of sessions in US regions, particularly US East, due to internal network degradation between clusters. The disruptions caused brief interruptions (less than 5 minutes) to media relay across multiple sessions at various times between May 12-15, 2026. The incident was resolved after 9.9 hours when no further connectivity issues were observed, with the team continuing to investigate why their automatic reroute mechanism failed to activate during these isolated instances.

Major May 6, 2026

May 2026: Outbound TLS calls are failing

Detected May 6, 2026 3:46 PM EDT · Resolved May 6, 2026 7:04 PM EDT · Duration about 3 hours

LiveKit's Global SIP service experienced a major outage where outbound TLS calls were failing for approximately 3.3 hours. The issue was initially thought to be Twilio-specific but was later determined to affect all outbound TLS calls regardless of provider. A fix was implemented and the incident was resolved after monitoring confirmed no further errors.

Minor May 6, 2026

May 2026: Cloud Agents experiencing deployment failures in US East

Detected May 6, 2026 12:01 AM EDT · Resolved May 6, 2026 5:50 AM EDT · Duration about 6 hours

LiveKit Cloud Agents in the US East region experienced deployment failures and degraded performance for 5.7 hours due to an etcd service outage that caused Kubernetes API server unresponsiveness. The incident initially affected only new agent deployments but later expanded to impact existing deployments as well. The service was fully restored after the Kubernetes infrastructure was repaired with assistance from the data center provider.

Minor April 24, 2026

April 2026: Investigating SIP participant timeouts and signalling connection errors

Detected Apr 24, 2026 2:12 PM EDT · Resolved Apr 29, 2026 10:44 AM EDT · Duration 5 days

LiveKit experienced SIP participant timeouts in Singapore and increased signaling connection errors primarily affecting India, with potential impact on connection latency and inbound SIP calls across multiple regions including Australia, US Central, and South Africa. The issue affected a very small minority of requests and was resolved through fine-tuning adjustments to improve stack performance. The incident lasted 116.5 hours with minimal user reports throughout the duration.

Minor April 15, 2026

April 2026: Increased Latency in RoomService APIs

Detected Apr 15, 2026 7:23 PM EDT · Resolved Apr 16, 2026 12:10 AM EDT · Duration about 5 hours

LiveKit experienced increased latencies in RoomService APIs (CreateRoom, DeleteRoom, and UpdateRoomMetadata) that initially affected the US West region but later expanded to other regions, caused by issues with a specific table in their distributed database. During the 4.8-hour incident, users experienced slow API responses and a temporary 10% failure rate when the fix was being applied. The issue was fully resolved after implementing a workaround and working with the database vendor.

Major April 3, 2026

April 2026: LiveKit Cloud Dashboard Down

Detected Apr 3, 2026 1:13 PM EDT · Resolved Apr 3, 2026 1:26 PM EDT · Duration 13 minutes

The LiveKit Cloud Dashboard (cloud.livekit.io) experienced a major outage that prevented users from accessing the service. The incident lasted 13 minutes before a fix was implemented and normal access was restored.

Minor April 1, 2026

April 2026: Degraded Connectivity Issues – EU (Frankfurt)

Detected Apr 1, 2026 6:37 AM EDT · Resolved Apr 1, 2026 9:05 AM EDT · Duration about 2 hours

LiveKit experienced degraded connectivity issues affecting Real Time Communication and TURN services in the EU (Frankfurt) region for 2.5 hours. The service team identified the root cause and implemented a fix to restore normal connectivity. The incident was resolved after a monitoring period to confirm service stability.