Outage in elastic.io

System Down - Google Kubernetes Service

Resolved Minor
January 20, 2023 - Started almost 2 years ago - Lasted about 19 hours
Official incident page

Need to monitor elastic.io outages?
Stay on top of outages with IsDown. Monitor the official status pages of all your vendors, SaaS, and tools, including elastic.io, and never miss an outage again.
Start Free Trial

Outage Details

We are investigating the reports on ordinary flow start/stop delays. This seems to be connected with the Google Cloud Platform and Kubernetes service malfunction. Our team is investigating the situation.
Latest Updates ( sorted recent to last )
RESOLVED almost 2 years ago - at 01/21/2023 10:04AM

System is back to full operation.

MONITORING almost 2 years ago - at 01/21/2023 04:17AM

The platform is re-launched and service is resumed. Unfortunately with reduced capacity. Significant delays in processing data are to be expected, especially where message sizes are large.

Data loss is believed to be minimal - our webhook and queueing system were down only shortly while redeploying platform versions.
We apologise sincerely for the disruption in service and will continue to monitor and optimise over the weekend. Delays in data processing are to be expected over the weekend as we work to clear the message backlog.

Kind regards
Your elastic.io Team

MONITORING almost 2 years ago - at 01/20/2023 09:58PM

Most of the flows have started. We are monitoring the situation.

IDENTIFIED almost 2 years ago - at 01/20/2023 09:31PM

We are starting the integration flows in batches. System is stable.

IDENTIFIED almost 2 years ago - at 01/20/2023 08:58PM

The platform remains down and rebuilding is taking longer than expected. The team is working full speed to re-launch the platform.

Thank you for your patience.

IDENTIFIED almost 2 years ago - at 01/20/2023 07:16PM

We have re-created the cluster and recovering the integration flows.

IDENTIFIED almost 2 years ago - at 01/20/2023 05:44PM

The issue is with our Kubernetes Master node. We have been working with Google support to find a solution and we are now rebuilding the platform and hope to be in full service mode asap.

We are doing everything possible to minimise any possible data loss.

We will keep you informed as situation improves.

IDENTIFIED almost 2 years ago - at 01/20/2023 05:13PM

We are in the middle of recovery.

INVESTIGATING almost 2 years ago - at 01/20/2023 04:01PM

We are continuing to investigate this issue.

INVESTIGATING almost 2 years ago - at 01/20/2023 03:48PM

We are in touch with Google support to resolve this a.s.a.p. - Kubernetes API server seems to be the issue

INVESTIGATING almost 2 years ago - at 01/20/2023 03:41PM

All our operations are affected right now.

INVESTIGATING almost 2 years ago - at 01/20/2023 02:57PM

We are investigating the reports on ordinary flow start/stop delays. This seems to be connected with the Google Cloud Platform and Kubernetes service malfunction. Our team is investigating the situation.

Latest elastic.io outages

Delays with sample retrieval - almost 2 years ago
Flow start/stop process is delayed - almost 2 years ago
Flow executions delayed - about 2 years ago
Logs delayed - over 2 years ago

Vendor Downtime? Keep Your Team Informed with an Internal Status Page

With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.

Start free trial

No credit card required · Cancel anytime · 3263 services available

Integrations with Slack Microsoft Teams Google Chat Datadog PagerDuty Zapier Discord Webhook

Setup in 5 minutes or less

How much time you'll save your team, by having the outages information close to them?

14-day free trial · No credit card required · Cancel anytime