Outage in LiftIgniter

Issues with services in US East due to capacity issues with cloud provider

Resolved Minor
Started April 08, 2022
Outage Details

Due to capacity issues at our cloud provider (Google Cloud) in US East, some of our services have experienced issues. Our query endpoint (query.petametrics.com), which serves recommendations, saw its rate of 503 errors rise above 1%, briefly peaking at 2.6%. Error rates were nonzero between 14:23 and 14:38 UTC and returned to zero after we provisioned alternate capacity. Successful requests also saw increased latency during this period. We are currently investigating the impact on some of our other services, including a service used for rendering emails.
Latest Updates (most recent first)
RESOLVED - 04/09/2022 12:59AM

Capacity is back to normal and all services are operating normally. We have identified improvements that will make our systems more robust to similar issues.

MONITORING - 04/08/2022 03:35PM

All our services are back to working normally. We are still waiting for the underlying capacity issues to be fixed, and will be reviewing our setup to see how we can reduce the impact of such incidents in the future.

IDENTIFIED - 04/08/2022 03:15PM

As of 15:04 UTC, our email-rendering services are back online, so all our front-facing services are now working properly.

We have identified that the capacity issue is affecting one of our backend services used for managing user histories, and we are continuing to investigate.

IDENTIFIED - 04/08/2022 02:50PM

Due to capacity issues at our cloud provider (Google Cloud) in US East, some of our services have experienced issues.

Our query endpoint (query.petametrics.com), which serves recommendations, saw its rate of 503 errors rise above 1%, briefly peaking at 2.6%. Error rates were nonzero between 14:23 and 14:38 UTC and returned to zero after we provisioned alternate capacity. Successful requests also saw increased latency during this period.

We are currently investigating the impact on some of our other services, including a service used for rendering emails.