Outage in GitLab

Pipelines not completing

Resolved Major
October 08, 2024 - Started about 2 months ago - Lasted 1 day

Need to monitor GitLab outages?
Stay on top of outages with IsDown. Monitor the official status pages of all your vendors, SaaS, and tools, including GitLab, and never miss an outage again.
Start Free Trial

Outage Details

We are currently investigating an issue where pipelines are not progressing between stages. This causes them to remain in a running state indefinitely, and merge requests are not being marked as ready. https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676
Components affected
GitLab Background Processing
Latest Updates ( sorted recent to last )
about 2 months ago - at 10/08/2024 03:39PM

We are currently investigating an issue where pipelines are not progressing between stages. This causes them to remain in a running state indefinitely, and merge requests are not being marked as ready. https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676

about 2 months ago - at 10/08/2024 04:23PM

We continue to investigate potential causes of this issue. Please follow https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676 for the latest updates.

about 2 months ago - at 10/08/2024 04:48PM

Work is still in progress to identify the cause of service degradation to pipelines. Update frequency will be increased to every 30 minutes or as soon as we have material updates to share. Please follow https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676 for more information.

about 2 months ago - at 10/08/2024 05:25PM

Work is ongoing to identify the root of this issue, including an analysis of our Redis infrastructure. Please continue to follow https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676.

about 2 months ago - at 10/08/2024 05:55PM

Investigation continues. Remember restarting the pipeline or individual jobs may serve as a workaround. See https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676.

about 2 months ago - at 10/08/2024 06:29PM

We are currently investigating errors on a specific Redis shard that may be contributing to the problem. For the latest updates and detailed information, please check https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676.

about 2 months ago - at 10/08/2024 07:23PM

We're taking corrective actions to alleviate some unexpected pressure in one of our Redis shards, which may alleviate the problem. See https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676 for more details.

about 2 months ago - at 10/08/2024 08:24PM

After alleviating the saturation reported in our Redis infrastructure we are seeing readings go back to healthy levels. We will monitor now for new occurrences and reports to confirm this is resolved. Please review https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676 for details.

about 2 months ago - at 10/08/2024 11:57PM

Despite resolving the Redis saturation issue, we're still receiving reports of hanging pipelines on GitLab.com. We understand the frustration this causes and are urgently investigating. Updates at https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676.

about 2 months ago - at 10/09/2024 12:02AM

Reminder: If you have urgent pipelines affected by the ongoing issue, you can cancel and retry them. A retry may make the pipeline run properly. We're still working on a permanent fix. See https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676.

about 2 months ago - at 10/09/2024 01:02AM

Investigation continues. Remember, restarting the pipeline on individual jobs may serve as a workaround. See https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676.

about 2 months ago - at 10/09/2024 01:26AM

No material updates to report. We continue to investigate potential causes of this issue. Please follow https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676 for the latest updates.

about 2 months ago - at 10/09/2024 02:33AM

No material updates to report. We continue investigations and will provide further updates in 1 hour. Please follow https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676 for the latest.

about 2 months ago - at 10/09/2024 03:33AM

We have resolved the Redis saturation issue and see that all pipelines are functioning normally now. We will continue to monitor while continuing investigation into other aspects. See https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676 for details.

about 2 months ago - at 10/09/2024 04:18AM

We have resolved the Redis saturation issue and see that all pipelines are functioning normally now. We will continue to monitor while continuing investigation into other aspects. See https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676 for details.

about 2 months ago - at 10/09/2024 04:25AM

We have resolved the Redis saturation issue and see that all pipelines are functioning normally now. We will continue to monitor while continuing investigation into other aspects. See https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676 for details.

about 2 months ago - at 10/09/2024 08:26AM

We are moving this incident back to Investigating, as after resolving the Redis saturation, we are still seeing Pipelines not progressing between stages. Merge Requests are functioning normally now. See https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676 for details.

about 2 months ago - at 10/09/2024 09:23AM

Some Merge requests are still impacted by this incident. Merge requests are stuck with the message "Your merge request is almost ready!". The workaround is to recreate the impacted Merge Request. See https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676 for details.

about 2 months ago - at 10/09/2024 09:28AM

We have identified a potential commit that could have caused this incident. We are working on reverting it. See https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676 for details.

about 2 months ago - at 10/09/2024 10:16AM

We have enabled a feature flag that will prevent new pipelines and Merge Requests from being impacted. Pipelines and MR that were previously stuck will need to be recreated. See https://gitlab.com/gitlab-com/gl-infra/production/-/issues/18676

Vendor Downtime? Keep Your Team Informed with an Internal Status Page

With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.

Start free trial

No credit card required · Cancel anytime · 3273 services available

Integrations with Slack Microsoft Teams Google Chat Datadog PagerDuty Zapier Discord Webhook

Setup in 5 minutes or less

How much time you'll save your team, by having the outages information close to them?

14-day free trial · No credit card required · Cancel anytime