Outage in UiPath

ML Skill deployments are failing for Europe Community and Enterprise customers

Resolved Minor
December 26, 2022 - Started over 1 year ago - Lasted 4 days
Official incident page

Need to monitor UiPath outages?
Stay on top of outages with IsDown. Monitor the official status pages of all your vendors, SaaS, and tools, including UiPath, and never miss an outage again.
Start Free Trial

Outage Details

ML Skill deployment in our Europe clusters are having some issue, due to which they are getting failed after sometime. We are actively looking into it.
Components affected
UiPath AI Center
Latest Updates ( sorted recent to last )
RESOLVED over 1 year ago - at 12/30/2022 06:18AM

Due to an unexpected high load on the cluster, we were hitting a threshold limit on our backend compute instances, which inturn caused failed MLSkill deployment issue at AICenter side. We worked with our cloud provider team to diagnose and fix the issue at backend.

The deployments are working fine now and we are marking this incident resolved now. We will continue to keep the system under monitoring and work on preventing this issue to resurface. We are sorry for all the inconvenience caused and thank you for your patience and understanding in this.

MONITORING over 1 year ago - at 12/29/2022 03:17PM

A new fix was implemented to mitigate the incident. The new ML Skill deployments are working fine now. We are performing a few tests and keeping the system under monitoring.

INVESTIGATING over 1 year ago - at 12/29/2022 09:00AM

After performing a few tests, we have experienced some failures in the ML Skills deployment again. We are looking further into it.

MONITORING over 1 year ago - at 12/29/2022 06:43AM

We have applied a fix on the cluster and all ML Skills are getting deployed fine now. We are currently monitoring the results and testing a few different use cases as well.

IDENTIFIED over 1 year ago - at 12/28/2022 05:28PM

As of now cluster is in healthy state . Existing deployments will not be effected until if customers try to make any changes to existing deployment and also new customers having some intermittent issues with creating new deployments. Seems like there are some hardware issues while scheduling new instances on the nodes. We are actively working with MSFT team to resolve the issue and also we are actively working on creating completely new cluster in parallel to unblock. Since it is holiday time its taking more time than expected. Apologies for any inconvenience this has caused.

IDENTIFIED over 1 year ago - at 12/28/2022 06:03AM

The mitigation actions taken so far for this incident is not resolving the issue completely. We are continuing to work on this issue with our backend team. We apologise for all the inconvenience this incident has caused. Please be rest assured that this issue is being worked upon with high priority.

IDENTIFIED over 1 year ago - at 12/27/2022 04:41PM

We are making progress in resolving the issues with backend compute instances. Now, only few of the instances are pending to be in healthy state now. Once completed, we should see improvement in ML Skill deployment state.

IDENTIFIED over 1 year ago - at 12/27/2022 10:33AM

We are continuing to work on a fix for this issue with our cloud provider.

IDENTIFIED over 1 year ago - at 12/27/2022 03:31AM

The issue has been identified and a fix is being implemented.

INVESTIGATING over 1 year ago - at 12/27/2022 01:55AM

Backend infrastructure is partially in good state now and only some customers should see issues while creating deployments . We are still working to recover full cluster infrastructure.

INVESTIGATING over 1 year ago - at 12/26/2022 06:42PM

We are working with Microsoft team to investigate if there are any issues with the backend infrastructure

INVESTIGATING over 1 year ago - at 12/26/2022 03:18PM

The issue has been found with the backend compute instance. We are working with our backend team on this.

INVESTIGATING over 1 year ago - at 12/26/2022 11:47AM

ML Skill deployment in our Europe clusters are having some issue, due to which they are getting failed after sometime. We are actively looking into it.

Start monitoring UiPath and all your cloud vendors in minutes

With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.

Start free trial

No credit card required · Cancel anytime · 3153 services available

Integrations with Slack Microsoft Teams Google Chat Datadog PagerDuty Zapier Discord Webhook

Setup in 5 minutes or less

How much time you'll save your team, by having the outages information close to them?

14-day free trial · No credit card required · Cancel anytime