IsDown continuously monitor the official Google Cloud Cloud Machine Learning status page for ongoing outages. Check the stats for the latest 30 days and a list of the last Google Cloud Cloud Machine Learning outages.
Number of Outages
1
Average Downtime
348 mins
Total Downtime
348 mins
Since last incident
23 days
Stay on top of outages with IsDown. Monitor the official status pages of all your vendors, SaaS, and tools, including Google Cloud, and never miss an outage again.
Minor · 22 days ago · lasted about 6 hours
Summary: Multi Region: Vertex AI Batch Prediction & Vertex AI Online Prediction Model Deployment Issues Description: Our engineers have identified the issue and a fix is being rolled out. Online Prediction is mitigated as of 2023-05-11 12:30 US/Pacific. Batch Prediction is expected to rollout the fix by 2023-05-11 18:00 US/Pacific. The issue is currently contained in the following four regions: us-west2, europe-west3, asia-east2, us-east1. We will provide more information by Thursday, 2023-05-11 19:00 US/Pacific. Diagnosis: Customers are unable to deploy new models on Online Prediction, when a new GKE cluster is created. Some customers Batch Prediction jobs are experiencing timeouts. Workaround: None at this time.
Minor · 3 months ago · lasted about 15 hours
Summary: Multi- Regional: Vertex AI Feature Store issues Description: We are experiencing an issue with Vertex AI Feature Store. Our engineering team continues to investigate the issue. We will provide an update by Friday, 2023-03-17 16:00 US/Pacific with current details. Diagnosis: Users cannot ingest data for Feature Store Workaround: None at this time.
Minor · 3 months ago · lasted 1 day
Summary: Cloud AI Platform and Vertex AI Training elevated error rates for GPU jobs in us-central1, us-east1, and europe-west3 Description: Mitigation work is currently underway by our engineering team. At this time, we believe the issue has been resolved for the us-central1 region and are working to confirm. We do not have an ETA for mitigation in us-east1 and europe-west3 at this point. We will provide more information by Friday, 2023-03-03 23:30 US/Pacific. Diagnosis: Cloud AI Platform and Vertex AI Training GPU jobs may experience elevated failure rates in us-central1, us-east1, and europe-west3. Workaround: None at this time.
Minor · over 1 year ago · lasted 6 months
Summary: Global: Jobs failing with internal error for GKE version 1.18 Description: We are experiencing an issue with Cloud AI where distributed training jobs, BYOSA jobs and VPC peering jobs that run on GKE v1.18 will fail with internal error. Our engineering team continues to investigate the issue. We will provide an update by Tuesday, 2021-10-05 17:30 US/Pacific with current details. We apologize to all who are affected by the disruption. Diagnosis: All training jobs (Distributed training jobs, BYOSA jobs and VPC peering jobs) that run on GKE v1.18 to fail with internal error. Workaround: None at this time.
Minor · over 2 years ago · lasted about 1 year
Our engineers have determined this issue to be linked to a single Google incident. For regular status updates, please visit https://status.cloud.google.com/incident/compute/21002. No further updates will be made through this incident.
IsDown monitors Google Cloud, and also its competitors (also 2500 other cloud services). Check the current status of the most popular alternatives to Google Cloud.
The data and notifications you need, in the tools you already use.
Stay on top of outages with IsDown. Monitor the official status pages of all your vendors, SaaS, and tools, including Google Cloud, and never miss an outage again.
Try it out! How much time you'll save your team, by having the outages information close to them?