Use cases
Software Products E-commerce MSPs Schools Development & Marketing DevOps Agencies Help Desk
Company
Internet Status Blog Pricing Log in Get started free

Outage in Georgia Tech IT

PACE emergency shutdown due to cooling failure

Resolved Major
May 26, 2026 - Started 4 days ago - Lasted about 4 hours

Incident Report

The cooling system in the Coda datacenter research hall has failed. All PACE compute nodes (Phoenix, Firebird, and ICE) have been shut down to avoid overheating. All running jobs have been cancelled, and no new jobs can start. Storage remains available via Globus, login nodes, and OnDemand.
Components affected
Georgia Tech IT Academic Services

Trusted by 1,000+ teams

The Status Page Aggregator with Early Outage Detection

Stop finding out about outages from your users. Monitor 6,320+ cloud services and get alerted the second something breaks.

IsDown status aggregator dashboard
Latest Updates ( sorted recent to last )
4 days ago - at 05/26/2026 03:14PM

The cooling system in the Coda datacenter research hall has failed. All PACE compute nodes (Phoenix, Firebird, and ICE) have been shut down to avoid overheating. All running jobs have been cancelled, and no new jobs can start. Storage remains available via Globus, login nodes, and OnDemand.

4 days ago - at 05/26/2026 03:22PM

Summary: The cooling system in the Coda datacenter research hall has failed. All PACE compute nodes (Phoenix, Firebird, and ICE) have been shut down to avoid overheating. All running jobs have been cancelled, and no new jobs can start. Storage remains available via Globus, login nodes, and OnDemand. 

Please visit https://status.gatech.edu/ for ongoing updates. 

Details: The high-temperature cooling tower in the Coda datacenter, which provides cooling to the research hall hosting all PACE compute nodes, has failed. All jobs have been cancelled. To avoid overheating and damage to the systems, all PACE compute nodes have been shut down. The enterprise hall, hosting login and storage nodes, remains cooled. Investigation of the issue is ongoing. 

Impact: All running jobs have been cancelled. Refunds will be issued for any job on Phoenix or Firebird cancelled due to this failure. Login nodes and storage remain available. There is no impact to CEDAR storage. 

Current actions:
The data center team is actively investigating and working to restore cooling
PACE is monitoring system temperatures closely
We are proactively reducing thermal load
Reservations are in place for Phoenix, ICE and Firebird
An emergency shutdown procedure is underway 
What you should do:
Please avoid submitting new jobs until further notice
Save work and ensure checkpointing is enabled where possible
Monitor the PACE Blog and your email for updates before resuming normal workloads
Next update:
We will provide updates as more information becomes available or if service status changes.

4 days ago - at 05/26/2026 05:43PM

Cooling has been restored to the datacenter. The PACE team is powering on all clusters and will complete validation testing before releasing the systems.

4 days ago - at 05/26/2026 07:10PM

Cooling has been restored to the Coda datacenter after a valve repair. ICE has now returned to service after testing. Phoenix & Firebird are being prepared for resumed service.
Please resubmit any jobs that were cancelled due to the outage.

Latest Georgia Tech IT outages

BME (Whitaker) Network - 3 days ago
Canvas Outage - 22 days ago
SSO login error - 25 days ago

The Status Page Aggregator with Early Outage Detection

With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.

Start free trial

No credit card required · Cancel anytime · 6320 services available

Integrations with Slack Microsoft Teams Google Chat Datadog PagerDuty Zapier Discord Webhook