Use Cases
Software Products MSPs Schools Development & Marketing DevOps Agencies Help Desk
 
Internet Status Blog Pricing Log In Try IsDown for free now

Outage in Farm HPC cluster

nas-6-1 still not functioning

Resolved Major
January 06, 2026 - Started about 1 month ago - Lasted 2 days
Official incident page

Incident Report

nas-6-1 is still losing contact with its disks, which causes the ZFS file-system to suspend operation. Admins are investigating. This directly impacts the following PI group directories: darcantugrp3 dubcovskygrpBackup ejf4grp ejf4grp epontikegrp epontikegrp group group jmearlesgrp lmillergrp lmillergrp magertongrp magertongrp zhougrp zimanyigrp zimanyigrp This directly impacts the following users: afefm agoga amdiggs amnjoshi andresmr apjha caih chaiti cmhansen ddyoyo98 dguevara dgunruh ejf4 epontike eranario hkaman hzchris jbkeeler jmearles jyoun jyyi kertcher lilyb lion397 makovtun mevishal mhstan mmomayye momtanu mstigler nasiegel nbhagat nhaigh nikocj nparekh nwisuthi petersl pvraja rkdesai7 sonishiy spacemat vbgupta yuqing18 zfei zimanyi zmcrawfo ztzhao

Need to monitor Farm HPC cluster outages?

  • Monitor all your external dependencies in one place
  • Get instant alerts when outages are detected
  • Be the first to know if service is down
  • Show real-time status on private or public status page
  • Keep your team informed
Latest Updates ( sorted recent to last )
RESOLVED about 1 month ago - at 01/08/2026 07:13PM

nas-6-1 has been successfully serving data for 24 hours now, so it looks like the combination of updates resolved the issue it was experiencing.

MONITORING about 1 month ago - at 01/08/2026 12:41AM

Additional fixes have been put into place, and nas-6-1 currently working correctly again. Admin are monitoring the situation.

INVESTIGATING about 1 month ago - at 01/06/2026 08:25PM

nas-6-1 is still losing contact with its disks, which causes the ZFS file-system to suspend operation. Admins are investigating.

This directly impacts the following PI group directories: darcantugrp3 dubcovskygrpBackup ejf4grp ejf4grp epontikegrp epontikegrp group group jmearlesgrp lmillergrp lmillergrp magertongrp magertongrp zhougrp zimanyigrp zimanyigrp

This directly impacts the following users: afefm agoga amdiggs amnjoshi andresmr apjha caih chaiti cmhansen ddyoyo98 dguevara dgunruh ejf4 epontike eranario hkaman hzchris jbkeeler jmearles jyoun jyyi kertcher lilyb lion397 makovtun mevishal mhstan mmomayye momtanu mstigler nasiegel nbhagat nhaigh nikocj nparekh nwisuthi petersl pvraja rkdesai7 sonishiy spacemat vbgupta yuqing18 zfei zimanyi zmcrawfo ztzhao

Latest Farm HPC cluster outages

Slurm not available - 25 days ago
Quobyte Unavaiable - 7 months ago
SSH availability - 7 months ago

The Status Page Aggregator with Early Outage Detection

With IsDown, you can monitor all your critical services' official status pages from one centralized dashboard and receive instant alerts the moment an outage is detected. Say goodbye to constantly checking multiple sites for updates and stay ahead of outages with IsDown.

Start free trial

No credit card required · Cancel anytime · 5850 services available

Integrations with Slack Microsoft Teams Google Chat Datadog PagerDuty Zapier Discord Webhook