Emergency Cluster DOWNTIME: starting 1/8/2013 approximately 12:30 p.m.
Posted: January 8, 2013
The CHPC clusters went down approximately 12:30 p.m. (Tuesday, 1/8/2013) to protect the equipment due to a cooling problem in the Komas machine room. These clusters include ember, updraft and sanddunearch.
By about 2:30 p.m. the cooling issues were mitigated. Ice had built up and was diverting the water out of the cooling tower.
All clusters were back online and running jobs by 5:15 p.m. Please let us know if you see any issues by sending email to firstname.lastname@example.org.