Komas Datacenter Downtime: Tuesday March 12, 2013 for critical service to the cooling tower

Posted: March 8, 2013

Event date: March 12, 2013

Duration: Clusters in the Komas Datacenter will be down from 7:00 a.m. until about 5:00 p.m.

Systems Affected/Downtime Timelines:

All clusters in the Komas Datacenter, including Ember, Updraft and Sanddunearch, will be down from 7 a.m. until about 5 p.m. Scratch space will remain up unless the temperature gets too high during this maintenance, in which case we will need to take these servers down as well.

The folks who service our cooling system in the Komas Datacenter (CMMS) have notified us that the cooling system is in critical need of service, and they are very concerned about the current situation. It is believed that if we see temperatures above 60 to 65 degrees, we may have an outage of the cooling system.

We have set reservations to drain the queues on the clusters, and we expect to make it to Tuesday morning for a graceful shutdown. However, if the cooling fails in the meantime, we may have an emergency downtime before the scheduled time.

CMMS will take the coolers offline at 8 a.m. Tuesday, so we need to shut the clusters down at 7 a.m. They expect to be finished by 2:00 p.m., and we will begin bringing the clusters back up as soon as their work is complete. It usually takes 2-3 hours for the clusters to come up and be made available to users.

If you have any questions, please let us know by sending email to issues@chpc.utah.edu.