2009 CHPC Downtimes and History

CHPC Major Downtime: Tuesday, October 13th, 2009 - ALL DAY

Posted: October 7, 2009

Duration: From 8:00 a.m. until evening, Tuesday October 13th, 2009

Systems Affected/Downtime Timelines:

  • All HPC Clusters
  • Intermittent outage of CHPC supported networks
  • All CHPC supported desktops mounting CHPCFS file systems.

Arches Downtime Duration:

During this downtime maintenance will be performed on the cooling system in the Komas datacenter, requiring all clusters housed in the data center to be down most of the day. CHPC will take advantage of this down time to do a number of additional tasks, including work on the network and file servers.

All file systems served from CHPCFS will be unavailable for a good part of the day. This includes HPC home directory space as well as departmental file systems supported by CHPC. We will work to get things online as soon as possible.

Instructions to User:

  • All HPC Clusters
  • Intermittent outage of CHPC supported networks
  • All CHPC supported desktops mounting CHPCFS file systems.

During this downtime maintenance will be performed on the cooling system in the Komas datacenter, requiring all clusters housed in the data center to be down most of the day. CHPC will take advantage of this down time to do a number of additional tasks, including work on the network and file servers.

All file systems served from CHPCFS will be unavailable for a good part of the day. This includes HPC home directory space as well as departmental file systems supported by CHPC. We will work to get things online as soon as possible.


Komas Machine Room Power Outage: HPC Clusters down from about Noon 7/08/09 until about Noon 7/09/09 (except updraft online 11:30 p.m. 7/09/09)

Posted: July 8, 2009

Duration: Began at 11:45 a.m. on July 8th, 2009 - Most clusters back online by Noon July 9th, 2009

Systems Affected/Downtime Timelines: All clusters, file servers and network equipment in the Komas Machine room.

Arches Downtime Duration: All power dropped initially to Komas at approx. 11:45am. on July 8th, 2009. Most of the clusters were back online by Noon on July 8th, 2009, except for the following:

  • updraft - LOST /scratch/serial and /scratch/general, online at 11:30 p.m. on 7/09/2009
  • delicatearch lost a couple of switches, NODES da193-da214 will remain down until replacements arrive

Instructions to User: All clusters, file servers and network equipment in the Komas Machine room.

All power dropped initially to Komas at approx. 11:45am. on July 8th, 2009. Most of the clusters were back online by Noon on July 8th, 2009, except for the following:

  • updraft - LOST /scratch/serial and /scratch/general, online at 11:30 p.m. on 7/09/2009
  • delicatearch lost a couple of switches, NODES da193-da214 will remain down until replacements arrive

CHPC Major Downtime: Tuesday, March 17th, 2009 - ALL DAY

Posted: March 10, 2009

Duration: From 8:00 a.m. until evening

Systems Affected/Downtime Timelines:

  • All HPC Clusters
  • Intermittent outage of CHPC supported networks (INSCC, 7th & 8th floor of WBB and CHPC supported networks in Sutton)
  • All CHPC supported desktops mounting CHPCFS file systems.

Arches Downtime Duration:

During this downtime maintenance will be performed on the cooling system in the Komas datacenter, requiring all clusters housed in the data center to be down most of the day. CHPC will take advantage of this down time to do a number of additional tasks, including work on the network and file servers.

All file systems served from CHPCFS will be unavailable for a good part of the day. This includes HPC home directory space as well as departmental file systems supported by CHPC. We will work to get things online as soon as possible.

Instructions to User:

  • All HPC Clusters
  • Intermittent outage of CHPC supported networks (INSCC, 7th & 8th floor of WBB and CHPC supported networks in Sutton)
  • All CHPC supported desktops mounting CHPCFS file systems.

During this downtime maintenance will be performed on the cooling system in the Komas datacenter, requiring all clusters housed in the data center to be down most of the day. CHPC will take advantage of this down time to do a number of additional tasks, including work on the network and file servers.

All file systems served from CHPCFS will be unavailable for a good part of the day. This includes HPC home directory space as well as departmental file systems supported by CHPC. We will work to get things online as soon as possible.