CHPC Downtimes and History

Unscheduled Network Outage of arches clusters and telluride - 4/7/2008

Posted: April 7, 2008

Duration: Unknown

Systems Affected/Downtime Timelines: Connectivity of all of the arches clusters and telluride.

We've had a switch go down which affects connectivity to all of the arches clusters and telluride. Our staff are working on the problem. We'll send an update when we know more.

CHPC DOWNTIME: Starts Tuesday March 18, 2008 at 5PM

Posted: March 12, 2008

SCOPE: HPC(Arches), Network, desktop access to filesystems

DURATION:

Network access to INSCC should be restored at about 10pm on March 18th

Desktop access to fileservers (home directory access) will be restored during the morning of March 19th - a message will be sent to users when the systems are up and ready to be used

Arches will be back up sometime later in the day on March 19th; again, a message will be sent when it is ready for use. Reservations have been set so that only jobs that can finish before 5pm on March 18th will be started. Jobs still waiting in the queue will be started once the downtime has finished.


CHPC Major Downtime: 3/18/2008

Posted: February 27, 2008

Duration: From 5 p.m. Tuesday 3/18 until 5 p.m. 3/19

Systems Affected/Downtime Timelines: All CHPC networks, arches, fileservers, desktops

Major CHPC Downtime

Core infrastructure (fileservers, etc., plus SSB and INSCC dependencies) down at 5 p.m. on 3/18, back up by midnight.

Arches Cluster down 5 p.m. 3/18 until 5 p.m. 3/19


CHPC Batch Systems Paused: Tuesday February 5, 2008

Posted: February 4, 2008

Duration: Downtime starts at 4pm and will last about an hour.

Systems affected:

All of Arches and any computation cluster under batch control

The clusters impacted by this are: sanddunearch; delicatearch; marchingmen; tunnelarch; landscapearch and telluride.

We will be pausing the Moab schedulers on all CHPC computational clusters under batch control tomorrow, Tuesday February 5th, 2008, at 4:00 p.m., for about an hour. This is to perform system maintenance on one of our administrative systems.

Scope: No new jobs will be started during this period. You may still queue jobs and look at the queues, and running jobs will continue to run.

Please let us know if you have any questions.


CHPC DOWNTIME: Thursday January 3, 2008

Posted: December 19, 2007

Event date: January 3, 2008

Duration: Downtime starts at 3pm and will last until sometime in the early morning of January 4, 2008

Systems affected:

All of Arches and CHPC/INSCC Network

After this downtime, all users will use their campus uNID and password for authentication on all HPC systems (and other Linux systems administered by CHPC). Windows users will use their uNID and current password for authentication.

Arches:

All clusters will be down from 3pm to allow for updates to the OS and for the other changes outlined below. The batch queues will be drained of all running jobs. Reservations are in place so that a job will not be started unless it can finish before the start of the downtime. Jobs that are queued but not running will be started after the downtime ends. The one exception is if you are being moved to using your uNID for authentication during this downtime (see below); in that case, any queued jobs you have will need to be deleted. The clusters will be down until sometime the following morning.
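As a rough illustration of the reservation rule described above, the following Python sketch checks whether a job could start before the downtime. It is not the scheduler's actual logic; the function name `can_start`, the example current time, and the walltimes are assumptions made for illustration. Only the 3pm January 3, 2008 start time comes from this announcement.

```python
from datetime import datetime, timedelta

def can_start(now, walltime, downtime_start):
    """Return True if a job started at `now` with the requested
    `walltime` would finish no later than `downtime_start`."""
    return now + walltime <= downtime_start

# Downtime start taken from the announcement: 3pm on January 3, 2008.
downtime_start = datetime(2008, 1, 3, 15, 0)

# Hypothetical current time and job walltimes, for illustration only.
now = datetime(2008, 1, 2, 9, 0)
short_job = timedelta(hours=24)
long_job = timedelta(hours=48)

print(can_start(now, short_job, downtime_start))  # True: ends 9am on January 3
print(can_start(now, long_job, downtime_start))   # False: would run into the downtime
```

In practice this means a job that requests a shorter walltime may still be started ahead of the downtime, while a job with a longer request waits in the queue until the clusters return.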

**MIGRATION TO NEW FILESERVER: Some CHPC users (those on the CHPC-owned home directory filesystems, i.e., those with home directories under /uufs/inscc.utah.edu/common/home/USERID) will be migrated to a new, larger fileserver during this downtime. If you are one of these users, your new home directory path will be /uufs/chpc.utah.edu/common/home/UNID.

**CHANGE TO UNID: All CHPC users who are not already using their UNID as their CHPC login will be switched to it. If you do not have a UNID, you will need to get one BEFORE this downtime. All University of Utah students and employees automatically have a UNID. If you are not part of the University of Utah, you need to fill out a Person of Interest (PoI) form to be assigned a UNID. This form can be found at http://www.hr.utah.edu/forms/lib/u-affiliate-poi-form.pdf.
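Users with scripts that hard-code the old home directory path may need to update them after the fileserver migration described above. The Python sketch below shows one way to rewrite such a path. The function name `migrate_path`, the login `jdoe`, and the UNID `u0123456` are hypothetical; only the two path prefixes come from this announcement.

```python
# Path prefixes taken from the announcement.
OLD_PREFIX = "/uufs/inscc.utah.edu/common/home/"
NEW_PREFIX = "/uufs/chpc.utah.edu/common/home/"

def migrate_path(path, old_userid, unid):
    """Rewrite a path under the old home directory to the new one.

    Paths outside the given user's old home directory are returned unchanged.
    """
    old_home = OLD_PREFIX + old_userid
    # Match the home directory itself or anything below it, but not
    # a different user whose login merely starts with the same letters.
    if path == old_home or path.startswith(old_home + "/"):
        return NEW_PREFIX + unid + path[len(old_home):]
    return path

# Hypothetical login and UNID, for illustration only.
print(migrate_path(OLD_PREFIX + "jdoe/project/run.sh", "jdoe", "u0123456"))
```

The check against `old_home + "/"` is a deliberate choice so that a path belonging to, say, `jdoe2` is not rewritten by mistake.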

Network Outage:

All networking in CHPC/INSCC will be down from about 5pm to 7pm.