Emergency downtime (July 12, 2014) for subset of kingspeak nodes completed July 14, 2014 (about Noon)

Posted: July 12, 2014

UPDATE: July 14, 2014 about Noon: reservation on the nodes released

Switch and nodes power cycled after the queue was drained for nodes on related switches.


July 12, 2014 2:00 p.m.

A subset of kingspeak nodes has been experiencing problems over the past few days. CHPC is draining all nodes on the switches where these troublesome nodes reside to allow us to troubleshoot and find the root cause of the problem. We will bring nodes online again once we have identified the issue. We are hoping this will be early next week. We will update you as we learn more.

The troublesome nodes are:

  • kp019 - chpc general
  • kp101 - strong
  • kp103 - strong
  • kp105 - avey
  • kp108 - wjohnson
  • kp117 - sdss
  • kp121 - sdss
  • kp124 - sdss
  • kp125 - sdss
  • kp126 - sdss
  • kp127 - sdss
  • kp129 - sdss
  • kp132 - sdss
  • kp141 - molinero
  • kp145 - molinero
  • kp148 - hci
  • kp149 - hci
  • kp150 - hci
  • kp152 - varley
  • kp153 - varley
  • kp155 - gertz