We believe we’ve restored all CELS general compute services. For other issues (LCRC, ALCF, etc.) stay tuned to the respective user notification lists for updates.
On Dec 14, 2018, at 6:17 AM, Stacey, Craig <stace> wrote:
Between 2 and 3 am, the lab’s central chilled water plant had a failure. The temperatures in the 240 data center spiked highly and a number of systems were affected by the heat.
Cooling was restored by 4:30, but we’ve been trying to get the affected services online. You may notice some problems reaching some servers or services, though most are back to normal at this point.
I’ll send an all-clear when we believe things are back fully.