CHTC nodes down last night/this morning (9/17, 9/18)


Date: Tue, 18 Sep 2018 11:18:23 -0500
From: chtc-users@xxxxxxxxxxx
Subject: CHTC nodes down last night/this morning (9/17, 9/18)
Hi everyone,

The Discovery building had a cooling issue last night, affecting CHTC servers located there. All nodes of the HPC cluster, two submit servers (submit-3 and submit-5), and a large number of our HTC execute nodes had to be powered off last night and into this this morning.

As of now, all nodes are back up. Jobs will have been interrupted on both systems, but should still be in the queue and will be automatically re-run.Â

As always, if you notice any issues or have any concerns, please email us at chtc@xxxxxxxxxxx

Cheers,
Your CHTC team
[← Prev in Thread] Current Thread [Next in Thread→]
  • CHTC nodes down last night/this morning (9/17, 9/18), chtc-users <=