CHTC nodes down this past weekend (6/30-7/1)


Date: Mon, 02 Jul 2018 09:42:14 -0500
From: chtc-users@xxxxxxxxxxx
Subject: CHTC nodes down this past weekend (6/30-7/1)

Hi everyone,

The Discovery building had a cooling issue over the weekend, likely prompted by the extremely hot weather and the related demand on campus cooling systems, which affected some of the CHTC servers located there. All nodes of the HPC cluster, two submit servers (submit-3 and submit-5), and a large number of our HTC execute nodes had to be powered off at various points over the weekend.

As of now, all nodes are back up. Jobs on both systems (HPC and HTC) will have been interrupted, but they should still be in the queue and will be re-run automatically.

As always, if you notice any issues or have any concerns, please email us at chtc@xxxxxxxxxxx

Cheers,
Your CHTC team