Ongoing outage of HTC Gluster file system


Date: Tue, 25 Feb 2020 17:09:06 -0600
From: chtc-users@xxxxxxxxxxx
Subject: Ongoing outage of HTC Gluster file system
Hi CHTC Users,

This message is for users of the high throughput (HTC) system who:
- use our Gluster file system to stage their data OR
- use MPI or licensed software modules on our HTC system

Portions of the Gluster file system have been failing intermittently since the weekend. Symptoms include specific files disappearing or error messages like "File not found" or "Transport endpoint not connected."

We have no indication that any data has been lost at this point, but jobs that depend on reading from, writing to, or using modules in Gluster will likely see failures due to the ongoing outage.

Jobs on the HTC system that do not depend on Gluster for either data staging or module use should not be impacted by this outage.

We will send a follow-up email to this address once we feel that we have restored Gluster to a more stable state where data access and module use will work consistently.

Please direct any additional questions to chtc@xxxxxxxxxxx.

Cheers,
Your CHTC Team

[← Prev in Thread] Current Thread [Next in Thread→]
  • Ongoing outage of HTC Gluster file system, chtc-users <=