[Condor-users] Condor on NFS

Date: Mon, 24 Jan 2005 22:57:31 -0500
From: Jacob Joseph <jmjoseph@xxxxxxxxxxxxxx>
Subject: [Condor-users] Condor on NFS

Hi. I currently have the Condor home directory shared over NFS with all nodes in our cluster. This is great for centralized configuration. However, it seems that even a brief NFS outage (under 1-2 minutes) is enough to kill all running jobs. They do restart once NFS comes back.

We use NFS over UDP with the mount options "hard" and "intr" so that clients can withstand server reboots: processes should simply hang until the server comes back. Instead of waiting, Condor kills the jobs. Is there a configurable timeout I should have set? If not, how else can I make Condor resilient to such NFS outages?

Thanks,
Jacob
