Hi Jordan,
Just to get this out of the way: I am a PhD student, not an IT officer, so
unfortunately my knowledge of the wider network setup of Condor is limited. This
is a new system for all of us here, and we are still trying to sort it out.
I submit my jobs manually. I create a folder structure in a particular network
location containing the executable I want to run, some input files for the
executable, and a submit script that Condor uses to distribute the jobs across
the system. I submit from my machine using cmd and the "condor_submit" command,
which points to the submit.sub file in the network location. The output files
from each node are downloaded to the network folder I start the submit.sub job
from. The log and spool files are stored on my local drive by default, hence
the limitation.
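As a rough illustration only, a submit.sub for this kind of setup might look like the sketch below. The executable name, input file names, job count and the choice of the vanilla universe are all hypothetical placeholders, not the actual values in use.

```
# submit.sub: illustrative sketch only; every file name and count is hypothetical
universe                = vanilla
executable              = my_program.exe

# Input files shipped to each execute node along with the executable
should_transfer_files   = YES
when_to_transfer_output = ON_EXIT
transfer_input_files    = input1.dat, input2.dat

# Per-job stdout/stderr; relative paths are resolved against the directory
# the job is submitted from (or initialdir, if that is set)
output                  = out.$(Process).txt
error                   = err.$(Process).txt

# Job event log for this submission
log                     = jobs.log

# Queue a batch of identical jobs; $(Process) runs from 0 to 99
queue 100
```

It would then be submitted from cmd with "condor_submit submit.sub", run from the folder that holds the file.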
Regarding the UidDomain settings:
##--------------------------------------------------------------------
## Network domain parameters:
##--------------------------------------------------------------------
## Internet domain of machines sharing a common UID space. If your
## machines don't share a common UID space, set it to
## UID_DOMAIN = $(FULL_HOSTNAME)
## to specify that each machine has its own UID space.
UID_DOMAIN=
## Internet domain of machines sharing a common file system.
## If your machines don't use a network file system, set it to
## FILESYSTEM_DOMAIN = $(FULL_HOSTNAME)
## to specify that each machine has its own file system.
#FILESYSTEM_DOMAIN = $(FULL_HOSTNAME)
## This macro is used to specify a short description of your pool.
## It should be about 20 characters long. For example, the name of
## the UW-Madison Computer Science Condor Pool is ``UW-Madison CS''.
COLLECTOR_NAME = My Pool - $(CONDOR_HOST)
######################################################################
I think these settings are correct, as I am still able to run a few hundred
jobs without any problem.
Antonis
From: Jordan Williamson
Sent: Thursday, March 28, 2013 6:24 PM
Subject: Re: [HTCondor-users] move spool and log folders on highspeednetwork drive
Antonis,
I haven't personally used the NFS functionality, so I'm not sure how much help I can be, but I'll try.
If you haven't yet read the file transfer documentation, you can do so here:
http://research.cs.wisc.edu/htcondor/manual/v7.6/2_5Submitting_Job.html#SECTION00353000000000000000
What are your FileSystemDomain and UidDomain settings set to? From the docs:
"So, if a pool does have access to a shared file system, the pool administrator must correctly configure Condor such that all the machines mounting the same files have the same FileSystemDomain configuration. Similarly, all machines that share common user information must be configured to have the same UidDomain configuration."
On Thu, Mar 28, 2013 at 2:13 PM, Antonis Sergis <sergis_antonis@xxxxxxxxxxx> wrote: