I'm having some problems setting up a dedicated condor cluster to run parallel jobs.
I have 2 machines: condorA(to submit and schedule jobs) and condorB (to execute jobs). The config in condorB is:
SUSPEND = Scheduler =!= $(DedicatedScheduler) && ($(SUSPEND))
PREEMPT = Scheduler =!= $(DedicatedScheduler) && ($(PREEMPT))
RANK_FACTOR = 1000000
RANK = (Scheduler =?= $(DedicatedScheduler) * \
$(RANK_FACTOR)) + $(RANK)
START = (Scheduler =?= $(DedicatedScheduler)) || ($(START))
MPI_CONDOR_RSH_PATH = $(LIBEXEC)
CONDOR_SSHD = /usr/sbin/sshd
CONDOR_SSH_KEYGEN = /usr/bin/ssh-keygen
STARTD_EXPRS = $(STARTD_EXPRS), DedicatedScheduler
It stays in Idle state forever. condor_q -analyze reports this:
071.000: Request has not yet been considered by the matchmaker.