[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] not starting jobs in condor ver 8.3.6




On Jul 9, 2015, at 3:27 PM, Todd L Miller <tlmiller@xxxxxxxxxxx> wrote:

 Try adding ALLOW_NEGOTIATOR and ALLOW_NEGOTIATOR_SCHEDD to your config, setting each to 10.200.60.*, or the specific machine hosting the negotiator (10.200.60.19).

Thanks Todd,
I did it and … still jobs do not start. 

This is new content of the relevant part of  /etc/condor/condor_config.local
###############################################################################
# Security settings
###############################################################################
# Allow local host and the central manager to manage the node
ALLOW_ADMINISTRATOR = $(FULL_HOSTNAME), $(CONDOR_HOST)
# master needs this two particular versions
ALLOW_READ = *.lns.mit.edu,10.200.60.*
ALLOW_WRITE = *.lns.mit.edu,10.200.60.*
# Fix to version 8.3.6 suggested by ToddL
ALLOW_NEGOTIATOR = 10.200.60.*
ALLOW_NEGOTIATOR_SCHEDD = 10.200.60.*
###############################################################################

At 15:38  I made this change on IP=121 which is condor master and has 6 job-slots open.

I executed service condor restart
next submitted 12 jobs - all are idle, despite 14 jobs slots are open.

At 15:42 I made similar change on IP=122, which is just condor worker node. I did not expected it changes anything since the change on IP=121 did not helped.

To be absolutely sure I rebooted few minutes later both VMs and verified again, condor jobs do not start, despite 6+8 jobs slots are open.
 
Perhaps  you can find time to look again in to the log files, which are copied  from both VMs :
$ scp -rp root@xxxxxxxxxxxxxxxxxxxx:/var/log/condor condor-122
$ scp -rp root@xxxxxxxxxxxxxxxxxxxx:/var/log/condor condor-121 
and posted them here:
https://www.dropbox.com/sh/8z7pxbdc5j4yh43/AADh4J3WjukpJKsX55lsTMuva?dl=0

Thanks for looking in to it
Jan