Hi,
I am using condor 7.2.0, and configured system for Dynamic slot.
when I submit 2 job at if the status shows Slot1@xxx it match only one
job to Slot1.1@xxx and for second job says
1 match but reject the job for unknown reasons
and negotiator log says following
1/9 19:03:06 Socket to <192.168.111.5:9661> already in cache, reusing
1/9 19:03:06 Over submitter resource limit (0) ... only consider
startd ranks
1/9 19:03:06 Sending SEND_JOB_INFO/eom
1/9 19:03:06 Getting reply from schedd ...
1/9 19:03:06 Got JOB_INFO command; getting classad/eom
1/9 19:03:06 Request 00053.00000:
1/9 19:03:06 Concurrency Limit: ccp is 3.000000
1/9 19:03:06 Rejected 53.0 idealgrid@xxxxxxxxxxxxxxxxx
<192.168.111.5:9661>: no match found
1/9 19:03:06 Sending SEND_JOB_INFO/eom
1/9 19:03:06 Getting reply from schedd ...
1/9 19:03:06 Got NO_MORE_JOBS; done negotiating
1/9 19:03:06 This schedd hit its scheddlimit.
1/9 19:03:06 ---------- Finished Negotiation Cycle ----------
After restarting the negotiator the second job perfectly matches and get
executed in a Slot1.2@xxx machine that time the negotiator log says
1/9 19:12:05 Socket to <192.168.111.5:9661> not in cache, creating one
1/9 19:12:05 SocketCache: Found unused slot 0
1/9 19:12:05 Sending SEND_JOB_INFO/eom
1/9 19:12:05 Getting reply from schedd ...
1/9 19:12:05 Got JOB_INFO command; getting classad/eom
1/9 19:12:05 Request 00053.00000:
1/9 19:12:05 Concurrency Limit: ccp is 3.000000
1/9 19:12:05 Connecting to startd slot1@xxx at
<192.168.111.200:9619>
1/9 19:12:05 File descriptor limits: max 1024, safe 820
1/9 19:12:05 Sending PERMISSION, claim id, startdAd to schedd
1/9 19:12:05 Matched 53.0 idealgrid@xxxxxxxxxxxxxxxxx
<192.168.111.5:9661> preempting none <192.168.111.200:9619> slot1@xxx
Why I Negotiator restart required to match the second Job, Help me in
this..
by
Johnson