[Condor-users] jobs are only running at condor_master machine
- Date: Mon, 29 Aug 2005 20:27:32 +0200
- From: Narunjan Kumar <naranjan@xxxxxxxxx>
- Subject: [Condor-users] jobs are only running at condor_master machine
Hello,
I have set up a Condor pool of two machines:
the 1st is the condor master,
the 2nd is a slave node.
When I submit jobs from the condor master, they run, but only on the
condor master machine.
Jobs do not go to any other machine, even when the machines are idle.
When I submit jobs from the 2nd machine, they remain idle in the
queue and never run, not even on that same machine.
In either case I find the same error message in the log:
---------- Started Negotiation Cycle ----------
8/29 20:16:45 Phase 1: Obtaining ads from collector ...
8/29 20:16:45 Getting all public ads ...
8/29 20:16:45 Sorting 7 ads ...
8/29 20:16:45 Getting startd private ads ...
8/29 20:16:45 Got ads: 7 public and 2 private
8/29 20:16:45 Public ads include 1 submitter, 2 startd
8/29 20:16:45 Phase 2: Performing accounting ...
8/29 20:16:45 Phase 3: Sorting submitter ads by priority ...
8/29 20:16:45 Phase 4.1: Negotiating with schedds ...
8/29 20:16:45 Negotiating with condor@xxxxxxxxxxxxxxxxxxxxxxx at <**.26.146.226:1173>
8/29 20:17:15 select returns 0, connect failed
8/29 20:17:15 Will keep trying for 30 seconds...
8/29 20:17:16 Connect failed for 30 seconds; returning FALSE
8/29 20:17:16 Failed to connect to <**.26.146.226:1173>
8/29 20:17:16 Error: Ignoring schedd for this cycle
8/29 20:17:16 ---------- Finished Negotiation Cycle ----------
What is the problem here?
Why is the central manager unable to connect to the other machines
in the pool?
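The "select returns 0, connect failed" lines in the log suggest that a TCP connection to the schedd address timed out. One way to check this independently of Condor might be a plain TCP connect test run on the central manager against the address and port shown in the log. The following is only a minimal sketch: the function name `can_connect` is made up for illustration, the host in the usage comment is a placeholder (the real address is masked in the log), and a failed connect does not by itself say whether a firewall, routing, or name resolution is the cause.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to (host, port) succeeds within timeout.

    Hypothetical helper for illustration; it is not part of Condor.
    """
    try:
        # create_connection resolves the host and attempts a TCP connect,
        # raising an OSError subclass on refusal, timeout, or lookup failure.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


# Example usage (placeholder host; take the port from the negotiator's log line):
#   print(can_connect("slavepc", 1173))
```

If this returns False from the central manager but True when run on the submitting machine itself, the problem is likely somewhere on the network path between the two hosts rather than in the job or pool configuration.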
When I run condor_status, it shows both computers in the list:
Name       OpSys  Arch   State      Activity  LoadAv  Mem  ActvtyTime
masterpc   LINUX  INTEL  Unclaimed  Idle      0.050   750  0+00:30:04
slavepc    LINUX  INTEL  Unclaimed  Idle      0.000   750  0+00:25:15

             Machines  Owner  Claimed  Unclaimed  Matched  Preempting
INTEL/LINUX         2      0        0          2        0           0
      Total         2      0        0          2        0           0
Any help would be appreciated.
Thanks in advance,
Narunjan