[Condor-users] two central managers on the same network
- Date: Fri, 12 Feb 2010 17:11:54 +0100
- From: Tom Ghekiere <tom.ghekiere@xxxxxxxx>
- Subject: [Condor-users] two central managers on the same network
Hi all,
We currently have a Condor cluster (version 7.0.0) with a master node
(hostname: ladon; IP: 192.168.0.100) that we would like to replace with a
newer one (hostname: mo99; IP: 192.168.0.101). Both servers are also
connected to the internet. All worker nodes are connected only to the
LAN (192.168.0.xxx). All systems run Linux.
I installed Condor on mo99 and configured it (changing every 'ladon'
to 'mo99' and 192.168.0.100 to 192.168.0.101, and also changing
COLLECTOR_NAME).
I removed a worker node (node14) from the ladon pool and configured it
to join mo99.
I started Condor on mo99 and node14.
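For reference, a minimal sketch of the kind of settings such a change touches. The post does not list the exact macros that were edited, so the selection below, the pool name, and the idea that they live in a local config file are all assumptions; only the host names come from the description above.

```
# Hypothetical excerpt from the Condor configuration on mo99 and node14.
# Which file holds these lines is an assumption, not stated in the post.

# Central manager of the pool this machine should belong to
CONDOR_HOST    = mo99

# Collector that daemons report to and that tools such as condor_status
# query; it normally follows CONDOR_HOST
COLLECTOR_HOST = $(CONDOR_HOST)

# Descriptive name of the pool (placeholder value)
COLLECTOR_NAME = New pool on mo99
```

If any of these still resolves to ladon on a given machine, tools run on that machine will talk to the old pool.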
This is what condor_status on node14 returns:
Name               OpSys Arch   State     Activity LoadAv Mem  ActvtyTime
slot1@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+01:35:04
slot2@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+01:35:05
slot3@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+01:35:06
slot4@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+01:35:07
slot5@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+01:35:08
slot6@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+01:35:09
slot7@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+01:35:10
slot8@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+01:35:03

             Total Owner Claimed Unclaimed Matched Preempting Backfill
X86_64/LINUX     8     0       0         8       0          0        0
       Total     8     0       0         8       0          0        0
On mo99, however, condor_status returns:
Name               OpSys Arch   State     Activity LoadAv Mem  ActvtyTime
slot1@xxxxxxxxxxxx LINUX X86_64 Owner     Idle     0.060  2011 9+23:15:55
slot2@xxxxxxxxxxxx LINUX X86_64 Owner     Idle     0.000  2011 9+23:15:56
slot3@xxxxxxxxxxxx LINUX X86_64 Owner     Idle     0.000  2011 9+23:15:57
slot4@xxxxxxxxxxxx LINUX X86_64 Owner     Idle     0.000  2011 9+23:15:58
slot5@xxxxxxxxxxxx LINUX X86_64 Owner     Idle     0.000  2011 9+23:15:59
slot6@xxxxxxxxxxxx LINUX X86_64 Owner     Idle     0.000  2011 9+23:16:00
slot7@xxxxxxxxxxxx LINUX X86_64 Owner     Idle     0.000  2011 9+23:16:01
slot8@xxxxxxxxxxxx LINUX X86_64 Owner     Idle     0.000  2011 9+23:15:54
...
...
...
slot1@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+01:35:14
slot2@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+01:35:19
slot3@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+01:35:18
slot4@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+02:15:54
slot5@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+00:49:52
slot6@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+00:51:16
slot7@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+00:48:06
slot8@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     0.000  2011 0+00:52:06

             Total Owner Claimed Unclaimed Matched Preempting Backfill
X86_64/LINUX   160     8      62        90       0          0        0
       Total   160     8      62        90       0          0        0
This is exactly the same as what condor_status returns on ladon (the
master that needs to be phased out). To me, it seems mo99 is connecting
to the wrong pool.
Does anyone have an idea what I am doing wrong and how I might
solve this?
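One way to narrow this down might be to ask Condor on each machine which central manager it actually resolves to, and then to query each collector explicitly. The sketch below is only a suggestion: the helper name show_pool_target is hypothetical, and it assumes the standard condor_config_val and condor_status tools are on the PATH.

```shell
#!/bin/sh
# Hypothetical helper: print the central manager settings that Condor
# on this host resolves from its configuration files.
show_pool_target() {
    if ! command -v condor_config_val >/dev/null 2>&1; then
        echo "condor_config_val not found in PATH"
        return 1
    fi
    # Each call prints the value of one configuration macro
    condor_config_val CONDOR_HOST
    condor_config_val COLLECTOR_HOST
    condor_config_val COLLECTOR_NAME
}

# Run on mo99, node14 and ladon and compare the results
show_pool_target || true

# To bypass the local default and query one collector directly
# (uncomment on a machine with Condor installed):
# condor_status -pool mo99
# condor_status -pool ladon
```

If COLLECTOR_HOST on mo99 still reports ladon, or if condor_status -pool mo99 shows only the eight node14 slots while the plain condor_status shows 160, the tools on mo99 are probably still reading a setting that points at the old master.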
Thanks,
Tom