Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Condor-users] Problems with setting up a Windows Condor Pool
- Date: Tue, 07 Apr 2009 13:40:46 +0200
- From: Max Raith <max_raith123@xxxxxx>
- Subject: [Condor-users] Problems with setting up a Windows Condor Pool
Hey Guys,
well, I am still new to condor. The whole System is quite complex, but I am willing to learn how to set it up and use it. Knowing that this mailing list is very helpful I hope you could give me some advice.
Current Status:
I was able to set up a Personal Condor to test the basic condor functions. Everything worked fine. As a second step,
I wanted to build up a condor-pool consisting of 2 Pc(Win XP). They are connected and I verifed with the ping command that they can communicate.
So by following the installation instructions, I started to install condor on the PC which should be the central Manager.
I have chosen a name for the newly created pool. When it comes to the permisson settings( I used the cmd-box and ipconfig /all to get the IP adresses) i tipped in:
It looked like :
... Read : *
...Write : <ip_of_PC1>, <ip_of_PC2
...Administration : <ip_of_Pc1>.
I repeated the installation on Pc2 with the same settings.
After all, i checked if the demons run on both pcs.
On Pc1 ( the central manager) the Demons : collecor,master,procd, negotiator, and schedd are running
On Pc2 the Demons : master,procd,schedd, stard are running
When I tried the condor_status command, the cmd stayd blank.
So how should I set up and configure Condor correctly, if i want this setting :One is the central manager which only submit, and one is the working only pc?
It would be very kind if you gave me a step by step introduction.
Thank you so much!
Max Raith
Masterlog :
4/7 12:00:24 UnsetEnv(NET_REMAP_ENABLE): SetEnvironmentVariable failed, errno=203
4/7 12:00:24 ******************************************************
4/7 12:00:24 ** Condor (CONDOR_MASTER) STARTING UP
4/7 12:00:24 ** C:\condor\bin\condor_master.exe
4/7 12:00:24 ** SubsystemInfo: name=MASTER type=MASTER(2) class=DAEMON(1)
4/7 12:00:24 ** Configuration: subsystem:MASTER local:<NONE> class:DAEMON
4/7 12:00:24 ** $CondorVersion: 7.2.1 Feb 19 2009 BuildID: 133382 $
4/7 12:00:24 ** $CondorPlatform: INTEL-WINNT50 $
4/7 12:00:24 ** PID = 2512
4/7 12:00:24 ** Log last touched time unavailable (No such file or directory)
4/7 12:00:24 ******************************************************
4/7 12:00:24 Using config source: C:\condor\condor_config
4/7 12:00:24 Using local config sources:
4/7 12:00:24 C:\condor/condor_config.local
4/7 12:00:24 DaemonCore: Command Socket at <194.95.205.146:1137>
4/7 12:00:24 Started DaemonCore process "C:\condor/bin/condor_collector.exe", pid and pgroup = 2492
4/7 12:00:27 Started DaemonCore process "C:\condor/bin/condor_negotiator.exe", pid and pgroup = 1488
4/7 12:00:27 Started DaemonCore process "C:\condor/bin/condor_schedd.exe", pid and pgroup = 2080
4/7 12:28:03 Sent signal 15 to COLLECTOR (pid 2492)
4/7 12:28:03 Sent signal 15 to NEGOTIATOR (pid 1488)
4/7 12:28:03 Sent signal 15 to SCHEDD (pid 2080)
4/7 12:28:03 The COLLECTOR (pid 2492) exited with status 0
4/7 12:28:03 The NEGOTIATOR (pid 1488) exited with status 0
4/7 12:28:03 The SCHEDD (pid 2080) died due to exception ACCESS_VIOLATION
4/7 12:28:03 All daemons are gone.
4/7 12:30:33 attempt to connect to <194.95.205.146:9618> failed: connect errno = 10061 connection refused.
4/7 12:30:33 ERROR: SECMAN:2004:Failed to create security session to <194.95.205.146:9618> with TCP|SECMAN:2003:TCP connection to <194.95.205.146:9618> failed
4/7 12:30:33 Failed to start non-blocking update to <194.95.205.146:9618>.
4/7 12:31:19 Started DaemonCore process "C:\condor/bin/condor_collector.exe", pid and pgroup = 2380
4/7 12:31:22 Started DaemonCore process "C:\condor/bin/condor_negotiator.exe", pid and pgroup = 692
4/7 12:31:22 Started DaemonCore process "C:\condor/bin/condor_schedd.exe", pid and pgroup = 3256