Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[condor-users] Flocking
Hello all!
I'm trying to use Flocking between Solaris and Linux Pools. And I
configured in both direction like below:
In Linux global config file (on "ipc654"):
FLOCK_FROM = isun01.inf.uni-jena.de
FLOCK_TO = isun01.inf.uni-jena.de
In Unix (on "isun01"):
FLOCK_FROM = ipc654.inf-ra.uni-jena.de
FLOCK_TO = ipc654.inf-ra.uni-jena.de
Is there some other variables which i have to configure? And how can i see
if flocking works? I tried in Solaris with command
"condor_status -pool ipc654.inf-ra.uni-jena.de"
and in Linux wiht:
"condor_status -pool isun01.inf.uni-jena.de"
I get following error message:
--------------------------------------------------------------------------
Error: Couldn't contact the condor_collector on ipc654.inf-ra.uni-jena.de.
Extra Info: the condor_collector is a process that runs on the central
manager of your Condor pool and collects the status of all the machines
and ......
--------------------------------------------------------------------------
And for a test i wrote a following submit file. Both binaries are in the
same directory where is this submit file. And i executed it from "isun01".
The job works on Solaris and thats way I uncommented the requirement for
Solaris to see if the flocking work. and it doesn't work. I put the part
of SchedLog behind of submit file.
Please help me to solve this Problem with flocking.
Sorry about my bad english.
Thank you
- degi
################
#
# Condor submit file for simple test job example
#
################
Universe = vanilla
Executable = hello.$$(OpSys).$$(Arch)
Requirements = (Arch == "INTEL" && OpSys == "LINUX")
#(Arch == "SUN4u" && OpSys == "SOLARIS28") ||\
input = /dev/null
output = he.out
error = he.error
log = he.log
Queue
-----------------------------------------------------------------------
SchedLog file
10/2 18:55:09 DaemonCore: Command received via UDP from host
<141.35.14.22:41440>
10/2 18:55:09 DaemonCore: received command 421 (RESCHEDULE), calling
handler (reschedule_negotiator)
10/2 18:55:09 Sent ad to central manager for baatarts@xxxxxxxxxxxxxxx
10/2 18:55:09 Called reschedule_negotiator()
10/2 18:55:09 Activity on stashed negotiator socket
10/2 18:55:09 Negotiating for owner: baatarts@xxxxxxxxxxxxxxx
10/2 18:55:09 Checking consistency running and runnable jobs
10/2 18:55:09 Tables are consistent
10/2 18:55:09 Out of servers - 0 jobs matched, 1 jobs idle, 1 jobs
rejected
10/2 19:00:09 Activity on stashed negotiator socket
10/2 19:00:09 Negotiating for owner: baatarts@xxxxxxxxxxxxxxx
10/2 19:00:09 Checking consistency running and runnable jobs
10/2 19:00:09 Tables are consistent
10/2 19:00:09 Out of servers - 0 jobs matched, 1 jobs idle, 1 jobs
rejected
10/2 19:00:09 Sent ad to central manager for baatarts@xxxxxxxxxxxxxxx
10/2 19:04:08 DaemonCore: Command received via UDP from host
<141.35.14.22:43792>
10/2 19:04:08 DaemonCore: received command 421 (RESCHEDULE), calling
handler (reschedule_negotiator)
10/2 19:04:08 Sent ad to central manager for baatarts@xxxxxxxxxxxxxxx
10/2 19:04:08 Called reschedule_negotiator()
10/2 19:04:08 Activity on stashed negotiator socket
10/2 19:04:08 Negotiating for owner: baatarts@xxxxxxxxxxxxxxx
10/2 19:04:08 Checking consistency running and runnable jobs
10/2 19:04:08 Tables are consistent
10/2 19:04:08 Out of servers - 1 jobs matched, 1 jobs idle, 1 jobs
rejected
Condor Support Information:
http://www.cs.wisc.edu/condor/condor-support/
To Unsubscribe, send mail to majordomo@xxxxxxxxxxx with
unsubscribe condor-users <your_email_address>