[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Condor-users] Globus to Condor cluster submission
- Date: Wed, 28 Oct 2009 01:19:10 +0530
- From: satyanarayan rao <satyanarayan.iiitm@xxxxxxxxx>
- Subject: [Condor-users] Globus to Condor cluster submission
Hello All,
I have configured a condor pool.
[grid-node@grid-server ~]$ condor_status
Name OpSys Arch State Activity LoadAv Mem ActvtyTime
slot1@an-lab-node1 LINUX INTEL Unclaimed Idle 0.000 497 0+00:14:18
slot2@an-lab-node1 LINUX INTEL Unclaimed Idle 0.000 497 0+00:14:18
slot1@an-lab-node3 LINUX INTEL Unclaimed Idle 0.000 497 0+00:14:24
slot2@an-lab-node3 LINUX INTEL Unclaimed Idle 0.160 497 0+00:14:25
slot1@grid-server. LINUX INTEL Unclaimed Idle 0.380 1006 0+00:25:04
slot2@grid-server. LINUX INTEL Unclaimed Idle 0.000 1006 0+00:25:05
Total Owner Claimed Unclaimed Matched Preempting Backfill
INTEL/LINUX 6 0 0 6 0 0 0
Total 6 0 0 6 0 0 0
[grid-node@grid-server ~]$
I have a submit file like
[grid-node@grid-server ~]$ cat memory-grid.submit
####################
#
# Example 2: Show off some fancy features including
# use of pre-defined macros and logging.
#
####################
Executable = /home/grid-node/hello
Universe = grid
Requirements = Memory <= 500
Rank = Memory <= 500
Image_Size = 28 Meg
#KeyboardIdle = 1
grid_resource = gt4 https://grid-server.iiitm.ac.in:8443/wsrf/services/ManagedJobFactoryService Condor
Error = err.$(Process)
#Input = in.$(Process)
Output = out.$(Process)
Log = foo.log
Queue 10
[grid-node@grid-server ~]$ condor_q
-- Submitter: grid-server.iiitm.ac.in : <192.168.33.188:34401> : grid-server.iiitm.ac.in
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
209.0 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.1 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.2 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.3 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.4 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.5 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.6 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.7 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.8 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.9 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
210.0 grid-node 10/28 01:10 0+00:00:33 R 0 0.0 gridftp_wrapper.sh
11 jobs; 10 idle, 1 running, 0 held
[grid-node@grid-server ~]$
[grid-node@grid-server ~]$ condor_q
-- Submitter: grid-server.iiitm.ac.in : <192.168.33.188:34401> : grid-server.iiitm.ac.in
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
209.0 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.1 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.2 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.3 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.4 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.5 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.6 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.7 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.8 grid-node 10/28 01:10 0+00:00:00 I 0 29.3 hello
209.9 grid-node 10/28 01:10 0+00:00:04 C 0 29.3 hello
210.0 grid-node 10/28 01:10 0+00:02:14 R 0 0.0 gridftp_wrapper.sh
212.0 grid-node 10/28 01:12 0+00:00:00 I 0 1.7 hello
213.0 grid-node 10/28 01:12 0+00:00:00 I 0 1.7 hello
214.0 grid-node 10/28 01:12 0+00:00:00 I 0 1.7 hello
215.0 grid-node 10/28 01:12 0+00:00:00 I 0 1.7 hello
216.0 grid-node 10/28 01:12 0+00:00:00 I 0 1.7 hello
217.0 grid-node 10/28 01:12 0+00:00:00 I 0 1.7 hello
218.0 grid-node 10/28 01:12 0+00:00:00 I 0 1.7 hello
219.0 grid-node 10/28 01:12 0+00:00:00 I 0 1.7 hello
220.0 grid-node 10/28 01:12 0+00:00:00 I 0 1.7 hello
19 jobs; 18 idle, 1 running, 0 held
You have new mail in /var/spool/mail/grid-node
[grid-node@grid-server ~]$
When i run this file, the whole job is being execute on the slots1@grid-server and slot2@grid-server, but it should execute on other two nodes of the condor pool...
When i tun the above file without grid universe it runs on other slots which are satisfying the parameters, what could be the problem
********************************************************
Globus Logs
2009-10-28T01:12:09.552+05:30 INFO handler.SubmitStateHandler [pool-1-thread-2,process:172] Job 92bcb220-c330-11de-bb54-b421c00b8b13 submitted with local job ID '211.000.000'
2009-10-28T01:12:11.804+05:30 INFO handler.SubmitStateHandler [pool-1-thread-10,process:172] Job 92a42110-c330-11de-bb53-b421c00b8b13 submitted with local job ID '213.000.000'
2009-10-28T01:12:11.825+05:30 INFO handler.SubmitStateHandler [pool-1-thread-6,process:172] Job 929c31d0-c330-11de-bb53-b421c00b8b13 submitted with local job ID '212.000.000'
2009-10-28T01:12:11.848+05:30 INFO handler.SubmitStateHandler [pool-1-thread-8,process:172] Job 9293cd60-c330-11de-bb53-b421c00b8b13 submitted with local job ID '214.000.000'
2009-10-28T01:12:12.210+05:30 INFO handler.SubmitStateHandler [pool-1-thread-3,process:172] Job 936acc70-c330-11de-bb54-b421c00b8b13 submitted with local job ID '215.000.000'
2009-10-28T01:12:13.163+05:30 INFO handler.SubmitStateHandler [pool-1-thread-5,process:172] Job 926606a0-c330-11de-bb53-b421c00b8b13 submitted with local job ID '216.000.000'
2009-10-28T01:12:13.979+05:30 INFO handler.SubmitStateHandler [pool-1-thread-4,process:172] Job 93d2dd10-c330-11de-bb54-b421c00b8b13 submitted with local job ID '217.000.000'
2009-10-28T01:12:13.983+05:30 INFO handler.SubmitStateHandler [pool-1-thread-10,process:172] Job 93deeb00-c330-11de-bb54-b421c00b8b13 submitted with local job ID '218.000.000'
2009-10-28T01:12:13.997+05:30 INFO handler.SubmitStateHandler [pool-1-thread-6,process:172] Job 9419f830-c330-11de-bb54-b421c00b8b13 submitted with local job ID '219.000.000'
2009-10-28T01:12:14.045+05:30 INFO handler.SubmitStateHandler [pool-1-thread-2,process:172] Job 9412cc40-c330-11de-bb54-b421c00b8b13 submitted with local job ID '220.000.000'
2009-10-28T01:12:15.613+05:30 INFO handler.DoneStateHandler [pool-1-thread-7,process:66] Job 92bcb220-c330-11de-bb54-b421c00b8b13 finished successfully
2009-10-28T01:12:34.790+05:30 INFO handler.DoneStateHandler [pool-1-thread-4,process:66] Job 929c31d0-c330-11de-bb53-b421c00b8b13 finished successfully
2009-10-28T01:12:35.955+05:30 INFO handler.DoneStateHandler [pool-1-thread-7,process:66] Job 92a42110-c330-11de-bb53-b421c00b8b13 finished successfully
2009-10-28T01:12:37.645+05:30 INFO handler.DoneStateHandler [pool-1-thread-9,process:66] Job 936acc70-c330-11de-bb54-b421c00b8b13 finished successfully
2009-10-28T01:12:37.995+05:30 INFO handler.DoneStateHandler [pool-1-thread-1,process:66] Job 93d2dd10-c330-11de-bb54-b421c00b8b13 finished successfully
2009-10-28T01:12:38.474+05:30 INFO handler.DoneStateHandler [pool-1-thread-5,process:66] Job 9419f830-c330-11de-bb54-b421c00b8b13 finished successfully
2009-10-28T01:12:39.019+05:30 INFO handler.DoneStateHandler [pool-1-thread-6,process:66] Job 926606a0-c330-11de-bb53-b421c00b8b13 finished successfully
2009-10-28T01:12:39.177+05:30 INFO handler.DoneStateHandler [pool-1-thread-3,process:66] Job 93deeb00-c330-11de-bb54-b421c00b8b13 finished successfully
2009-10-28T01:12:39.388+05:30 INFO handler.DoneStateHandler [pool-1-thread-10,process:66] Job 9293cd60-c330-11de-bb53-b421c00b8b13 finished successfully
2009-10-28T01:12:39.502+05:30 INFO handler.DoneStateHandler [pool-1-thread-5,process:66] Job 9412cc40-c330-11de-bb54-b421c00b8b13 finished successfully
Thank You
Regards
Satyanarayan Rao
--
Satyanarayan Rao
ABV-IIITM Gwalior
http://www.iiitm.ac.in
Ph: +91-9179346657