I ran one job interactively which is running over one node :
****Finishes in 40 min
15126.0 bawa 4/23 04:56 0+03:50:16 R 0 317.4 parallel_90.sh
Statistics for comparison:-
Interactively:-
==============
real 63m57.321s
user 42m17.957s
sys 1m24.413s
Statistics for
Condor Node:==========
condor_q -analyze 15126.0
-- Submitter:
t3nfs.atlas.csufresno.edu : <
192.168.100.2:9905> :
t3nfs.atlas.csufresno.edu
---
15126.000: Request is being serviced
The jobs are running since 1 day, If I see Real CPUTime of this job, its
[bawa@t3nfs Wstar_sin0_NewCalib17]$ condor_q 15126.0 -cputime
-- Submitter:
t3nfs.atlas.csufresno.edu : <
192.168.100.2:9905> :
t3nfs.atlas.csufresno.edu
ID OWNER SUBMITTED CPU_TIME ST PRI SIZE CMD
15126.0 bawa 4/23 04:56 0+00:06:47 R 0 317.4 parallel_90.sh
If I understand correctly, CPUtime(CPU time is time of running CPU) is just 6min 47 sec Out of RunTime which is 3 Hr 50 min. I suspect there is something serious in data transfer going on.(i/o)
Is there any suggestion how to debug that.
Thanks
-Harinder