
[Condor-users] quill on farm with multiple schedds



Dear Condor team


I set up Quill on a small test farm consisting of 4 nodes. Jobs can be submitted from all 4 nodes, so a schedd is running on each of them. In addition, each
node runs the quill daemon with a unique Quill name.
One node also serves as the master, and one also hosts the PostgreSQL database.
When I issue the condor_q command, it appears to contact all of the
quills (see below).
The problem is that the response does not reflect the actual status of
the farm:
I know that 8 jobs are running just fine, yet condor_q reports 0 jobs for every quill.

So my questions are:
- What am I doing wrong?
- How can I diagnose it?
- How can I fix it?

thanks in advance


hans


[wenzel@hotdog47 tcondor]$ /opt/condor/bin/condor_status

Name          OpSys       Arch   State      Activity   LoadAv Mem   ActvtyTime
vm1@hotdog47. LINUX       INTEL  Claimed    Busy       0.000   334  0+00:00:02
vm2@hotdog47. LINUX       INTEL  Claimed    Busy       0.000   334  0+00:00:02
vm3@hotdog47. LINUX       INTEL  Unclaimed  Idle       0.000   334  0+00:25:06
vm1@hotdog48. LINUX       INTEL  Claimed    Busy       0.000   334  0+00:03:13
vm2@hotdog48. LINUX       INTEL  Claimed    Busy       0.000   334  0+00:02:31
vm3@hotdog48. LINUX       INTEL  Unclaimed  Idle       0.010   334  0+00:25:06
vm1@hotdog49. LINUX       INTEL  Claimed    Busy       0.000   334  0+00:00:02
vm2@hotdog49. LINUX       INTEL  Claimed    Busy       0.000   334  0+00:00:03
vm3@hotdog49. LINUX       INTEL  Unclaimed  Idle       0.010   334  0+00:25:06
vm1@hotdog54. LINUX       INTEL  Claimed    Busy       0.000   334  0+00:00:43
vm2@hotdog54. LINUX       INTEL  Claimed    Busy       0.000   334  0+00:00:43
vm3@hotdog54. LINUX       INTEL  Unclaimed  Idle       0.000   334  0+00:25:06

                   Total Owner Claimed Unclaimed Matched Preempting Backfill
       INTEL/LINUX    12     0       8         4       0          0        0
             Total    12     0       8         4       0          0        0

---------------------------------------------------------------------------
[wenzel@hotdog47 tcondor]$ /opt/condor/bin/condor_q -submitter wenzel


-- Quill: hotdog49_quilld@xxxxxxxxxxxxxxxxx : <131.225.206.133:5432> : quill
ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD

0 jobs; 0 idle, 0 running, 0 held


-- Quill: hotdog48_quilld@xxxxxxxxxxxxxxxxx : <131.225.206.133:5432> : quill
ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD

0 jobs; 0 idle, 0 running, 0 held


-- Quill: hotdog54_quilld@xxxxxxxxxxxxxxxxx : <131.225.206.133:5432> : quill
ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD

0 jobs; 0 idle, 0 running, 0 held


-- Quill: hotdog47_quilld@xxxxxxxxxxxxxxxxx : <131.225.206.133:5432> : quill
ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD

0 jobs; 0 idle, 0 running, 0 held
--------------------------------------------------------------------------------
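
The mismatch between the two outputs above can also be checked mechanically. The following is only a minimal sketch, assuming the condor_status and condor_q output has been saved as plain text; the function names count_claimed and sum_running are made up for this illustration and are not part of Condor. It counts the slots whose State is Claimed and sums the "running" figures from the per-quill summary lines.

```python
import re

# Matches the per-quill summary line printed by condor_q,
# e.g. "0 jobs; 0 idle, 0 running, 0 held".
SUMMARY = re.compile(r"(\d+) jobs; (\d+) idle, (\d+) running, (\d+) held")


def count_claimed(status_text):
    """Count slot lines in condor_status output whose State is 'Claimed'."""
    count = 0
    for line in status_text.splitlines():
        fields = line.split()
        # Slot lines start with a name such as vm1@hotdog47. and carry
        # the State in the fourth whitespace-separated column.
        if len(fields) >= 5 and "@" in fields[0] and fields[3] == "Claimed":
            count += 1
    return count


def sum_running(q_text):
    """Add up the 'running' counts over all summary lines in condor_q output."""
    return sum(int(m.group(3)) for m in SUMMARY.finditer(q_text))
```

For the output pasted in this message, count_claimed should give 8 while sum_running gives 0, which is exactly the discrepancy described.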