[Condor-users] Condor problems


Date: Thu, 10 Feb 2005 14:10:42 -0500
From: Darryl Cook <dlc@xxxxxxxxxxxxxxx>
Subject: [Condor-users] Condor problems
Ok, I have re-installed Condor on machines that I have and am *really* close to getting this thing to work now....but still having a couple of problems.

I am using two machines:   grid0  and node1.
I installed the central manager on node1.

I submit a job on grid0 to node1 with the following:

executable=/bin/ls
transfer_executable = false
globusscheduler = node1.cs.appstate.edu/jobmanager-fork
universe=globus
output=test1.out
log=test1.log
error=test1.error
requirements=true
queue

The job runs but gets rejected for some unknown reason. If I do a condor_q -analyze I get the following:1 reject your job because of their own requirements.

If I look at the globus-gatekeeper.log on node1 I see the following:
TIME: Thu Feb 10 14:06:05 2005
PID: 2911 -- Notice: 6: globus-gatekeeper pid=2911 starting at Thu Feb 10 14:06:05 2005


TIME: Thu Feb 10 14:06:05 2005
PID: 2911 -- Notice: 6: Got connection 192.10.1.100 at Thu Feb 10 14:06:05 2005


TIME: Thu Feb 10 14:06:05 2005
PID: 2912 -- Notice: 6: globus-gatekeeper pid=2912 starting at Thu Feb 10 14:06:05 2005


TIME: Thu Feb 10 14:06:05 2005
PID: 2912 -- Notice: 6: Got connection 192.10.1.100 at Thu Feb 10 14:06:05 2005


TIME: Thu Feb 10 14:06:05 2005
PID: 2912 -- Notice: 5: Authenticated globus user: /O=Grid/OU=GlobusTest/OU=simpleCA-grid0.cs.appstate.edu/OU=cs.appstate.edu/CN=Darryl Cook
TIME: Thu Feb 10 14:06:05 2005
PID: 2912 -- Notice: 5: Requested service: jobmanager [PING ONLY]
TIME: Thu Feb 10 14:06:05 2005
PID: 2912 -- Notice: 5: Authorized as local user: dlc
TIME: Thu Feb 10 14:06:05 2005
PID: 2912 -- Notice: 5: Authorized as local uid: 500
TIME: Thu Feb 10 14:06:05 2005
PID: 2912 -- Notice: 5: and local gid: 100
Failure: ping successful
TIME: Thu Feb 10 14:06:05 2005
PID: 2912 -- Failure: ping successful
TIME: Thu Feb 10 14:06:05 2005
PID: 2911 -- Notice: 5: Authenticated globus user: /O=Grid/OU=GlobusTest/OU=simpleCA-grid0.cs.appstate.edu/OU=cs.appstate.edu/CN=Darryl Cook
TIME: Thu Feb 10 14:06:05 2005
PID: 2911 -- Notice: 0: GRID_SECURITY_HTTP_BODY_FD=6
TIME: Thu Feb 10 14:06:05 2005
PID: 2911 -- Notice: 5: Requested service: jobmanager-fork
TIME: Thu Feb 10 14:06:05 2005
PID: 2911 -- Notice: 5: Authorized as local user: dlc
TIME: Thu Feb 10 14:06:05 2005
PID: 2911 -- Notice: 5: Authorized as local uid: 500
TIME: Thu Feb 10 14:06:05 2005
PID: 2911 -- Notice: 5: and local gid: 100
TIME: Thu Feb 10 14:06:05 2005
PID: 2911 -- Notice: 0: executing /usr/local/globus/libexec/globus-job-manager
TIME: Thu Feb 10 14:06:05 2005
PID: 2911 -- Notice: 0: GRID_SECURITY_CONTEXT_FD=9
TIME: Thu Feb 10 14:06:05 2005



So it sees my request but it gets terminated.....
I can run it with globus-job-run node1.cs.appstate.edu/jobmanager-fork /bin/ls and it runs fine.



anyone please have any clues....I have tried everything I know to get this to work.....


thanks,
Darryl Cook




[← Prev in Thread] Current Thread [Next in Thread→]