Date: | Fri, 4 Feb 2005 12:54:57 -0700 |
---|---|
From: | Masao Fujinaga <fujinaga@xxxxxxxxxxx> |
Subject: | [Condor-users] unable to submit or query jobs |
I had a smoothly functioning condor setup until this morning. I had been running 200 dagman jobs but this morning, there were some problems with our nfs server. Both the condor installation and the job submit directory are on the nfs filesystem. Since then, I can no longer submit or query jobs. If I do condor_q, it never returns. If I submit a new job, it gets to "Submitting job(s)" then does not return. Other commands such as " condor_reconfig -schedd" and "condor_reschedule" don't return as well. I have restarted condor as well as rebooted the machine. The only errors in the logs are in CollectorLog of the type DC_AUTHENTICATE: attempt to open invalid session condor:1266:1107387904:2996, failing. What can I do to purge the system of whatever is blocking it? I am running condor-6.6.7. -- Masao Fujinaga | Research Computing Support fujinaga@xxxxxxxxxxx | Computing and Network Services Tel.: (780) 492-2117 | University of Alberta Fax.: (780) 492-1729 | Edmonton, Alberta, CANADA |
[← Prev in Thread] | Current Thread | [Next in Thread→] |
---|---|---|
|
Previous by Date: | RE: [Condor-users] condor_q -global stressing our schedulers?, bgore |
---|---|
Next by Date: | RE: [Condor-users] What would cause a schedd to stop responding tocondor_q queries?, Ian Chesal |
Previous by Thread: | RE: [Condor-users] Trouble on WinXP with master node, Campbell Bradley L CRBE |
Next by Thread: | RE: [Condor-users] Using the same log for several clusters?, Ian Chesal |
Indexes: | [Date] [Thread] |