Hi all,
I am submitting a job to vanilla universe. I need to send condor_vacate_job time to time.
But recently, I see my jobs being in Halt state after I send condor_vacate_job signal.
The error message that I find in the log file is:
007 (002.000.000) 01/26 12:00:30 Shadow exception!
Error from starter on slot1@xxxxxxxxxx: STARTER at xxxxxx failed to send file(s) to <xxxxxxxxxxxxxx>; SHADOW at xxxxxxxxxx failed to write to file /var/local/condor/spool/cluster2.proc0.subproc0.tmp/tufa420.hmm: (errno 13) Permission denied
424871712 - Run Bytes Sent By Job
424874688 - Run Bytes Received By Job
...
012 (002.000.000) 01/26 12:00:30 Job was held.
Error from starter on slot1@xxxxxxxxxxxx: STARTER at xxxxxxxxxxx failed to send file(s) to <xxxxxxxxxxx> SHADOW at xxxxxxxxx failed to write to file /var/local/condor/spool/cluster2.proc0.subproc0.tmp/tufa420.hmm: (errno 13) Permission denied
Code 12 Subcode 13
I did not have this problem before, it is only recently that I see this shadow exception occurring. Could any of you have any idea whats
causing this problem? Just to test, I have submitted all my files with permission set to 777, but still this problem persists.
Thanks in advance.
--Tan
--
--
Tanzima Zerin Islam
Graduate Student
School of Electrical & Computer Engineering
Purdue University
_______________________________________________