Hi all, I am submitting a job to the vanilla universe, and I need to send condor_vacate_job from time to time. Recently, however, my jobs end up in the Held state after I send condor_vacate_job. The error message I find in the log file is:
007 (002.000.000) 01/26 12:00:30 Shadow exception!
    Error from starter on slot1@xxxxxxxxxx: STARTER at xxxxxx failed to send file(s) to <xxxxxxxxxxxxxx>; SHADOW at xxxxxxxxxx failed to write to file /var/local/condor/spool/cluster2.proc0.subproc0.tmp/tufa420.hmm: (errno 13) Permission denied
    424871712 - Run Bytes Sent By Job
    424874688 - Run Bytes Received By Job
...
012 (002.000.000) 01/26 12:00:30 Job was held.
    Error from starter on slot1@xxxxxxxxxxxx: STARTER at xxxxxxxxxxx failed to send file(s) to <xxxxxxxxxxx> SHADOW at xxxxxxxxx failed to write to file /var/local/condor/spool/cluster2.proc0.subproc0.tmp/tufa420.hmm: (errno 13) Permission denied
    Code 12 Subcode 13
I did not have this problem before; the shadow exception has only started occurring recently. Does anyone have an idea what is causing it? As a test, I submitted all my files with permissions set to 777, but the problem persists.
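In case it helps with diagnosis, here is a minimal sketch of the check I can run on the machine where the shadow runs. It only decodes errno 13 and tests whether the current user may create files in the spool directory named in the log. The helper names `describe_errno` and `can_write` are my own, and the path is copied from the error message above, so treat this as an illustration rather than anything Condor itself provides.

```python
import errno
import os


def describe_errno(code):
    """Return the symbolic name and the system message for an errno value."""
    return errno.errorcode.get(code, "UNKNOWN"), os.strerror(code)


def can_write(path):
    """Report whether the current user may create files inside `path`."""
    # Creating a file needs both write and search (execute) permission
    # on the directory itself, independent of the mode of the files in it.
    return os.path.isdir(path) and os.access(path, os.W_OK | os.X_OK)


if __name__ == "__main__":
    # Path taken from the log message; adjust it for your installation.
    spool_dir = "/var/local/condor/spool/cluster2.proc0.subproc0.tmp"
    name, message = describe_errno(13)
    print(f"errno 13 = {name}: {message}")
    print(f"{spool_dir} writable by uid {os.getuid()}: {can_write(spool_dir)}")
```

The point of checking the directory rather than the files is that the failing write happens in the spool directory, so the 777 mode on my input files may not be what matters here.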
Thanks in advance.
--Tan
--
Tanzima Zerin Islam
Graduate Student
School of Electrical & Computer Engineering
Purdue University