[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Condor-devel] [condor-fw] [nwp@xxxxxxxxxxxxxxxxxxxxxx: errorlog]
On Wed, May 16, 2012 at 02:26:41PM -0500, Nathan Panike wrote:
> On Wed, May 16, 2012 at 01:06:47PM -0500, John (TJ) Knoeller wrote:
>
> > Is it possible that the file size is 0, because the file has been
> > written to but not (yet) closed?
> > -tj
>
> I wondered about that, but the job completed at 1800 last night, and
> I mailed my query at 1200 today, so I rejected that in the belief a
> shadow would not hang out for 18 hours trying to write a file.
Since I've dealt with this problem before...
The file exists with zero size because it was the "Output" parameter
in the submit file. It was created with zero size when the job was
submitted.
One other note: this happens much more often on the CAE pool in the
early morning when their Windows nodes are being rebooted. Something
with the jobs being evicted seems to trigger this problem more often.
You might check if any of these jobs were being evicted at the time.
--
Daniel K. Forrest Space Science and
dan.forrest@xxxxxxxxxxxxx Engineering Center
(608) 890 - 0558 University of Wisconsin, Madison