Re: [Condor-users] Vanilla jobs not automatically restart
- Date: Thu, 20 Dec 2007 17:07:47 -0800
- From: Stuart Anderson <anderson@xxxxxxxxxxxxxxxx>
- Subject: Re: [Condor-users] Vanilla jobs not automatically restart
On Wed, Nov 28, 2007 at 01:03:13PM -0600, Dan Bradley wrote:
>
> >On Wed, Nov 28, 2007 at 12:02:35PM -0600, Dan Bradley wrote:
> >
> >
> >>The 6.8 series does not provide a good way to avoid multiple runs of a
> >>job, because JobRunCount in the job ClassAd is actually the number of
> >>times the schedd has started a shadow for the job, not the number of
> >>times the job has actually been started. You can try using JobRunCount
> >>anyway, but it may sometimes be an overestimate of the number of times
> >>the job has started.
> >>
> >>In 6.9.5 (about to be released), there is a new attribute of the job
> >>ClassAd called NumJobStarts, which I think you should be able to use
> >>like so in the submit file:
> >>
> >>periodic_hold = NumJobStarts > 0 && JobStatus == 1
> >>requirements = NumJobStarts == 0
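
The logic of the two quoted submit-file expressions can be sketched in Python. This is only an illustrative model, not Condor code: Condor evaluates these as ClassAd expressions, and the function names and the plain-integer job model here are assumptions made for the example. The only details taken from the thread are the attribute NumJobStarts and the comparison against JobStatus == 1 (the idle state).

```python
# Illustrative model only. Condor evaluates these as ClassAd expressions,
# not Python. The function names below are hypothetical.

IDLE = 1  # JobStatus value for an idle job


def should_hold(num_job_starts: int, job_status: int) -> bool:
    """Mirror of: periodic_hold = NumJobStarts > 0 && JobStatus == 1"""
    return num_job_starts > 0 and job_status == IDLE


def may_match(num_job_starts: int) -> bool:
    """Mirror of: requirements = NumJobStarts == 0"""
    return num_job_starts == 0
```

Under this model, a job that has never started can still match a machine and is not held, while a job that has started once and then returned to the idle state is held instead of being run a second time.
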

NumJobStarts is documented in 6.9.5 as "not (yet) defined for standard universe
jobs". Will this be defined in 7.0.0?

Thanks.

--
Stuart Anderson anderson@xxxxxxxxxxxxxxxx
http://www.ligo.caltech.edu/~anderson