Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Condor-users] Vanilla jobs not automatically restart
- Date: Wed, 28 Nov 2007 10:45:26 -0800
- From: Stuart Anderson <anderson@xxxxxxxxxxxxxxxx>
- Subject: Re: [Condor-users] Vanilla jobs not automatically restart
On Wed, Nov 28, 2007 at 12:02:35PM -0600, Dan Bradley wrote:
>
> The 6.8 series does not provide a good way to avoid multiple runs of a
> job, because JobRunCount in the job ClassAd is actually the number of
> times the schedd has started a shadow for the job, not the number of
> times the job has actually been started. You can try using JobRunCount
> anyway, but it may sometimes be an overestimate of the number of times
> the job has started.
>
> In 6.9.5 (about to be released), there is a new attribute of the job
> ClassAd called NumJobStarts, which I think you should be able to use
> like so in the submit file:
>
> periodic_hold = num_job_starts > 0 && JobStatus == 1
> requirements = num_job_starts == 0
Dan,
I am confused by the distinction between JobRunCount and NumJobStarts.
Would you be willing to enumerate the circumstances when these two numbers can
differ?
Thanks.
--
Stuart Anderson anderson@xxxxxxxxxxxxxxxx
http://www.ligo.caltech.edu/~anderson