On Thu, Jul 2, 2015 at 2:09 PM, Todd Tannenbaum <tannenba@xxxxxxxxxxx> wrote:
> By "the scheduler removed them", do you mean they ended up in condor_history
> in the "X" (removed) state, or do you mean they disappeared from the queue
> and thus perhaps ended up in condor_history in the "C" state? Wondering if
> there is a race condition here.
>
They end up in JobStatus 3.
> Perhaps there are policy expressions involved, like condor_config
> system_periodic_remove or job ad periodic_remove?
>
In this particular case, there is a SYSTEM_PERIODIC_REMOVE expression:
(JobStatus == 5) && (JobRunCount >=5)
However, these jobs all had a run count of 1. In previous incidents, I
believe there were no *periodic_remove expressions involved.
Thanks,
BC
--
Ben Cotton
main: 888.292.5320
Cycle Computing
Better Answers. Faster.
http://www.cyclecomputing.com
twitter: @cyclecomputing
|