Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [HTCondor-users] job evictions
On Wed, Nov 19, 2014 at 12:28 PM, Suchandra Thapa <ssthapa@xxxxxxxxxxxx> wrote:
> How do I get detailed information about why a job was evicted from a job
> slot? We have an user whose jobs keep getting evicted even though the
> configuration doesn't have any preemption enabled.
Are you sure you don't have preemption enabled? There are three places
preemption might occur: in the negotiator, in the startd, and in the
schedd (only if using a dedicated scheduler). See section 3.5.9.5 of
the manual[1] (for versions 8.0 and prior) for an explanation of
disabling negotiator- and startd-based preemption.
Depending on your START configuration, the job may also be evicted due
to keyboard activity, CPU load, etc. I'd suggest looking in
StarterLog.slotX for the slot your job last ran on (check the
LastRemoteHost job attribute) to see why it got kicked off.
[1] http://research.cs.wisc.edu/htcondor/manual/v8.0/3_5Policy_Configuration.html#SECTION00459500000000000000
Thanks,
BC
--
Ben Cotton
main: 888.292.5320
Cycle Computing
Leader in Utility HPC Software
http://www.cyclecomputing.com
twitter: @cyclecomputing