On 3/31/21 1:55 PM, templon@xxxxxxxxx wrote:
What is the corresponding simplest way to achieve exactly this in HTCondor?
Note the word âexactlyâ :)
The answer was the
condor_drain
command, but it does not achieve exactly this, without a bit more.condor_drain
also evicts running jobs from slots, depending on what the value ofMaxJobRetirementTime
is. I did not know about this variable so we did not have it set, and aside from nodes not accepting new jobs (the question), they stopped running the already-running jobs - not the desired behavior.