Re: [HTCondor-users] Move jobs of users from one to another worker node
- Date: Thu, 27 Nov 2025 17:32:23 +0100
- From: Steffen Grunewald <steffen.grunewald@xxxxxxxxxx>
On Thu, 2025-11-27 at 14:18:06 +0530, gagan tiwari wrote:
> Hi Steffen,
> Thanks for your response.
>
> So, please let me know how to achieve it even if the jobs start from the
> starting point on another node.
>
> Lets say user Tom has submitted 4 jobs. 2 of his jobs are running with
> cluster id 2021 and another 2 jobs with cluster id 2025. So, I need to
> move his job with cluster id 2025 to a different worker node.
This - to me at least - seems to involve other questions:
Do you want the node to be completely idle/unclaimed? condor_drain might help.
Do you want to match jobs with particular node features (special hardware,
licenses etc.)? Use Requirements on the job side, and/or START expressions
on the machine side.
If "a different node" means "any other", and none of the above applies, it
would be helpful to know about the "why".
If that "different node" is a particular one, then set the job Requirements
accordingly, and hold/release (or cancel/resubmit) the job to be re-matched.
If there's no rule that can be formalized, we cannot help, I'm afraid.
In any case, if the executable hasn't set up any checkpointing itself, it
will restart from scratch.
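
For the "particular node" case, the edit-then-hold/release sequence might
look roughly like the sketch below. It only builds and prints the commands
unless dry_run is turned off. Cluster id 2025 is taken from the question;
the host name node02.example.org and the helper names rematch_commands and
run are made up for illustration. It also assumes that overwriting the
job's Requirements completely is acceptable, since condor_qedit replaces
the whole expression, including whatever condor_submit added to it.

```python
import shlex
import subprocess


def rematch_commands(cluster_id, target_machine):
    """Build the condor_* calls that pin a cluster to one machine and force a re-match."""
    # Assumption: replacing Requirements outright is fine for these jobs.
    # condor_qedit overwrites the whole expression, including any clauses
    # that condor_submit appended at submit time.
    requirements = f'(Machine == "{target_machine}")'
    cluster = str(cluster_id)
    return [
        ["condor_qedit", cluster, "Requirements", requirements],
        ["condor_hold", cluster],     # takes the cluster's running jobs off their nodes
        ["condor_release", cluster],  # back to idle, so they can be matched again
    ]


def run(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for argv in commands:
        print(shlex.join(argv))
        if not dry_run:
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    # Hypothetical target host; replace with the Machine name of the real node.
    run(rematch_commands(2025, "node02.example.org"))
```

Only cluster 2025 is touched here, so the user's other jobs (cluster 2021)
keep running where they are. Without checkpointing, the released jobs start
over from the beginning on the new node.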
- S