[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Move jobs of users from one to another worker node



Hi Gagan,

moving whole job states between machines is not possible with Condor (I am unsure, if another LRMS can achieve something like that especially since there is no freezer with cgroups v2 anymore. Maybe if you have MPI applications, but these would have to sort their memory handling themselves, I guess)

of course, you can always put a job onto hold, edit its rerquirements and resubmit it targeting another EP.

Cheers,
  Thomas

On  2025-11-27 09:48, gagan tiwari wrote:
Hi Steffen,
 Â Â Â Â Â Â Â Â Â ÂThanks for your response.

So, please let me know how to achieve it even if the jobs start from the starting point on another node.

Lets say user Tom has submitted 4 jobs. 2 of his jobs are running with cluster id 2021 and another 2 jobs with cluster id 2025. So, I need to move his job with cluster id 2025 to a different worker node.

Plz let me know how to achieve this.

Thanks,
Gagan






On Wed, Nov 26, 2025 at 3:05âPM Steffen Grunewald <steffen.grunewald@xxxxxxxxxx <mailto:steffen.grunewald@xxxxxxxxxx>> wrote:

    On Wed, 2025-11-26 at 14:54:49 +0530, gagan tiwari wrote:
     > Hi Guys,
     >Â Â Â Â Â Â Â Â Â Â ÂPlease let me know how to move all running
    jobs of a
     > user to another worker node so that they start from the same
    point on that
     > worker node.

    Hi -

    With the Standard Universe no longer with us, there's no
    "stupid" (memory and I/O)
    checkpointing anymore - there's no means to transfer a Vanilla
    Universe job to
    another slot and continue running it.
    Unless the user takes care of saving states preiodically, a job will
    restart from
    square one.

    Sorry not to be helpful,
     ÂSteffen
    _______________________________________________
    HTCondor-users mailing list
    To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx
    <mailto:htcondor-users-request@xxxxxxxxxxx> with a
    subject: Unsubscribe

    The archives can be found at: https://www-auth.cs.wisc.edu/lists/
    htcondor-users/ <https://www-auth.cs.wisc.edu/lists/htcondor-users/>


_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe

The archives can be found at: https://www-auth.cs.wisc.edu/lists/htcondor-users/

Attachment: smime.p7s
Description: S/MIME Cryptographic Signature