Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Condor-users] rebooting a submission node
- Date: Tue, 17 May 2005 14:07:14 -0500
- From: Erik Paulson <epaulson@xxxxxxxxxxx>
- Subject: Re: [Condor-users] rebooting a submission node
On Tue, May 17, 2005 at 02:54:30PM -0400, Daniel Loughlin wrote:
> I'd like to take down a submission node that currently has a bunch of
> condor_shadow processes. I am running 6.7.1 (Yes, I'm upgrading soon.).
> If I condor_off this node, make my change, and reboot the machine, will
> the condor_shadow processes respawn after the condor_master and schedd
> startup after the reboot?
>
> I also want to avoid having the condor_starter processes on the
> execution nodes that communicate to condor_shadow on the submission node
> die off.
>
> Basically I want to know if it possible to reboot a submission node
> without causing headaches for the user in a production environment?
>
Jobs that have been submitted with a JobLeaseDuration set will survive
a reboot of the submit machine. Anything jobs without the job lease duration
will be checkpointed (if possible) and then resumed or restarted when
the condor_schedd starts running again, but will not be running while
the submit machine is down.
-Erik