Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [condor-users] condor_shadow timeout when loosing contact withstartd
- Date: 26 Jan 2004 14:00:52 -0600
- From: Geoff Lovett <geoff.lovett@xxxxxxxxxxxxxxxxxxx>
- Subject: Re: [condor-users] condor_shadow timeout when loosing contact withstartd
Can you show me from your config file how you have this set up? In
mine, I don't see a SHADOW_UPDATE_INTERVAL. Is this something you
added, or are you referring to this:
## How often should the schedd send a keep alive message to any
## startds it has claimed? (5 minutes)
ALIVE_INTERVAL = 300
Also, I don't see where the default for condor_starter is to update
every 20 minutes (I also searched for 1200s), and for the shadow to
except after 3 missed updates. I see that condor_schedd will relenquish
a match after a condor_shadow excepts 5 times (by default) using
MAX_SHADOW_EXCEPTIONS, but this seems different.
Thanks,
Geoff
On Mon, 2004-01-26 at 13:43, Zachary Miller wrote:
> On Mon, Jan 26, 2004 at 01:26:45PM -0600, Geoff Lovett wrote:
> > Hello, I've noticed that when condor_shadow looses contact with
> > condor_startd on an execute machine, it typically takes roughly 2 hours
> > for the shadow to notice that the startd is gone and cause an exception,
> > thereby putting the job back into the queue. My question is, can this
> > timeout be configured?
>
> i think you mean the condor_starter and not the condor_startd. the
> starter is the daemon which launches and manages the job on the execute
> machine.
>
> by default it sends an update every 20 minutes and then shadow should
> except after 3 missed updates, i.e. one hour. i'm not sure why it is
> taking 2 hours for you... maybe i'm wrong about something.
>
> anyhow, this is configurable via the condor_config:
>
> SHADOW_UPDATE_INTERVAL = 300
> # 300 seconds == 5 minutes
>
>
> cheers,
> -zach
>
> Condor Support Information:
> http://www.cs.wisc.edu/condor/condor-support/
> To Unsubscribe, send mail to majordomo@xxxxxxxxxxx with
> unsubscribe condor-users <your_email_address>
Condor Support Information:
http://www.cs.wisc.edu/condor/condor-support/
To Unsubscribe, send mail to majordomo@xxxxxxxxxxx with
unsubscribe condor-users <your_email_address>