HTCondor Project List Archives



[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: [Condor-devel] BUG Report - condor_starter fails if DNS reverse lookup fails



> good one, folks. ;) thanks for the detailed investigation and
> write-up.  i know all of the code you wandered through quite well, and
> everything you said makes sense.
> 
> condor is *very* susceptible to DNS being down.  i wouldn't be
> surprised if you found 10 other cases where something blew up that
> shouldn't if DNS isn't working. :( so, it's a little daunting to try
> to make condor resilient to DNS being down, but it's worth trying. ;)

Not exactly the common case, I know... :-)

> time permitting i'll look more closely into your proposed solutions.
> i think changing the Daemon object to return the addr when the Name is
> still NULL might be ok, but i'd really have to think about that more
> closely before i agreed that was The Right Thing(tm).  

Yeah, there's no telling how a pervasive change like that can affect
things.

> anyway, i'll
> add this to my (overflowing) plate and get back to you ASAP.  are you
> hoping for a fix for a specific release for a specific need, or do you
> just want to see it fixed Soon(tm) to improve the overall stability
> and robustness of condor?

We fixed our DNS, so we're running again :-).  We'll opt for Soon(tm).

Thanks!
-Mike