Mailing List Archives Authenticated access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] load-balanced central manager?

Date: Fri, 08 Nov 2013 13:06:10 -0600
From: Greg Thain <gthain@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] load-balanced central manager?

Hi,

Is it possible to use Condor in a way like there's multiple running
instances of every component (including negotiator) in a pool, and in
this way to provide a load-balanced fail-tolerant environment? Or is
it possible to use only one single negotiator in a pool at once (I
know it's possible to do fail-over with had)?

It is possible to do fail-over with HAD, but it picks the one negotiatorto be running at any one time,. Should the current active negotiator godown, it will pick another to start. Note that if the negotiator or thecollector crash, all existing jobs stay running, and the schedds willeven start new jobs running if they can re-use the claims they alreadyhave. Separately, it is also possible to tell the negotiator that it isresponsible for some subset of the machines in the pool, and onlyprovide matches to those machines.

I've read about flocking also. So in that way there'd be a number of
pools available with their own central managers. What happens before a
job get flocked?

Before a job can be flocked, it has to fail to match in the local pool(either due to load or a conflict between job and machine requirements).

  Does flocking help to provide some kind of load
balancing between several central managers? Or it makes the situation
even worse because it requires extra work from central managers?

Generally speaking, there isn't a huge load on the central manager,except in the largest of pools, and even then, claim reuse helpstremendously. What can be a problem with the central managers is whenthen need to communicate with schedds over high latency WAN links,especially when strong security is enabled.


-greg

References:
- [HTCondor-users] load-balanced central manager?
  - From: Pek Daniel

Prev by Date: Re: [HTCondor-users] regarding Multiple Pool settings Question
Next by Date: Re: [HTCondor-users] Need help on howto run simutlations with batch files, thanks.
Previous by thread: [HTCondor-users] load-balanced central manager?
Next by thread: Re: [HTCondor-users] load-balanced central manager?
Index(es):
- Date
- Thread

Mailing List Archives

Authenticated access

Re: [HTCondor-users] load-balanced central manager?