Mailing List Archives Authenticated access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] can Condor somehow be a HA?

Date: Tue, 23 May 2017 09:09:50 +0100
From: lejeczek <peljasz@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] can Condor somehow be a HA?



On 22/05/17 17:02, Todd Tannenbaum wrote:

On 5/22/2017 6:45 AM, lejeczek wrote:
hi fellas
I've only started looking at htcondor, not having a goodunderstanding of it yet I wonder - htcondor has thatconcept of "central manager" and I wonder if this makesit a valid candidate for HA setup?
Does anybody have any experience with/thoughts onhtcondor as HA and could share it here?
many thanks
L.
Hi,
First off, understand that if your installations centralmanager dies, currently running jobs will continue to runand even new jobs will continue to get scheduled in manycases (i.e. new jobs will still get scheduled to claimedslots). Even in production pools, most sites have noproblem with rebooting their central manager or eventaking it down for an hour or two - while the centralmanger is down, users may notice that condor_status stopsworking, but practically all other common tools continueto work (condor_submit, condor_q, condor_rm, etc). Thusmany pools don't ever bother with an HA solution for thecentral manager.
If you are still concerned, the HTCondor central manageris actually very lightweight and holds very little state(just user prioirties), and this is very amenable to ahigh availability (HA) setup. You essentially have twochoices:
1. HTCondor can be configured to have two central managers(hot/hot), and automatically fail over as needed. See thesection in the HTCondor Manual titled "High Availabilityof the Central Manger" at
http://research.cs.wisc.edu/htcondor/manual/v8.6/3_13High_Availability.html#SECTION004132200000000000000
2. If you already run your services in a managedvisualized setup (Mesos+Marathan, OpenStack, vSphere,HyperV, etc) that supports failover, you could setup yourHTCondor central manager for HA leveraging thoseenvironments, i.e. same way you would setup a redundantemail server, for instance.
Hope the above helps
Todd

thanks, that is a great "shedding lights on" for a novicelike myself.

References:
- [HTCondor-users] can Condor somehow be a HA?
  - From: lejeczek
- Re: [HTCondor-users] can Condor somehow be a HA?
  - From: Todd Tannenbaum

Prev by Date: [HTCondor-users] 8th International Conference on Information, Intelligence, Systems and Applications (IISA 2017): Last Call for Papers
Next by Date: [HTCondor-users] a segfault at the very onset
Previous by thread: Re: [HTCondor-users] can Condor somehow be a HA?
Next by thread: [HTCondor-users] HTCondor workshop in Europe 2017: Hurry up!
Index(es):
- Date
- Thread

Mailing List Archives

Authenticated access

Re: [HTCondor-users] can Condor somehow be a HA?