Mailing List Archives Authenticated access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Tuning condor_defrag

Date: Tue, 21 Apr 2015 17:44:11 -0500
From: Todd Tannenbaum <tannenba@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] Tuning condor_defrag

On 4/21/2015 4:04 PM, Anthony Tiradani wrote:

Apologies if you get this twice, I think I sent this to the wrong address
originally.

For the moment we are avoiding large scale fragmentation of our cluster by
requesting that our stakeholders request jobs in 8 core chunks.  We only have a
small number of other jobs coming in with different request sizes.  However, we
want to be able to handle varying request sizes with greater frequency and that
requires tuning the condor_defrag to avoid slot starvation.

What solutions do other sites uses?  Preferably, I would like to have something
that "auto-tunes" the condor_defrag settings.

We have been doing some thinking about how to 'auto-tune' condor_defragas well, in a similar direction as to how the condor_rooster knows when(and which) machines to wake up from hibernation. With thecondor_rooster approach, when nobody is using an execute node andHTCondor takes it offline by hibernating, the negotiator leaves behindhints in the machine ClassAd that effectively say "if this machine wereactually awake, I could match it". The condor_rooster then incorporatesthese hints from the negotiator in its wake up policy.

Similarly, the matchmaker could leave behind hints like "I could makemore desirable matches to machine X if it was drained", whichcondor_defrag could then incorporate into its policy to "auto-tune".Several gory details about this line of thinking are written down inthis first-draft developer design document at http://goo.gl/eMwJCv.Interested in your feedback as always...


thanks
Todd

References:
- [HTCondor-users] Tuning condor_defrag
  - From: Anthony Tiradani

Prev by Date: [HTCondor-users] Tuning condor_defrag
Next by Date: Re: [HTCondor-users] the job is in idle status, it doesn't run
Previous by thread: [HTCondor-users] Tuning condor_defrag
Next by thread: [HTCondor-users] Anyway to keep track of values in ClassAds?
Index(es):
- Date
- Thread

Mailing List Archives

Authenticated access

Re: [HTCondor-users] Tuning condor_defrag