
Re: [Condor-users] NEGOTIATE_ALL_JOBS_IN_CLUSTER

Date: Wed, 28 Jan 2009 09:23:26 -0600
From: Todd Tannenbaum <tannenba@xxxxxxxxxxx>
Subject: Re: [Condor-users] NEGOTIATE_ALL_JOBS_IN_CLUSTER

Steffen Grunewald wrote:

> For a homogeneous pool, and "simple" job clusters (identical specs for all
> jobs) NEGOTIATE_ALL_JOBS_IN_CLUSTER is suggested to be set to False.
> On the other hand, there may be situations where the first job of a single
> cluster continues to fail (for whatever reason: memory overcommit comes to
> mind) thus blocking all others.


Hi Steffen  -

What version of Condor are you working with?

Starting with Condor v7.0.x, the default built-in autoclustering mechanism in Condor should prevent the situations you describe above, and do so in a much more efficient and scalable manner than setting NEGOTIATE_ALL_JOBS_IN_CLUSTER to TRUE (which is the kiss of performance death if you have thousands of jobs).
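
To make the idea concrete, here is a rough toy sketch in Python of why grouping by autocluster avoids the blocking problem. It is not Condor's actual implementation: the job attribute memory_mb, the job ids, and the can_match callback are all made up for illustration, and it ignores the fact that a real match consumes a machine. The point is only that a failed match rules out jobs with the same significant attributes, not every other job in the submit cluster.

```python
def autocluster_key(job, significant_attrs):
    """Jobs with equal values for every significant attribute share a key."""
    return tuple(job.get(attr) for attr in significant_attrs)


def negotiate(jobs, significant_attrs, can_match):
    """Walk the queue in order, skipping autoclusters that already failed.

    Returns (ids of matched jobs, number of match attempts made).
    """
    rejected = set()   # autocluster keys that failed to match this cycle
    matched = []
    attempts = 0
    for job in jobs:
        key = autocluster_key(job, significant_attrs)
        if key in rejected:
            continue   # an equivalent job already failed; do not retry
        attempts += 1
        if can_match(job):
            matched.append(job["id"])
        else:
            rejected.add(key)
    return matched, attempts


# Hypothetical queue: one submit cluster whose first jobs ask for too much memory.
queue = [
    {"id": "1.0", "memory_mb": 65536},
    {"id": "1.1", "memory_mb": 65536},
    {"id": "1.2", "memory_mb": 1024},
    {"id": "1.3", "memory_mb": 1024},
]

# Assumed pool: machines with at most 2048 MB available.
matched, attempts = negotiate(queue, ["memory_mb"], lambda j: j["memory_mb"] <= 2048)
print(matched, attempts)
```

In this toy model job 1.0 fails, job 1.1 is skipped without a match attempt because it falls in the same autocluster, and jobs 1.2 and 1.3 are still considered, so the failing first job does not block the rest.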

> Is it possible to - e.g. once per given time period (4 hours?) - "flush"
> the queue by temporarily setting the macro to True?

Maybe something else is going on? With Condor v7.0.x and above and the default autoclustering, I assert you should never have to resort to NEGOTIATE_ALL_JOBS_IN_CLUSTER = True. Are you overriding autoclustering by explicitly setting SIGNIFICANT_ATTRIBUTES or some such in the condor_config on your submit hosts?
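
One quick way to check for such an override is to scan the config text for explicit assignments. The helper below is only a sketch: find_overrides is a hypothetical name, it assumes simple NAME = value lines with case-insensitive macro names, and it does not follow local config files or handle line continuations.

```python
def find_overrides(config_text,
                   names=("SIGNIFICANT_ATTRIBUTES",
                          "NEGOTIATE_ALL_JOBS_IN_CLUSTER")):
    """Return {NAME: value} for each watched macro assigned in config_text.

    A later assignment replaces an earlier one for the same name.
    """
    wanted = {n.upper() for n in names}
    found = {}
    for raw in config_text.splitlines():
        line = raw.strip()
        # Skip blanks, comments, and lines that are not assignments.
        if not line or line.startswith("#") or "=" not in line:
            continue
        name, value = line.split("=", 1)
        name = name.strip().upper()
        if name in wanted:
            found[name] = value.strip()
    return found


# Made-up config text for illustration only.
sample = """
# local overrides
NEGOTIATE_ALL_JOBS_IN_CLUSTER = True
Significant_Attributes = Owner
"""
print(find_overrides(sample))
```

An empty result means neither macro is set explicitly in the text that was scanned. On a live submit host, condor_config_val reports the value actually in effect, which is the more reliable check.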


best,
Todd

--
Todd Tannenbaum                       University of Wisconsin-Madison
Condor Project Research               Department of Computer Sciences
tannenba@xxxxxxxxxxx                  1210 W. Dayton St. Rm #4257
