Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [HTCondor-users] negotiator questions
- Date: Mon, 07 Mar 2016 15:02:48 +0000
- From: "Krieger, Donald N." <kriegerd@xxxxxxxx>
- Subject: Re: [HTCondor-users] negotiator questions
Thanks - very helpful.
Best - Don
> -----Original Message-----
> From: HTCondor-users [mailto:htcondor-users-bounces@xxxxxxxxxxx] On Behalf
> Of Brian Bockelman
> Sent: Monday, March 07, 2016 9:55 AM
> To: HTCondor-Users Mail List
> Subject: Re: [HTCondor-users] negotiator questions
>
>
> > On Mar 6, 2016, at 8:45 AM, Krieger, Donald N. <kriegerd@xxxxxxxx> wrote:
> >
> > Itâs my understanding that at each pass, the negotiator goes through each
> queued job to decide whether and which job slot to assign it.
> > I use the directive, Priority = 0-n to control which jobs run first but I also have
> 2 groups of jobs with disjoint IP address requirements to divide my jobs
> between two subsets of the condor pool.
> > My question is about the order and âcompletenessâ with which the negotiator
> âexaminesâ my queued jobs.
> >
> > I presume that it goes through every single one of them regardless of how
> well and how many it has matched to job slots as it proceeds through.
> > Yes?
>
> Mostly. The exceptions are:
> 1) If there are other users in the system, you might run out of fair share during
> the negotiation cycle and the negotiator moves onto other usersâ jobs before it
> gets through your complete list.
> 2) The schedd the negotiator is talking to times out or has some other error.
>
> >
> > I presume that it goes through them by âPriorityâ first and by fifo within each
> priority level.
> > Yes?
>
> Mostly true. You can assign other attributes (PreJobPrio* and PostJobPrio*)
> that affect the sort order for the jobs in the schedd. Further, things get more
> complicated if you are submitting from multiple schedds.
>
> >
> > I presume the time for each pass scales linearly as the total number of jobs
> queued.
> > Yes?
> >
>
> Nope. Identical jobs are grouped together into a single âresource requestâ.
> The negotiator builds a list of matches against the resource request, then sends
> them back to the schedd asynchronously.
>
> Hence, the more âidentical jobsâ (typically, the jobs resulting from a âqueue Nâ
> statement in the submit file), the less the negotiation time.
>
> > Can the negotiator get clobbered by a memory overrun due a very large
> number of queued jobs?
>
> The negotiator, no. It only considers a single resource request at a time - it
> does not buffer them all in memory (but it does buffer all machine ads in
> memory).
>
> The schedd, however, is more sensitive to number of queued jobs. Each takes
> up somewhere in the neighborhood of 50KB of RAM and a small amount of CPU
> time (which becomes noticeable once there are tens of thousands of idle jobs).
>
> Brian
>
>
> _______________________________________________
> HTCondor-users mailing list
> To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
> subject: Unsubscribe
> You can also unsubscribe by visiting
> https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
>
> The archives can be found at:
> https://lists.cs.wisc.edu/archive/htcondor-users/