Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
RE: [Condor-users] about priorities
- Date: Thu, 5 Aug 2004 15:46:44 +0100
- From: "Matt Hope" <Matt.Hope@xxxxxxxx>
- Subject: RE: [Condor-users] about priorities
off the top of my head that should work (though I can't remember if this will trigger preemption with only one user - easiest way is to try it.
> -----Original Message-----
> From: condor-users-bounces@xxxxxxxxxxx
> [mailto:condor-users-bounces@xxxxxxxxxxx]On Behalf Of
> Robert.Nordlund@xxxxxxxxxxxxxxxx
> Sent: 05 August 2004 15:41
> To: Condor-Users Mail List
> Subject: RE: [Condor-users] about priorities
>
>
>
>
>
>
> Any suggestions for the following condor_config entries on the execute
> machines?
>
> WANT_SUSPEND = FALSE
> WANT_VACATE = FALSE
> START = TRUE
> SUSPEND = FALSE
> CONTINUE = $(UWCS_CONTINUE)
> PREEMPT = FALSE
> KILL = $(UWCS_KILL)
> PERIODIC_CHECKPOINT = $(UWCS_PERIODIC_CHECKPOINT)
> PREEMPTION_REQUIREMENTS = $(UWCS_PREEMPTION_REQUIREMENTS)
> PREEMPTION_RANK = $(UWCS_PREEMPTION_RANK)
>
>
> Thanks,
> Bob Nordlund
>
>
>
> |---------+-------------------------------->
> | | "Matt Hope" |
> | | <Matt.Hope@xxxxxxxx> |
> | | Sent by: |
> | | condor-users-bounces@|
> | | cs.wisc.edu |
> | | |
> | | |
> | | 08/05/2004 10:36 AM |
> | | Please respond to |
> | | Condor-Users Mail |
> | | List |
> | | |
> |---------+-------------------------------->
>
> >-------------------------------------------------------------
> -------------------------------------------------|
> |
> |
> | To: "Condor-Users Mail List"
> <condor-users@xxxxxxxxxxx> |
> | cc:
> |
> | Subject: RE: [Condor-users] about priorities
> |
>
> >-------------------------------------------------------------
> -------------------------------------------------|
>
>
>
>
> the cluster/proc option should work fine for you then...
>
> just alter the rank expression on the execute machines and,
> assuming all
> other settings are at their defaults, it should all just work...
>
> try it with cpusoak test jobs, send a vacate to one of the
> machines and see
> what happens.
>
> > -----Original Message-----
> > From: condor-users-bounces@xxxxxxxxxxx
> > [mailto:condor-users-bounces@xxxxxxxxxxx]On Behalf Of
> > Robert.Nordlund@xxxxxxxxxxxxxxxx
> > Sent: 05 August 2004 15:25
> > To: Condor-Users Mail List
> > Subject: RE: [Condor-users] about priorities
> >
> >
> >
> >
> >
> >
> > Matt,
> >
> > Thanks for the quick reply. I'll try to clarify my
> > situation. Jobs are
> > submitted with a five-second delay. In the case where the
> > first ten jobs
> > are running and utilizing all available pool resources and
> > then a hardware
> > failure causes the available pool resources to decrease and
> > thereby cause a
> > job failure, is there a way to remove an active
> > lower-priority job(s) to
> > make room for the higher priority job that failed? This is a
> > dedicated
> > pool with only one scheduler. I control the horizontal and
> > the vertical.
> >
> > Thanks,
> > Bob Nordlund
> >
> >
> >
> >
> > |---------+-------------------------------->
> > | | "Matt Hope" |
> > | | <Matt.Hope@xxxxxxxx> |
> > | | Sent by: |
> > | | condor-users-bounces@|
> > | | cs.wisc.edu |
> > | | |
> > | | |
> > | | 08/05/2004 10:12 AM |
> > | | Please respond to |
> > | | Condor-Users Mail |
> > | | List |
> > | | |
> > |---------+-------------------------------->
> >
> > >-------------------------------------------------------------
> > -------------------------------------------------|
> > |
> > |
> > | To: "Condor-Users Mail List"
> > <condor-users@xxxxxxxxxxx>
> |
> > | cc:
> > |
> > | Subject: RE: [Condor-users] about priorities
> > |
> >
> > >-------------------------------------------------------------
> > -------------------------------------------------|
> >
> >
> >
> >
> > assumption - jobs are submitted in time order with sufficient
> > gap between
> > them to account for time differences between submit machines...
> >
> > place
> > RANK=-TARGET.QDate
> > on all execute machines.
> >
> > assumption - the jobs are all submitted from one cluster..
> > place
> > RANK=-TARGET.ProcId
> > on all execute machines.
> > If you wish to have multiple sets of these ordered jobs at
> > once make sure
> > you either include the cluster id as so (works for jobs up to
> > 100000000
> > processes big!)
> > RANK= (-TARGET.ClusterId * 100000000) - TARGET.ProcId
> >
> >
> > assumption you can control the submission process such that
> > every job is
> > given a unique monotonic increasing/decreasing number (easy
> > if all done
> > from one cluster but still possible otherwise)
> > use for example
> > +MY_ORDER_NUMBER=[x]
> > in the submit file and ensure this ticks on every time
> >
> > increasing
> > RANK=-TARGET.MY_ORDER_NUMBER
> >
> > decreasing
> > RANK=TARGET.MY_ORDER_NUMBER
> >
> > if you have to play nicely with other machines this rank
> > expression must be
> > modified to correctly handle jobs which don't have this
> > attribute (using
> > =!= and/or =?=)
> >
> > Igf you haven't got control of the execute machines rank
> > you'll have some
> > issues...
> >
> > also you will have to deal with the possibility that:
> >
> > jobs 1-10 start
> > job 5 is vacated before finishing for some reasona at roughly
> > the same time
> > job 11 starts.
> > job 11 then completes before the negotiation cycle can swap 6
> > for it. job
> > 11 has now run before 6 and nothing can change that.
> >
> > This may not be a problem but this was not clear from your request.
> >
> > all sorts of options available really.
> >
> > > -----Original Message-----
> > > From: condor-users-bounces@xxxxxxxxxxx
> > > [mailto:condor-users-bounces@xxxxxxxxxxx]On Behalf Of
> > > Robert.Nordlund@xxxxxxxxxxxxxxxx
> > > Sent: 05 August 2004 14:51
> > > To: Condor-Users Mail List
> > > Subject: Re: [Condor-users] about priorities
> > >
> > >
> > >
> > >
> > >
> > >
> > > Hello all.
> > >
> > > I am running a dedicated pool of Windows 2000 servers with
> > one central
> > > submit machine. I submit 20 jobs with multiple processes and
> > > want to make
> > > sure that they are run in succession, i.e. Job1 is more
> > important than
> > > Job2, etc. If the first 10 jobs are concurrently running how
> > > can I make
> > > sure that if Job1 fails due to a hardware failure that it
> > > will preempt the
> > > the claim of the lowest priority job? Is this something
> > > fairly easy to
> > > accomplish with priorities/preemption?
> > >
> > > Thank you,
> > >
> > > Bob Nordlund
> > >
> > >
> > >
> > > |---------+-------------------------------->
> > > | | Erik Paulson |
> > > | | <epaulson@xxxxxxxxxxx|
> > > | | > |
> > > | | Sent by: |
> > > | | condor-users-bounces@|
> > > | | cs.wisc.edu |
> > > | | |
> > > | | |
> > > | | 08/04/2004 10:43 PM |
> > > | | Please respond to |
> > > | | Condor-Users Mail |
> > > | | List |
> > > | | |
> > > |---------+-------------------------------->
> > >
> > > >-------------------------------------------------------------
> > > -------------------------------------------------|
> > > |
> > > |
> > > | To: Condor-Users Mail List
> > > <condor-users@xxxxxxxxxxx>
> > > |
> > > | cc:
> > > |
> > > | Subject: Re: [Condor-users] about priorities
> > > |
> > >
> > > >-------------------------------------------------------------
> > > -------------------------------------------------|
> > >
> > >
> > >
> > >
> > > On Wed, Aug 04, 2004 at 06:16:45PM -0500, David A. Kotz wrote:
> > > > I think what you want to do is to set the following in your
> > > > condor_config:
> > > >
> > > > SUSPEND = FALSE
> > > > PREEMPT = FALSE
> > > > KILL = FALSE
> > > >
> > >
> > > That will help prevent the local machine from evicting the
> > > job (you'll also
> > > probably want have START = TRUE, or some such)
> > >
> > > Those expresssions have nothing to do with preemption for priority
> > > purposes, though. Condor can "preempt" a job for two
> reasons - one,
> > > because the local conditions on the machine force us to get
> > > rid of a job,
> > > or two, because there is a user with a higher priority who
> > > wants to run a
> > > job. That preemption is controlled by
> > PREEMPTION_REQUIREMENTS. It is a
> > > pool-wide setting that is evaluated only when Condor has a higher
> > > priority user who wants to run a job. It is tested against
> > > each machine
> > > in the pool to see if the job currently running on that machine
> > > can be preempted.
> > >
> > > To prevent priority preemption, set:
> > >
> > > PREEMPTION_REQUIREMENTS = FALSE
> > >
> > > in the config file of your central manager.
> > >
> > > -Erik
> > >
> > > each machine of the pool.
> > > > - dave
> > > >
> > > >
> > > > On Wed, 2004-08-04 at 03:44 -0700, Fernando Rannou wrote:
> > > > > Hello,
> > > > > I'm a little confused on how Condor handles priorities.
> > > > > On Section 2.7.2 of the Condor manual it reads:
> > > > > ---------
> > > > > "The priority of each individual user changes according
> > > to the number
> > > > > of resources the individual is using.
> > > > > ...
> > > > >
> > > > > "For instance, if a low priority user is utilizing
> all available
> > > machines
> > > > > and suddenly a higher priority user submits jobs, Condor
> > > > > will immediately checkpoint and vacate jobs belonging to
> > > > > the lower priority user."
> > > > >
> > > > > ---------
> > > > > Which priority do they refer to? Effective, Real, Factor??
> > > > >
> > > > > We need that once a job is running it never, ever
> > > > > gets preempted, suspended, killed, or whatever
> > > > > no matter how long it's been running and
> > > > > no matter how low its priority is.
> > > > >
> > > > > How can I accomplished that?
> > > > > Thanks
> > > > >
> > > > > Fernando
> > > > >
> > > > >
> > > > > ----------------------------------------------------------
> > > > > IMPORTANT WARNING: This email (and any attachments) is
> > > only intended
> > > for the use of the person or entity to which it is
> > addressed, and may
> > > contain information that is privileged and confidential. You, the
> > > recipient, are obligated to maintain it in a safe, secure and
> > > confidential
> > > manner. Unauthorized redisclosure or failure to maintain
> > > confidentiality
> > > may subject you to federal and state penalties. If you are not the
> > > recipient, please immediately notify us by return email, and
> > > delete this
> > > message from your computer.
> > > > > ----------------------------------------------------------
> > > > > _______________________________________________
> > > > > Condor-users mailing list
> > > > > Condor-users@xxxxxxxxxxx
> > > > > http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> > > > --
> > > > David A. Kotz <dkotz@xxxxxxxxxxxxx>
> > > >
> > > > _______________________________________________
> > > > Condor-users mailing list
> > > > Condor-users@xxxxxxxxxxx
> > > > http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> > > _______________________________________________
> > > Condor-users mailing list
> > > Condor-users@xxxxxxxxxxx
> > > http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> > >
> > >
> > >
> > >
> > >
> > > **************************************************************
> > > ***********
> > > PRIVILEGED AND CONFIDENTIAL: This communication, including
> > > attachments, is for the exclusive use of addressee and may
> > > contain proprietary, confidential and/or privileged
> > > information. If you are not the intended recipient, any use,
> > > copying, disclosure, dissemination or distribution is
> > > strictly prohibited. If you are not the intended recipient,
> > > please notify the sender immediately by return e-mail, delete
> > > this communication and destroy all copies.
> > > **************************************************************
> > > ***********
> > >
> > > _______________________________________________
> > > Condor-users mailing list
> > > Condor-users@xxxxxxxxxxx
> > > http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> > >
> >
> >
> > *****************************************************************
> > Gloucester Research Limited believes the information
> > provided herein is reliable. While every care has been
> > taken to ensure accuracy, the information is furnished
> > to the recipients with no warranty as to the completeness
> > and accuracy of its contents and on condition that any
> > errors or omissions shall not be made the basis for any
> > claim, demand or cause for action.
> > *****************************************************************
> >
> > _______________________________________________
> > Condor-users mailing list
> > Condor-users@xxxxxxxxxxx
> > http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> >
> >
> >
> >
> >
> > **************************************************************
> > ***********
> > PRIVILEGED AND CONFIDENTIAL: This communication, including
> > attachments, is for the exclusive use of addressee and may
> > contain proprietary, confidential and/or privileged
> > information. If you are not the intended recipient, any use,
> > copying, disclosure, dissemination or distribution is
> > strictly prohibited. If you are not the intended recipient,
> > please notify the sender immediately by return e-mail, delete
> > this communication and destroy all copies.
> > **************************************************************
> > ***********
> >
> > _______________________________________________
> > Condor-users mailing list
> > Condor-users@xxxxxxxxxxx
> > http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> >
> _______________________________________________
> Condor-users mailing list
> Condor-users@xxxxxxxxxxx
> http://lists.cs.wisc.edu/mailman/listinfo/condor-users
>
>
>
>
>
> **************************************************************
> ***********
> PRIVILEGED AND CONFIDENTIAL: This communication, including
> attachments, is for the exclusive use of addressee and may
> contain proprietary, confidential and/or privileged
> information. If you are not the intended recipient, any use,
> copying, disclosure, dissemination or distribution is
> strictly prohibited. If you are not the intended recipient,
> please notify the sender immediately by return e-mail, delete
> this communication and destroy all copies.
> **************************************************************
> ***********
>
> _______________________________________________
> Condor-users mailing list
> Condor-users@xxxxxxxxxxx
> http://lists.cs.wisc.edu/mailman/listinfo/condor-users
>