Re: [HTCondor-users] portionable slots and greedy users
- Date: Fri, 6 Oct 2023 13:44:28 -0500 (CDT)
- From: Todd L Miller <tlmiller@xxxxxxxxxxx>
- Subject: Re: [HTCondor-users] portionable slots and greedy users
> If you can't trust your users, hopefully the condor folks can offer a
> workable solution. I can't think of an easy way out, sorry.
The first post in the thread said that the long jobs were also
all characterized by having high memory requirements. You could write a
submit transform that matches whatever "high memory" means in this context
and inserts a concurrency limit. See
https://htcondor.readthedocs.io/en/latest/admin-manual/setting-up-special-environments.html#concurrency-limits
for details, but the idea is you set the maximum number of concurrently
running "high memory" jobs so that they can only use 95% of the pool.
(Maybe aim for 75% first and increase the limit as necessary? 95%
doesn't have a lot of slop...)
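Untested, but the transform and the limit could look something like the
following; the 32 GB cutoff, the HIGHMEM name, and the limit value are just
placeholders for whatever fits your pool:

    # schedd config: tag memory-hungry jobs with a concurrency limit.
    JOB_TRANSFORM_NAMES = $(JOB_TRANSFORM_NAMES) TagHighMem
    JOB_TRANSFORM_TagHighMem @=end
       # "high memory" is a guess here; RequestMemory is in MB.
       REQUIREMENTS RequestMemory >= 32768
       SET ConcurrencyLimits "HIGHMEM"
    @end

    # negotiator config: cap concurrent HIGHMEM jobs pool-wide. For a
    # 1000-slot pool of single-slot high-memory jobs, 950 would be the
    # 95% figure (or 750 to start, per the above).
    HIGHMEM_LIMIT = 950

If users already set concurrency_limits themselves you'd want to append
rather than overwrite, but that's the gist.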
That's if you want to reserve 5% of the pool for non-high-memory
jobs. This will waste capacity if you don't have "enough" of such jobs,
but should do a good job of ensuring small scheduling delays. If you
instead want the share of the pool's time spent running short jobs over
(roughly) the whole day to be 5%, you can use the same trick, but with
accounting groups instead of concurrency limits.
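I haven't tested this either, but with dynamic group quotas the split might
come out roughly like this; the group names are made up, and you'd still
need something (another submit transform, or user cooperation) to put jobs
into the right group:

    # negotiator config: split the pool roughly 95% / 5% by group.
    GROUP_NAMES = group_highmem, group_other
    GROUP_QUOTA_DYNAMIC_group_highmem = 0.95
    GROUP_QUOTA_DYNAMIC_group_other = 0.05
    # Let a group borrow the other's idle share, so over the day the
    # split is roughly proportional rather than a hard reservation.
    GROUP_ACCEPT_SURPLUS = True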
You can also empirically determine shortness. Write a submit
transform that sets allowed_job_duration (or allowed_execute_duration as
appropriate) to 300 and a periodic_release which removes the hold
automatically. The periodic_release can't change the value of
allowed_job_duration, but you can probably say something like:
allowed_job_duration = ifThenElse( NumHolds == 0, 300, undefined )
instead of just "300". HoldReasonCode 46 (or 47) is reserved for
allowed_job_duration (or allowed_execute_duration) being exceeded, so the
periodic_release expression should be easy to write.
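Untested, but put together as a transform that might come out roughly
like this (the transform name is made up, 300 is just the example cutoff
above, and code 46 assumes allowed_job_duration rather than
allowed_execute_duration):

    # schedd config: cap a job's first attempt at 300 seconds and
    # release the resulting hold automatically.
    JOB_TRANSFORM_NAMES = $(JOB_TRANSFORM_NAMES) CapShort
    JOB_TRANSFORM_CapShort @=end
       # Only the first attempt is capped; after one hold the cap goes away.
       SET AllowedJobDuration ifThenElse(NumHolds == 0, 300, undefined)
       # 46 == allowed_job_duration exceeded, so only that hold is released.
       SET PeriodicRelease (HoldReasonCode == 46)
    @end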
-- ToddM