Re: [HTCondor-users] htcondor cgroups and memory limits on CentOS7
- Date: Fri, 20 Oct 2017 11:26:48 -0500
- From: Todd Tannenbaum <tannenba@xxxxxxxxxxx>
- Subject: Re: [HTCondor-users] htcondor cgroups and memory limits on CentOS7
On 10/20/2017 9:44 AM, Alessandra Forti wrote:
Hi,
is more information needed?
Hi Alessandra,
The version of HTCondor you are using would be helpful :).
But I have some answers/suggestions below that I hope will help...
* On the head node
RemoveMemoryUsage = ( ResidentSetSize_RAW > 2000*RequestMemory )
SYSTEM_PERIODIC_REMOVE = $(RemoveMemoryUsage) || <OtherParameters>
So the questions are two:
1) Why didn't SYSTEM_PERIODIC_REMOVE work?
Because the (system_)periodic_remove expressions are evaluated by the 
condor_shadow while the job is running, and the *_RAW attributes are 
only updated in the condor_schedd.
A simple solution is to use the attribute MemoryUsage instead of ResidentSetSize_RAW.  So I think things will work as you want if you instead did:
  RemoveMemoryUsage = ( MemoryUsage > 2*RequestMemory )
  SYSTEM_PERIODIC_REMOVE = $(RemoveMemoryUsage)  || <OtherParameters>
Note that MemoryUsage is in the same units as RequestMemory, so you only need to multiply by 2 instead of 2000.
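To make the unit change concrete, here is a worked example with made-up numbers, assuming the usual HTCondor units (MemoryUsage and RequestMemory in megabytes, ResidentSetSize_RAW in kilobytes):
  # Hypothetical job with request_memory = 2048 (MB):
  #   new test fires when  MemoryUsage         >    2 * 2048 = 4096    (MB)
  #   old test fired when  ResidentSetSize_RAW > 2000 * 2048 = 4096000 (KB)
  # i.e. roughly the same threshold, just expressed in different units.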
You are not the first person to be tripped up by this. :(  I realize it is not at all intuitive.  I think I will add a quick patch to the code to allow _RAW attributes to be referenced inside job policy expressions, to spare the next person the same frustration.
Also, you may want to place your memory limit policy on the execute nodes via a startd policy expression, instead of having it enforced on the submit machine (what I think you are calling the head node).  The reason is that the execute node policy is evaluated every five seconds, while the submit machine policy is evaluated only every several minutes.  A runaway job could consume a lot of memory in a few minutes :).
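For reference, a minimal startd policy sketch along those lines (the names, comments, and hold reason are illustrative, not the one true recipe, so adapt to your pool and test before deploying).  MemoryUsage is the job's measured usage and Memory is what was provisioned in the slot:
  MEMORY_EXCEEDED = ( MemoryUsage =!= UNDEFINED && MemoryUsage > Memory )
  # evict the offending job rather than suspend it...
  PREEMPT = ($(PREEMPT)) || ($(MEMORY_EXCEEDED))
  WANT_SUSPEND = ($(WANT_SUSPEND)) && ($(MEMORY_EXCEEDED)) =!= TRUE
  # ...and put it on hold with a reason, so it does not simply rematch elsewhere
  WANT_HOLD = ($(MEMORY_EXCEEDED))
  WANT_HOLD_REASON = ifThenElse( $(MEMORY_EXCEEDED), "Job exceeded its request_memory", undefined )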
2) Shouldn't htcondor set the job soft limit with this configuration?  Or is the site expected to set the soft limit separately?
Personally, I think "soft" limits in cgroups are completely bogus.  The way the Linux kernel treats soft limits does not do in practice what anyone (including htcondor itself) expects.  I recommend setting CGROUP_MEMORY_LIMIT to either none or hard; soft makes no sense imho.
"CGROUP_MEMORY_LIMIT=hard" is clear to understand: if the job uses more 
memory than it requested, it is __immediately__ kicked off and put on 
hold.  This way users get a consistent experience.
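In config terms that is just the following on the execute nodes (using the knob name as written above):
  # enforce request_memory as a hard cgroup limit; over-use puts the job on hold
  CGROUP_MEMORY_LIMIT = hard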
If you want jobs to be able to go over their requested memory so long as 
the machine isn't swapping, consider disabling swap on your execute 
nodes (not a bad idea for compute servers in general) and simply leaving 
"CGROUP_MEMORY_LIMIT=none".  What will happen is if the system is 
stressed, eventually the Linux OOM (out of memory killer) will kick in 
and pick a process to kill.  HTCondor sets the OOM priority of job 
process such that the OOM killer should always pick job processes ahead 
of other processes on the system.  Furthermore, HTCondor "captures" the 
OOM request to kill a job and only allows it to continue if the job is 
indeed using more memory than requested (i.e. provisioned in the slot). 
This is probably what you wanted by setting the limit to soft in the 
first place.
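A sketch of that variant, with the swap change noted only as a comment since it is an OS-level step rather than an HTCondor knob:
  # no hard per-job cap; rely on the OOM killer plus HTCondor's OOM handling
  CGROUP_MEMORY_LIMIT = none
  # (and disable swap on the execute node itself, e.g. swapoff plus removing
  #  the swap entry from /etc/fstab; that part is not HTCondor configuration)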
I am thinking we should remove the "soft" option to CGROUP_MEMORY_LIMIT in future releases; it just causes confusion imho.  Curious if others on the list disagree...
Hope the above helps,
regards,
Todd