Hi,
    Â
    is more information needed?
    
    I was thinking to setup a user wrapper to insert the soft limit in
    cgroup and perhaps even a hard one on a per job basis.
    
    cheers
    alessandra
    
    On 19/10/2017 09:42, Alessandra Forti
      wrote:
    
    
      
      Hi,
      
      I've setup cgroups on my htcondor cluster some months ago. I
      expected cgroups to handle soft limits and htcondor to kill with
      SYTEM_PERIODIC_REMOVE when the limit is twice the requested
      memory. However last week we had a user running havoc on the nodes
      and using up to 35GB of RSS when his limit should have been 4GB. 
      
      My settings are as follows
      
      * On the WNs
      
      # Enable CGROUP
       BASE_CGROUP = /system.slice/condor.service
       CGROUP_MEMORY_LIMIT = soft
      
      * On the head node 
      
      RemoveMemoryUsage = ( ResidentSetSize_RAW >
        2000*RequestMemory )
        SYSTEM_PERIODIC_REMOVE = $(RemoveMemoryUsage)Â ||Â
        <OtherParameters>
      
      this is a set up other sites have. 
      
      cgroup doesn't have any limit set neither soft nor hard.
      
      So the questions are two 
      
      1) Why SYSTEM_PERIODIC_REMOVEÂ didn't work? Here is an example of
      job that exceeded the limit 4GB limit
      
      condor_history 66469.0 -autoformat ClusterId
        2000*RequestMemory ResidentSetSize_RAW
        66469 4000000 34723028
      
      2) Shouldn't htcondor set the job soft limit with this
      configuration? or is the site expected to set the soft limit
      separately? 
      
      thanks
      
      cheers
      alessandra
      
      -- 
Respect is a rational process. \\//
Fatti non foste a viver come bruti, ma per seguir virtute e canoscenza(Dante)
For Ur-Fascism, disagreement is treason. (U. Eco)
But but but her emails... covfefe!
      
      
      
      _______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/
    
    
    -- 
Respect is a rational process. \\//
Fatti non foste a viver come bruti, ma per seguir virtute e canoscenza(Dante)
For Ur-Fascism, disagreement is treason. (U. Eco)
But but but her emails... covfefe!