Please try
use POLICY : WANT_HOLD_IF(DISK_USAGE_EXCEEDED, 105, job exceeded requested disk)
(This is without quotation marks around the error message)
Tomer.
From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of ervikrant06@xxxxxxxxx <ervikrant06@xxxxxxxxx>
Sent: Wednesday, April 7, 2021 3:00 PM
To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] Taking action on job breaching scratch space limit

Thanks for inputs.
Tomer, I tried your configuration, but it doesn't work for me. On the worker node I can see that DiskUsage is 5GB and the disk allocated to the job is 3GB, while RequestDisk is 1GB.
# condor_who -af RequestDisk diskusage disk
1000000 5120014 3133873
1000000 5120014 3133873

I also modified the following parameter on both the submit and worker nodes. Before this modification, the worker node always showed a RequestDisk value of 1.
JOB_DEFAULT_REQUESTdisk = 1000000
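
As a quick sanity check of the figures above, here is a minimal Python sketch that converts the three condor_who values to GiB and evaluates the same comparison the hold policy is meant to make (DiskUsage > RequestDisk). It assumes the values are in KiB, which is how HTCondor reports RequestDisk, DiskUsage and Disk; the function names are made up for illustration.

```python
# Values copied from the condor_who output above (assumed to be in KiB).
request_disk_kib = 1_000_000
disk_usage_kib = 5_120_014
slot_disk_kib = 3_133_873


def kib_to_gib(kib):
    """Convert KiB to GiB."""
    return kib / (1024 * 1024)


def exceeds_request(disk_usage, request_disk):
    """Rough Python analogue of: DiskUsage =!= UNDEFINED && DiskUsage > RequestDisk.

    None stands in for UNDEFINED here.
    """
    return disk_usage is not None and disk_usage > request_disk


for name, value in [
    ("RequestDisk", request_disk_kib),
    ("DiskUsage", disk_usage_kib),
    ("Disk", slot_disk_kib),
]:
    print(f"{name}: {value} KiB = {kib_to_gib(value):.2f} GiB")

print("policy condition true:", exceeds_request(disk_usage_kib, request_disk_kib))
```

With these numbers the condition is true, so the job not going on hold points at the policy not being applied rather than at the values themselves.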
I am using condor version 8.5.8, which is old, but it does seem to support this policy, judging by condor_config_val use policy:want_hold_if.
With the newer condor 8.8.5 and the exact same configuration, I see the following message in /var/log/condor/StartLog, and the startd dies.
04/07/21 07:45:39 ERROR "Syntax error in WANT_HOLD_REASON expression: 'ifThenElse((DiskUsage =!= UNDEFINED && DiskUsage > RequestDisk), ""job exceeded requested disk"", UNDEFINED)'" at line 571 in file /slots/23/dir_1952943/userdir/.tmprZa7ap/BUILD/condor-8.8.5/src/condor_startd.V6/util.cpp
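
The doubled quotes in that expression suggest the reason text was wrapped in quotation marks a second time. The Python sketch below only illustrates that effect. The template string is a hypothetical stand-in inferred from the error message, not taken from the HTCondor source, and build_reason_expr is a made-up helper.

```python
# Hypothetical template: an assumption about how the reason text gets wrapped,
# inferred only from the StartLog message above.
TEMPLATE = 'ifThenElse(({condition}), "{reason}", UNDEFINED)'

CONDITION = "DiskUsage =!= UNDEFINED && DiskUsage > RequestDisk"


def build_reason_expr(condition, reason):
    """Substitute the condition and reason text into the template."""
    return TEMPLATE.format(condition=condition, reason=reason)


# Reason supplied with its own quotation marks: the quotes end up doubled.
quoted = build_reason_expr(CONDITION, '"job exceeded requested disk"')

# Reason supplied without quotation marks: a single, well-formed string literal.
unquoted = build_reason_expr(CONDITION, "job exceeded requested disk")

print(quoted)
print(unquoted)
```

Under that assumption, the first form reproduces the expression from the log, and the second is what the suggestion to drop the quotation marks would produce.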
Thanks & Regards,
Vikrant Aggarwal
On Tue, Apr 6, 2021 at 8:22 PM <tomerp@xxxxxxxxxxx> wrote: