Hi Patrick,
Since there is a matching slot, your job(s) may be cycling between startup and failure. What does condor_q <cluster id> -af NumShadowStarts say? A non-zero value means the jobs are matching and starting but then failing quickly, especially if the number is large.
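If there are several jobs in the cluster, the same check can be scripted. The following is only a rough sketch, not an official HTCondor tool: it assumes condor_q is on the PATH, that an unset attribute is printed as "undefined", and that a threshold of 3 starts is a reasonable cut-off for "cycling". The function names and the threshold are made up for illustration.

```python
import subprocess


def parse_shadow_starts(af_output):
    """Parse 'ClusterId ProcId NumShadowStarts' lines into {'cluster.proc': count}."""
    counts = {}
    for line in af_output.splitlines():
        fields = line.split()
        if len(fields) != 3:
            continue  # skip blank or unexpected lines
        cluster, proc, starts = fields
        # Assumption: a job that has never started reports "undefined" here.
        counts[f"{cluster}.{proc}"] = int(starts) if starts.isdigit() else 0
    return counts


def is_cycling(starts, threshold=3):
    """Hypothetical heuristic: several shadow starts suggest a start/fail loop."""
    return starts >= threshold


def query_shadow_starts(cluster_id):
    """Run condor_q for one cluster and return the parsed start counts."""
    result = subprocess.run(
        ["condor_q", str(cluster_id), "-af",
         "ClusterId", "ProcId", "NumShadowStarts"],
        capture_output=True, text=True, check=True,
    )
    return parse_shadow_starts(result.stdout)


if __name__ == "__main__":
    # Cluster 4 is the job shown in this thread; substitute your own cluster id.
    for job, starts in query_shadow_starts(4).items():
        state = "likely cycling" if is_cycling(starts) else "not cycling"
        print(f"{job}: NumShadowStarts={starts} ({state})")
```

A job flagged as likely cycling is one to look at more closely, for example in the job's log file, to see why each start attempt ends.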
-Cole Bollig
From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Patrick Claflin <pat@xxxxxxxxxxx>
Sent: Friday, March 7, 2025 8:05 AM
To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>; Beyer, Christoph <christoph.beyer@xxxxxxx>
Subject: Re: [HTCondor-users] Trouble getting jobs to run even though there is a valid machine.

Hi Christoph,
Thanks for responding! I did try the -better-analyze option and it shows me that I have an eligible machine. I tried your suggestion with the "reverse" option and it looks like it's ready to run the job but will not. Have you ever encountered an issue like this? Thanks again for your suggestions.

Best regards,
Patrick

PS C:\Users\pat> condor_q -better-analyze:reverse -machine ASG-PAT-7080.CAMPUS.CU.CLEMSON.EDU

-- Schedd: ASG-PAT-7080.CAMPUS.CU.CLEMSON.EDU : <130.127.55.243:9618?...

-- Slot: slot1@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx : Analyzing matches for 1 Jobs in 1 autoclusters

The Requirements expression for this slot is

    Start && (WithinResourceLimits)

  START is
    true

  WithinResourceLimits is
    (MY.Cpus > 0 && TARGET.RequestCpus <= MY.Cpus && MY.Memory > 0 && TARGET.RequestMemory <= MY.Memory && MY.Disk > 0 && TARGET.RequestDisk <= MY.Disk && (TARGET.RequestGPUs is undefined || MY.GPUs >= TARGET.RequestGPUs))

    [0]  : Start
    [1]  : MY.Cpus > 0
    [2]  : TARGET.RequestCpus <= MY.Cpus
    [3]  : [1] && [2]
    [4]  : MY.Memory > 0
    [5]  : [3] && [4]
    [6]  : TARGET.RequestMemory <= MY.Memory
    [7]  : [5] && [6]
    [8]  : MY.Disk > 0
    [9]  : [7] && [8]
    [10] : TARGET.RequestDisk <= MY.Disk
    [11] : [9] && [10]
    [12] : TARGET.RequestGPUs is undefined
    [13] : MY.GPUs >= TARGET.RequestGPUs
    [14] : [12] || [13]
    [15] : [11] && [14]
    [16] : [0] && [15]

This slot defines the following attributes:

    Cpus = 16
    Disk = 21436520
    GPUs = 1
    Memory = 32480

Job 4.0 has the following attributes:

    TARGET.RequestCpus = 1
    TARGET.RequestDisk = 10240
    TARGET.RequestMemory = 32

The Requirements expression for this slot reduces to these conditions:

        Clusters
Step    Matched    Condition
-----   ---------  ---------
[2]     1          TARGET.RequestCpus <= MY.Cpus
[6]     1          TARGET.RequestMemory <= MY.Memory
[10]    1          TARGET.RequestDisk <= MY.Disk
[12]    1          TARGET.RequestGPUs is undefined

slot1@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx: Run analysis summary of 1 jobs.
    1 (100.00 %) match both slot and job requirements.
    1 match the requirements of this slot.
    1 have job requirements that match this slot.

Patrick Claflin
System Administrator/Developer
Clemson Center for Geospatial Technologies (www.clemsongis.org)
Clemson University
(864) 656-7462
pat@xxxxxxxxxxx

On 3/7/2025 2:01 AM, Beyer, Christoph wrote: