
Re: [HTCondor-users] Running non-gpu job on gpu machine referring Whats_New_condor_week_2023



Hello,

Yes, I do have that in the submit file of the non-GPU job.

On Fri, Feb 7, 2025, 7:33 PM John M Knoeller via HTCondor-users <htcondor-users@xxxxxxxxxxx> wrote:
SLOT_TYPE_2_START = TARGET.BackfillJob

says that in order to match the backfill slot, a job must have

  BackfillJob=true

in the job ClassAd. Do your non-GPU jobs have that?
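
As a rough illustration of that rule, here is a toy model in plain Python. It is not HTCondor's ClassAd evaluator, and the function name slot_type_2_start and the dicts standing in for job ads are hypothetical. The point it sketches: the START expression only matches when the job ad defines BackfillJob and it is true; a missing attribute evaluates to UNDEFINED, which does not count as a match.

```python
# Toy model of SLOT_TYPE_2_START = TARGET.BackfillJob.
# Job ads are plain dicts here (an assumption for illustration only);
# real ClassAd attribute names are case-insensitive, dict keys are not.

UNDEFINED = object()  # stand-in for the ClassAd UNDEFINED value


def slot_type_2_start(job_ad):
    """Return True only if the job ad carries BackfillJob = true."""
    value = job_ad.get("BackfillJob", UNDEFINED)
    # UNDEFINED or false both mean the backfill slot refuses the job.
    return value is True


# Job submitted with "+BackfillJob = True" in the submit file.
backfill_job = {"Cmd": "sleep.sh", "BackfillJob": True}
# Job submitted without the custom attribute.
ordinary_job = {"Cmd": "sleep.sh"}

print(slot_type_2_start(backfill_job))
print(slot_type_2_start(ordinary_job))
```

Only the first job would be eligible for the backfill slot under this model; the second is rejected because the attribute is absent, not because it is false.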



From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Vikrant Aggarwal <ervikrant06@xxxxxxxxx>
Sent: Friday, February 7, 2025 3:01 PM
To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Subject: [HTCondor-users] Running non-gpu job on gpu machine referring Whats_New_condor_week_2023

Hello Experts,

I was reading the presentation Whats_New_condor_week_2023 and came across an interesting backfill feature that I wanted to use on a GPU machine.

Based on the presentation, I made this configuration. My GPU job runs on the machine without any trouble.

START = $(START)
use feature : GPUs
GPU_DISCOVERY_EXTRA = -extra
PreemptMaxRuntime = 4 * 24 * 60
ExemptMaxRuntime = 4 * 24 * 60

BackfillSlot = true
ResourceConflict = "GPUs"
use FEATURE : PartitionableSlot(1, 100%)
SLOT_TYPE_1_START = TARGET.RequestGpus > 0
SLOT_TYPE_2_BACKFILL = true
use FEATURE : PartitionableSlot(2, 90%, GPUs=0)
SLOT_TYPE_2_PREEMPT = size(ResourceConflict?:"") > 0
SLOT_TYPE_2_START = TARGET.BackfillJob

However, a non-GPU job stays idle, and -better-analyze doesn't reveal why.

executable = sleep.sh
transfer_executable = false
arguments = 600
should_transfer_files = NO
+BackfillJob = True
queue 1

This is what I see in the -better-analyze output for the second slot:

The Requirements expression for this slot reduces to these conditions:

       Clusters
Step    Matched  Condition
-----  --------  ---------
[0]           1  START
[1]           1  WithinResourceLimits


Am I missing anything in the configuration that is needed to make non-GPU jobs run on a GPU machine?

For clarity: at the time of testing, no GPU job was running on that machine; it was completely idle.

Also, has the PreferGPUJobs feature mentioned in the slides been introduced yet? I couldn't find anything about it in the release notes.




Thanks & Regards,
Vikrant Aggarwal