[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] GPU memory request



*** Attention: This is an external email. Use caution responding, opening attachments or clicking on links. ***

 

This seems to work on the 23 nodes:

Universe = vanilla

gpus_minimum_memory = 1MB

request_gpus = 1

Executable   = /usr/bin/echo

Arguments = hello compute

output           = h100.txt

error            = h100.err

Log          = h100.log


however, thereâs a warning

WARNING: the line 'gpus_minimum_memory = 1MB' was unused by condor_submit. Is it a typo?

 

Is that just a condor_submit bug?

 

From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Todd L Miller via HTCondor-users <htcondor-users@xxxxxxxxxxx>
Date: Monday, March 25, 2024 at 12:08
âPM
To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Cc: Todd L Miller <tlmiller@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] GPU memory request

*** Attention: This is an external email. Use caution responding, opening attachments or clicking on links. ***

> Weâre running a 10.9 / 23 cluster and using
>
> use feature: GPUs
>
> How does a user request a certain amount of GPU memory?

        For recent releases:

https://urldefense.com/v3/__https://htcondor.readthedocs.io/en/latest/man-pages/condor_submit.html*gpus_minimum_memory__;Iw!!Cn_UX_p3!ldAOUS0h-q3CeQ7kXWVRzVV2rYk2DMGydSduBCmjzVfD56nUmfzrVx2-DhPHylqo2vW__YGH72WtaDyiKJm2vJ0i8vd861Xg$

        For older releases, you'll have to write an _expression_:

https://urldefense.com/v3/__https://htcondor.readthedocs.io/en/v10_0/man-pages/condor_submit.html*index-60__;Iw!!Cn_UX_p3!ldAOUS0h-q3CeQ7kXWVRzVV2rYk2DMGydSduBCmjzVfD56nUmfzrVx2-DhPHylqo2vW__YGH72WtaDyiKJm2vJ0i8ukBFdTk$

-- ToddM