[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] GPU memory request



*** Attention: This is an external email. Use caution responding, opening attachments or clicking on links. ***

Thanks!

That works.

 

From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Zach McGrew <mcgrewz@xxxxxxx>
Date: Monday, March 25, 2024 at 4:09
âPM
To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] GPU memory request

*** Attention: This is an external email. Use caution responding, opening attachments or clicking on links. ***

You'll want something like this:

require_gpus = GlobalMemoryMb >= 2048

To request a GPU with at least 2GB of GPU memory. The gpus_minimum_discovery is only in the 23.x feature branch I believe, not the 23.0 LTS or 10.9.

-Zach

________________________________________
From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Weatherby,Gerard <gweatherby@xxxxxxxx>
Sent: Monday, March 25, 2024 10:40 AM
To: HTCondor-Users Mail List
Subject: Re: [HTCondor-users] GPU memory request

You don't often get email from gweatherby@xxxxxxxxx Learn why this is important<https://urldefense.com/v3/__https://aka.ms/LearnAboutSenderIdentification__;!!Cn_UX_p3!ihSQvkpsCt_eyQq4BQNuQ92Wal7FzA4ivPTijtVHMtOWG_bdpSD1Vc3syxGJLQXkF8oVJOWVWDoF2njmWcjI$ >
*** Attention: This is an external email. Use caution responding, opening attachments or clicking on links. ***

This seems to work on the 23 nodes:

Universe = vanilla
gpus_minimum_memory = 1MB
request_gpus = 1
Executable   = /usr/bin/echo
Arguments = hello compute
output           = h100.txt
error            = h100.err
Log          = h100.log


however, thereâs a warning

WARNING: the line 'gpus_minimum_memory = 1MB' was unused by condor_submit. Is it a typo?

Is that just a condor_submit bug?

From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Todd L Miller via HTCondor-users <htcondor-users@xxxxxxxxxxx>
Date: Monday, March 25, 2024 at 12:08
âPM
To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Cc: Todd L Miller <tlmiller@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] GPU memory request
*** Attention: This is an external email. Use caution responding, opening attachments or clicking on links. ***

> Weâre running a 10.9 / 23 cluster and using
>
> use feature: GPUs
>
> How does a user request a certain amount of GPU memory?

        For recent releases:

https://urldefense.com/v3/__https://htcondor.readthedocs.io/en/latest/man-pages/condor_submit.html*gpus_minimum_memory__;Iw!!Cn_UX_p3!ldAOUS0h-q3CeQ7kXWVRzVV2rYk2DMGydSduBCmjzVfD56nUmfzrVx2-DhPHylqo2vW__YGH72WtaDyiKJm2vJ0i8vd861Xg$<https://urldefense.com/v3/__https://nam11.safelinks.protection.outlook.com/?url=""> >

        For older releases, you'll have to write an _expression_:

https://urldefense.com/v3/__https://htcondor.readthedocs.io/en/v10_0/man-pages/condor_submit.html*index-60__;Iw!!Cn_UX_p3!ldAOUS0h-q3CeQ7kXWVRzVV2rYk2DMGydSduBCmjzVfD56nUmfzrVx2-DhPHylqo2vW__YGH72WtaDyiKJm2vJ0i8ukBFdTk$<https://urldefense.com/v3/__https://nam11.safelinks.protection.outlook.com/?url=""> >

-- ToddM

_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://urldefense.com/v3/__https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users__;!!Cn_UX_p3!ihSQvkpsCt_eyQq4BQNuQ92Wal7FzA4ivPTijtVHMtOWG_bdpSD1Vc3syxGJLQXkF8oVJOWVWDoF2nX3yUpX$

The archives can be found at:
https://urldefense.com/v3/__https://lists.cs.wisc.edu/archive/htcondor-users/__;!!Cn_UX_p3!ihSQvkpsCt_eyQq4BQNuQ92Wal7FzA4ivPTijtVHMtOWG_bdpSD1Vc3syxGJLQXkF8oVJOWVWDoF2hX-KhZU$