[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] HTCondor 24 and STARTER_HIDE_GPU_DEVICES



Dear all,

Nobody else is experiencing issues with the STARTER_HIDE_GPU_DEVICES variable set to True in HTCondor 24.0.X, unlike me. Is that correct? Would it be safe to set STARTER_HIDE_GPU_DEVICES = False? The GPUs machines are the only ones we still have on 23.0.XX version due to this issue.

Thank you again.

Best regards,

Carles

On Mon, 19 May 2025 at 19:43, Greg Thain via HTCondor-users <htcondor-users@xxxxxxxxxxx> wrote:
On 5/16/25 07:33, Carles Acosta wrote:
>
> According to the documentation, STARTER_HIDE_GPU_DEVICES is supposed
> to hide only the non-assigned GPUs from the job. Since the job is
> correctly submitted with a request_gpu and HTCondor is assigning one,
> I would expect the assigned GPU to remain visible even with this
> setting enabled.
>
> Am I misunderstanding how STARTER_HIDE_GPU_DEVICES is supposed to work?


Hi Carles:

You are understanding correctly how STARTER_HIDE_GPU_DEVICES is supposed
to work, but it appears that something is going wrong with it. Can you
send me the StarterLog.slot_xxx from a job with STARTER_HIDE_GPU_DEVICES
is true, and I can try to see what is going on? You can email me the
file directly if you are concerned about ip address, etc. in the StarterLog.


-greg


_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe

Join us in June at Throughput Computing 25: https://osg-htc.org/htc25

The archives can be found at: https://www-auth.cs.wisc.edu/lists/htcondor-users/


--
Carles Acosta i Silva
PIC (Port d'Informacià CientÃfica)
Campus UAB, Edifici D
E-08193 Bellaterra, Barcelona
Tel: +34 93 581 33 08
Fax: +34 93 581 41 10
AvÃs - Aviso - Legal Notice: Âhttp://legal.ifae.es