Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[HTCondor-users] How to reserve resources for GPU jobs
- Date: Mon, 25 Aug 2025 15:30:08 -0600
- From: "K._Scott Rowe" <krowe@xxxxxxxx>
- Subject: [HTCondor-users] How to reserve resources for GPU jobs
Hey there. Imagine I have an EP running HTCondor-23.0.17 with 24 cores,
512GB RAM, and one GPU. There are many CPU-only jobs running on this EP
for weeks at a time, and there are usually one or two GPU jobs as well.Â
The CPU-only jobs may take weeks to finish, so sadly a GPU job may have
to wait weeks to start. I would like GPU jobs to not have to wait so long.
Is there a way I could reserve say 1 core and 128GB of RAM for GPU jobs,
and only GPU jobs, on this EP thus letting CPU-only jobs continue to run
on the other 23 cores and 384GB of RAM?
I have been trying to do this with static slots but have not figured out
how to make a slot that has the GPU as a resource and will NOT run
CPU-only jobs.
I should also mention that we don't use preemtion and really don't want
to use it as it doesn't work well with our pipeline. I would also
rather not ask our users to add a ClassAd to their submit scripts (e.g.
+IsGPUJob), but if that is the only way, then so be it.
Thanks
--
K. Scott Rowe -- Science Information Services
Science Operations Center, National Radio Astronomy Observatory
1011 Lopezville Socorro, NM 87801
krowe@xxxxxxxx -- 1.575.835.7193 -- www.nrao.edu