[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Issues regarding use of NVIDIA MIGs with HTCondor



For (4), I'll defer to the GPU experts on the list.
	(4) is a known problem.  We've been working on higher-priority 
improvements to our GPU support, much of which will appear in the 
upcoming 9.8.0 release.
	For work-arounds, if the MIG GPU(s) are the only ones on the 
system, it's fairly easy for the start to enforce that only one GPU is 
given to each job (START = $(START) && RequestGPUs <= 1).  It becomes much 
more complicated if you have non-MIG GPUs mixed in (you have to isolate 
the MIG GPUs to their own partitionable slot).
	With 9.8.0, you should also be able to have the 
multi-GPU-requesting jobs specify that they require non-MIG GPUs, 
although the required expressions may be clumsy.
- ToddM