So GPU memory is being detected and reported.
But since you are using -not-nested in your config, you can't use any of the newer features of HTCondor that depend on the nested GPU properties ads.
-tj
From: Weatherby,Gerard <gweatherby@xxxxxxxx>
Sent: Wednesday, April 24, 2024 10:48 AM To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx> Cc: John M Knoeller <johnkn@xxxxxxxxxxx> Subject: Re: HTCondor not picking up GPU memory?
*** Attention: This is an external email. Use caution responding, opening attachments or clicking on links. ***
tj,
/usr/libexec/condor/condor_gpu_discovery -properties -extra DetectedGPUs="GPU-2b5bf517" Common=[ Capability=7.5; ClockMhz=1590.00; ComputeUnits=40; CoresPerCU=64; DeviceName="Tesla T4"; DevicePciBusId="0000:03:00.0"; DeviceUuid="2b5bf517-8290-ca68-bb9e-eaf4336d1321"; DriverVersion=12.10; ECCEnabled=true; GlobalMemoryMb=14966; MaxSupportedVersion=12010; ] GPU_2b5bf517=[ id="GPU-2b5bf517"; ] nmradmin@neon:~$ /usr/libexec/condor/condor_gpu_discovery -properties -extra --not-nested DetectedGPUs="GPU-2b5bf517" CUDACapability=7.5 CUDAClockMhz=1590.00 CUDAComputeUnits=40 CUDACoresPerCU=64 CUDADeviceName="Tesla T4" CUDADevicePciBusId="0000:03:00.0" CUDADeviceUuid="2b5bf517-8290-ca68-bb9e-eaf4336d1321" CUDADriverVersion=12.10 CUDAECCEnabled=true CUDAGlobalMemoryMb=14966 CUDAMaxSupportedVersion=12010
|