Hi Todd,
Thank you very much for checking this.
Here you have the condor_status output for gpu03, running condor 23.10.1-1.el9 currently:
# condor_status -cons 'Machine=="
gpu03.pic.es"' -af:h Name SlotType GPUs DetectedGpus AssignedGpus JobStatus
Name         SlotType   ÂGPUs DetectedGpus        AssignedGpus JobStatus
slot1@xxxxxxxxxxxx  Partitionable 0  ÂGPU-c659279d, GPU-c659279d undefined  Âundefined
slot1_1@xxxxxxxxxxxx Dynamic    0  ÂGPU-c659279d, GPU-c659279d undefined  Âundefined
slot1_2@xxxxxxxxxxxx Dynamic    0  ÂGPU-c659279d, GPU-c659279d undefined  Âundefined
slot1_3@xxxxxxxxxxxx Dynamic    0  ÂGPU-c659279d, GPU-c659279d undefined  Âundefined
slot1_4@xxxxxxxxxxxx Dynamic    0  ÂGPU-c659279d, GPU-c659279d undefined  Âundefined
slot1_5@xxxxxxxxxxxx Dynamic    0  ÂGPU-c659279d, GPU-c659279d undefined  Âundefined
slot1_6@xxxxxxxxxxxx Dynamic    0  ÂGPU-c659279d, GPU-c659279d undefined  Âundefined
slot1_7@xxxxxxxxxxxx Dynamic    0  ÂGPU-c659279d, GPU-c659279d undefined  Âundefined
slot1_8@xxxxxxxxxxxx Dynamic    0  ÂGPU-c659279d, GPU-c659279d undefined  Âundefined
slot2@xxxxxxxxxxxx  Partitionable 0  ÂGPU-c659279d, GPU-c659279d GPU-c659279d undefined
slot2_1@xxxxxxxxxxxx Dynamic    1  ÂGPU-c659279d, GPU-c659279d GPU-c659279d undefined
We can compare with a brother machine, gpu02, still running condor 23.0.10-1.el9:
# condor_status -cons 'Machine=="
gpu02.pic.es"' -af:h Name SlotType GPUs DetectedGpus AssignedGpus JobStatus
Name         SlotType   ÂGPUs DetectedGpus        AssignedGpus       ÂJobStatus
slot1@xxxxxxxxxxxx  Partitionable 0  ÂGPU-0f8a8574, GPU-0f8a8574 undefined         undefined
slot1_1@xxxxxxxxxxxx Dynamic    0  ÂGPU-0f8a8574, GPU-0f8a8574 undefined         undefined
slot1_2@xxxxxxxxxxxx Dynamic    0  ÂGPU-0f8a8574, GPU-0f8a8574 undefined         undefined
slot1_3@xxxxxxxxxxxx Dynamic    0  ÂGPU-0f8a8574, GPU-0f8a8574 undefined         undefined
slot1_4@xxxxxxxxxxxx Dynamic    0  ÂGPU-0f8a8574, GPU-0f8a8574 undefined         undefined
slot1_5@xxxxxxxxxxxx Dynamic    0  ÂGPU-0f8a8574, GPU-0f8a8574 undefined         undefined
slot1_6@xxxxxxxxxxxx Dynamic    0  ÂGPU-0f8a8574, GPU-0f8a8574 undefined         undefined
slot1_7@xxxxxxxxxxxx Dynamic    0  ÂGPU-0f8a8574, GPU-0f8a8574 undefined         undefined
slot1_8@xxxxxxxxxxxx Dynamic    0  ÂGPU-0f8a8574, GPU-0f8a8574 undefined         undefined
slot2@xxxxxxxxxxxx  Partitionable 0  ÂGPU-0f8a8574, GPU-0f8a8574 GPU-0f8a8574,GPU-0f8a8574 undefined
slot2_1@xxxxxxxxxxxx Dynamic    1  ÂGPU-0f8a8574, GPU-0f8a8574 GPU-0f8a8574       Âundefined
slot2_2@xxxxxxxxxxxx Dynamic    1  ÂGPU-0f8a8574, GPU-0f8a8574 GPU-0f8a8574       Âundefined
So, even though there are two DetectedGpus, the slot2@gpu03 only shows 1 GPU on AssignedGpus. The SLOT definitions are:
[root@gpu03 ~]# condor_config_val SLOT_TYPE_1 SLOT_TYPE_2
cpus=8, gpus=0, auto
cpus=4, gpus=100%, auto
I'm draining gpu03 so I can do more testing if needed.
Thank you again.
Cheers,
Carles