there sems to be a little something missing somewhere ;) 
        
        
        
        I had similar problems when we started to use GPUs, the
          cause was an individual configuration overwriting the feature
          config. 
        
        
        
        What does condor_config_val say, it should look somehow
          similar to this: 
        
        
        
        
          [root@batchg003 ~]# condor_config_val -dump | grep -i gpu
            ENVIRONMENT_FOR_AssignedGPUs =
            GPU_DEVICE_ORDINAL=/(CUDA|OCL)//Â CUDA_VISIBLE_DEVICES
            ENVIRONMENT_VALUE_FOR_UnAssignedGPUs = 10000
            MACHINE_RESOURCE_INVENTORY_GPUs =
            $(LIBEXEC)/condor_gpu_discovery -properties
            $(GPU_DISCOVERY_EXTRA)
            SLOT_TYPE_1 = GPUs=1, CPUs=2
            SLOT_WEIGHT = GPUs
            START = (NODE_IS_HEALTHY =?= True) && (StartJobs =?=
            True) && TARGET.RequestGpus &&
            (RequestRuntime <= 12000)
            STARTD_CRON_GPUs_MONITOR_EXECUTABLE =
            $(LIBEXEC)/condor_gpu_utilization
            STARTD_CRON_GPUs_MONITOR_METRICS = SUM:GPUs, PEAK:GPUsMemory
            STARTD_CRON_GPUs_MONITOR_MODE = WaitForExit
            STARTD_CRON_GPUs_MONITOR_PERIOD = 1
            STARTD_CRON_JOBLIST = NODEHEALTH GPUs_MONITOR GPUs_MONITOR
            
          
          Best
          
          Christoph
          
          
          
         
        
        
        
          -- 
          Christoph Beyer
          DESY Hamburg
          IT-Department
          
          Notkestr. 85
          Building 02b, Room 009
          22607 Hamburg
          
          phone:+49-(0)40-8998-2317
          mail: 
christoph.beyer@xxxxxxx
        
        
        
        
        
        
        Hello
            everyone,
            I made a task where there was only "condor_gpu_discovery
            -extra" and the output was only "DetectedGPUs = 0". However,
            when I execute the command manually, it returns:
            
            ÂC: \> condor_gpu_discovery -extra
            DetectedGPUs = "CUDA1"
            CUDACapability = 1.2
            CUDAClockMhz = 1402.00
            CUDAComputeUnits = 2
            CUDACoresPerCU = 8
            CUDADeviceName = "GeForce 210"
            CUDADevicePciBusId = "0000: 05: 00.0"
            CUDADeviceUuid = "00000000-0000-0000-0000-000000000000"
            CUDADriverVersion = 6.50
            CUDAECCEnabled = false
            CUDAGlobalMemoryMb = 1024
            CUDARuntimeVersion = 10.20
            
            So in the configuration context, condor_gpu_discovery does
            not have access to any GPU information.
            
            Best regards
            Josef
          
          On 2.4.2020 13:34, Josef
            MitlÃhner wrote:
          
           Hi,
            
            lspci | grep -i nvidia
            05:00.0 VGA compatible controller: NVIDIA Corporation GT218
            [GeForce 210] (rev a2)
            
            C:\>condor_status -l mitlohner-w764 | grep -i gpu
            DetectedGPUs = 0
            GPUs = 0
            MachineResources = "Cpus Memory Disk Swap GPUs"
            TotalGPUs = 0
            TotalSlotGPUs = 0
            
            Best regards
            Josef
            
            On 2.4.2020 12:45, Beyer,
              Christoph wrote:
            
            
              
                hmm,
                
                
                
                what does 
                
                
                
                lspci | grep -i nvidia
                
                
                
                say ? 
                
                
                
                condor_Status should look somehow like this: 
                
                
                
                [root@batchg003 ~]# condor_status -l batchg003 |
                  grep -i gpu
                  AssignedGPUs = "CUDA0"
                  DetectedGPUs = 1
                  GPUs = 1
                  MachineResources = "Cpus Memory Disk Swap GPUs"
                  SlotWeight = GPUs
                  Start = (NODE_IS_HEALTHY =?= true) &&
                  (StartJobs =?= true) && TARGET.RequestGpus
                  && (RequestRuntime <= 12000)
                  TotalGPUs = 1
                  TotalSlotGPUs = 1
                  [root@batchg003 ~]# condor_status -l batchg003 | grep
                  -i cuda
                  AssignedGPUs = "CUDA0"
                  CUDACapability = 6.1
                  CUDADeviceName = "GeForce GTX 1080 Ti"
                  CUDADevicePciBusId = "0000:65:00.0"
                  CUDADeviceUuid =
                  "3f2d719f-7d89-c75c-1a71-94316a2fcd12"
                  CUDADriverVersion = 10.2
                  CUDAECCEnabled = false
                  CUDAGlobalMemoryMb = 11178
                  
                
                Best
                
                Christoph
                
                
                
                
                  -- 
                  Christoph Beyer
                  DESY Hamburg
                  IT-Department
                  
                  Notkestr. 85
                  Building 02b, Room 009
                  22607 Hamburg
                  
                  phone:+49-(0)40-8998-2317
                  mail: 
christoph.beyer@xxxxxxx
                
                
                
                
                
                
                Hi,
                    thank you for your reply.
                    
                    The result is the same. The only change is (after
                    installing CUDA pagkage) in the "condor_gpu_disovery
                    -properties" listing:
                    
                    DetectedGPUs="CUDA0"
                    CUDACapability=1.2
                    CUDADeviceName="GeForce 210"
                    CUDADevicePciBusId="0000:05:00.0"
CUDADeviceUuid="00000000-0000-0000-0000-000000000000"
                    CUDADriverVersion=6.50
                    CUDAECCEnabled=false
                    CUDAGlobalMemoryMb=1024
                    CUDARuntimeVersion=10.20
                    
                    Thanks for help,
                    Best regards
                    Josef
                  
                  On 2.4.2020 10:24, Beyer,
                    Christoph wrote:
                  
                  
                    
                      Hi,
                      
                      
                      
                      try 
                      
                      @use feature : GPUs
                      
                      @use feature : GPUsMonitor
                      
                      
                      
                      The second one is not mandatory of course but
                        you will want it ;) 
                      
                      
                      install the cuda and nvidia-driver pkgs (I
                        think those cone with the cuda pkg though) 
                      
                      
                      
                      cuda.x86_64
                      
                      
                      
                      Restart the host and check ... 
                      
                      
                      
                      Best
                      
                      christoph
                      
                      
                      
                        -- 
                        Christoph Beyer
                        DESY Hamburg
                        IT-Department
                        
                        Notkestr. 85
                        Building 02b, Room 009
                        22607 Hamburg
                        
                        phone:+49-(0)40-8998-2317
                        mail: 
christoph.beyer@xxxxxxx
                      
                      
                      
                      
                      
                      
                      Hello,
                         when I run
                          the command "condor_gpu_discovery -properties"
                          on my computer it detects one GPU
                          
                          DetectedGPUs="CUDA0"
                          can't open SOFTWARE\NVIDIA Corporation\GPU
                          Computing Toolkit\CUDA
                          CUDACapability=1.2
                          CUDADeviceName="GeForce 210"
                          CUDADevicePciBusId="0000:05:00.0"
CUDADeviceUuid="00000000-0000-0000-0000-000000000000"
                          CUDADriverVersion=6.50
                          CUDAECCEnabled=false
                          CUDAGlobalMemoryMb=1024
                          
                          In condor.config i have a line with "use
                          feature : GPUs"
                          
                          
                          Why does my HTCondor server say (condor_status
                          -l):
                          ...
                          DetectedGPUs = 0
                          ...
                          
                          ?
                          Thank you for reply
                          Josef
                          
                        
                        
                        _______________________________________________
                        HTCondor-users mailing list
                        To unsubscribe, send a message to 
htcondor-users-request@xxxxxxxxxxx
                        with a
                        subject: Unsubscribe
                        You can also unsubscribe by visiting
                        
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
                        
                        The archives can be found at:
                        
https://lists.cs.wisc.edu/archive/htcondor-users/
                       
                    
                    
                    _______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/
                  
                  
                  
                  _______________________________________________
                  HTCondor-users mailing list
                  To unsubscribe, send a message to 
htcondor-users-request@xxxxxxxxxxx
                  with a
                  subject: Unsubscribe
                  You can also unsubscribe by visiting
                  
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
                  
                  The archives can be found at:
                  
https://lists.cs.wisc.edu/archive/htcondor-users/
                 
               
              
              
              _______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/
            
            
            
            
            _______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/
          
          
          
          _______________________________________________
          HTCondor-users mailing list
          To unsubscribe, send a message to
          
htcondor-users-request@xxxxxxxxxxx with a
          subject: Unsubscribe
          You can also unsubscribe by visiting
          
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
          
          The archives can be found at:
          
https://lists.cs.wisc.edu/archive/htcondor-users/