
[HTCondor-users] Unable to run VM



Hello,
I am running HTCondor 24.11.2 on Debian 13 (Trixie), and it seems that
my EP is unable to match a VM job.

I have attached the submit file, the condor_q -better-analyze output,
and the condor_config.local of the EP.

Can you help me?


Thanks
Valerio


DAEMON_LIST = MASTER STARTD SHARED_PORT
USE_SHARED_PORT = TRUE
COLLECTOR_HOST = $(CONDOR_HOST)

ALLOW_READ = *
ALLOW_READ = $(ALLOW_READ), 10.10.*, 192.168.*
ALLOW_WRITE = *
ALLOW_WRITE = $(ALLOW_WRITE), 10.10.*, 192.168.*
ALLOW_WRITE = $(ALLOW_WRITE), *@$(HOSTNAME)
ALLOW_ADMINISTRATOR = *
ALLOW_CONFIG = *
ALLOW_NEGOTIATOR = 10.10.*, 192.168.*
ALLOW_ADVERTISE_MASTER = *
ALLOW_ADVERTISE_STARTD = *
ALLOW_ADVERTISE_SCHEDD = *
ALLOW_NEGOTIATOR_SCHEDD = $(COLLECTOR_HOST)
ALLOW_WRITE_COLLECTOR = $(ALLOW_WRITE)
ALLOW_WRITE_STARTD    = $(ALLOW_WRITE)
ALLOW_READ_COLLECTOR  = $(ALLOW_READ)
ALLOW_READ_STARTD     = $(ALLOW_READ)
ALLOW_CLIENT = *

VM_GAHP_SERVER = /usr/sbin/condor_vm-gahp
VM_GAHP_LOG = $(LOG)/VMGahpLog
VMGAHP_USER_UID=0
VM_TYPE = kvm
VM_MEMORY = 30000
VM_MAX_NUMBER = 4
VM_NETWORKING = TRUE
VM_NETWORKING_TYPE = bridge
VM_NETWORKING_BRIDGE_INTERFACE = eno1

NUM_SLOTS = 1
SLOT_TYPE_1 = 100%
SLOT_TYPE_1_PARTITIONABLE = TRUE
NUM_SLOTS_TYPE_1 = 1
RANK = ImageSize

START = True
SUSPEND = False
PREEMPT = False

universe = vm
vm_type = kvm
transfer_executable = false
vm_memory = 8192M
vm_networking = true
vm_networking_type = bridge
vm_disk = /opt/vmimages/www2.img:vda:w:raw
executable = /opt/vmimages/www2.img
#log = /var/log/condor/www2vm.log
Requirements = ( Machine == "ep1ext.sel" )
request_cpus = 4
request_memory = 8192M
request_disk = 5000M
queue

$ condor_q -better-analyze


-- Schedd: t450.sel : <10.10.0.47:9618?...
The Requirements expression for job 233.000 is

    ((Machine == "ep1ext.sel")) && (TARGET.Arch == "X86_64") && (TARGET.HasVM is true) && (TARGET.VM_Type == MY.JobVMType) && (TARGET.VM_AvailNum > 0) &&
    (TARGET.Disk >= RequestDisk) && (TARGET.TotalMemory >= MY.JobVMMemory) && (TARGET.VM_Memory >= MY.JobVMMemory) && TARGET.VM_Networking &&
    stringListIMember(JobVMNetworkingType,TARGET.VM_Networking_Types,",") && (TARGET.Cpus >= RequestCpus) && (TARGET.HasFileTransfer)

    [0]    : Machine == "ep1ext.sel"
    [1]    : TARGET.Arch == "X86_64"
    [2]    : [0] && [1]
    [3]    : TARGET.HasVM is true
    [4]    : [2] && [3]
    [5]    : TARGET.VM_Type == MY.JobVMType
    [6]    : [4] && [5]
    [7]    : TARGET.VM_AvailNum > 0
    [8]    : [6] && [7]
    [9]    : TARGET.Disk >= RequestDisk
    [10]   : [8] && [9]
    [11]   : TARGET.TotalMemory >= MY.JobVMMemory
    [12]   : [10] && [11]
    [13]   : TARGET.VM_Memory >= MY.JobVMMemory
    [14]   : [12] && [13]
    [15]   : TARGET.VM_Networking
    [16]   : [14] && [15]
    [17]   : stringListIMember(JobVMNetworkingType,TARGET.VM_Networking_Types,",")
    [18]   : [16] && [17]
    [19]   : TARGET.Cpus >= RequestCpus
    [20]   : [18] && [19]
    [21]   : TARGET.HasFileTransfer
    [22]   : [20] && [21]

Job 233.000 defines the following attributes:

    JobVMMemory = 8192
    JobVMNetworkingType = "bridge"
    JobVMType = "kvm"
    RequestCpus = 4
    RequestDisk = 5120000 (kb)

slot1@xxxxxxxxxx has the following attributes:

    TARGET.Arch = "X86_64"
    TARGET.Cpus = 12
    TARGET.Disk = 1794578092 (kb)
    TARGET.HasFileTransfer = true
    TARGET.HasVM = true
    TARGET.Machine = "ep1ext.sel"
    TARGET.TotalMemory = 32136

The Requirements expression for job 233.000 reduces to these conditions:

        Slots
Step   Matched  Condition
----- --------- ---------
[0]           1  Machine == "ep1ext.sel"
[1]           1  TARGET.Arch == "X86_64"
[3]           1  TARGET.HasVM is true
[5]           0  TARGET.VM_Type == MY.JobVMType
[7]           0  TARGET.VM_AvailNum > 0
[13]          0  TARGET.VM_Memory >= MY.JobVMMemory
[15]          0  TARGET.VM_Networking
[17]          0  stringListIMember(JobVMNetworkingType,TARGET.VM_Networking_Types,",")


233.000:  Run analysis summary ignoring user priority.  Of 1 slots on 1 machines,
      1 slots are rejected by your job's requirements
      0 slots reject your job because of their own requirements
      0 slots match and are willing to run your job

WARNING:  Be advised:
   No machines matched the job's constraints
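
As a reading aid for the analysis above, here is a minimal Python sketch that pulls out the conditions matched by 0 slots and lists the TARGET attributes they reference that do not appear in the slot attribute listing. The step table and attribute names are copied from the output above; the helper names (`failing_conditions`, `missing_target_attrs`) and the constants are hypothetical, and the parsing assumes the exact table layout shown here. The result only suggests that the slot ad may not advertise those VM_* attributes; it has not been confirmed against the actual machine ad.

```python
import re

# Step table copied verbatim from the condor_q -better-analyze output above.
STEP_TABLE = """\
[0]           1  Machine == "ep1ext.sel"
[1]           1  TARGET.Arch == "X86_64"
[3]           1  TARGET.HasVM is true
[5]           0  TARGET.VM_Type == MY.JobVMType
[7]           0  TARGET.VM_AvailNum > 0
[13]          0  TARGET.VM_Memory >= MY.JobVMMemory
[15]          0  TARGET.VM_Networking
[17]          0  stringListIMember(JobVMNetworkingType,TARGET.VM_Networking_Types,",")
"""

# Attribute names the analysis printed for slot1 (hypothetical constant name).
SLOT_ATTRS = {"Arch", "Cpus", "Disk", "HasFileTransfer", "HasVM",
              "Machine", "TotalMemory"}

# One table row: "[step]", slots matched, condition text.
ROW = re.compile(r"^\[(\d+)\]\s+(\d+)\s+(.*)$")


def failing_conditions(table):
    """Return (step, condition) pairs for rows where 0 slots matched."""
    failed = []
    for line in table.splitlines():
        m = ROW.match(line.strip())
        if m and int(m.group(2)) == 0:
            failed.append((int(m.group(1)), m.group(3)))
    return failed


def missing_target_attrs(conditions, slot_attrs):
    """Return TARGET.* names used in the conditions but absent from slot_attrs."""
    names = set()
    for _, cond in conditions:
        names.update(re.findall(r"TARGET\.(\w+)", cond))
    return sorted(names - slot_attrs)


if __name__ == "__main__":
    failed = failing_conditions(STEP_TABLE)
    for step, cond in failed:
        print(step, cond)
    print(missing_target_attrs(failed, SLOT_ATTRS))
```

Every failing clause references a VM_* attribute, while the three passing clauses reference only attributes that the slot listing shows.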