[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] Condor and KVM: cannot connect to qemu:///session



Hi everyone,

I'm having a problem getting Condor to start up a KVM virtual machine in Condor. I posted an email before, and with advice from a few people, I was able to sort out my KVM problems. But now, whenever I run a vm universe job, the condor_vm-gahp fails with the following error:

12/07 16:18:12 ** condor_vm-gahp (CONDOR_VM_GAHP) STARTING UP
12/07 16:18:12 ** /afs/nd.edu/user37/condor/software/versions/amd64-redhat5/condor-7.4.2-dynamic/sbin/condor_vm-gahp
12/07 16:18:12 ** SubsystemInfo: name=VM_GAHP type=GAHP(9) class=DAEMON(1)
12/07 16:18:12 ** Configuration: subsystem:VM_GAHP local:<NONE> class:DAEMON
12/07 16:18:12 ** $CondorVersion: 7.4.2 Mar 29 2010 BuildID: 227044 $
12/07 16:18:12 ** $CondorPlatform: X86_64-LINUX_RHEL5 $
12/07 16:18:12 ** PID = 13583
12/07 16:18:12 ** Log last touched 12/7 16:18:10
12/07 16:18:12 ******************************************************
12/07 16:18:12 Using config source: /afs/nd.edu/user37/condor/condor_config
12/07 16:18:12 Using local config sources:
12/07 16:18:12    /afs/nd.edu/user37/condor/software/config/machines/dqcneh100.local
12/07 16:18:12 DaemonCore: Command Socket at <10.32.72.74:9118>
12/07 16:18:12 VMGAHP[13583]: VM-GAHP initialized with run-mode 3
12/07 16:18:12 VMGAHP[13583]: Initial UID/GUID=0/0, EUID/EGUID=126019/1313, Condor UID/GID=108172,40
12/07 16:18:12 VMGAHP[13583]: Initialize Uids: caller=root, job user=rjansen
12/07 16:18:18 ERROR "Failed to create libvirt connection: could not connect to qemu:///session" at line 989 in file xen_type.cpp

Now, I have adjusted /etc/libvirt/libvirt.conf to allow the libvirt group to access the libvirt rw socket, and I added the users root, rjansen, and condor to that group.

Additionally, I can connect just fine (as root and rjansen) to qemu:///session, through virsh, and through the libvirt C library using example code from the qemu website. In fact, the code I use to connect to the library in the example program is essentially the same as the code on line 989 in xen_type.cpp, which is failing.

I'm not sure if I'm doing something wrong with Condor or something wrong with KVM/libvirt, but I'd like to get this working.

Does anyone have any ideas on how to fix this problem?

Thanks,
Ryan