You might be able to fix it by adding
DEFAULT_DOMAIN_NAME=
to your configuration.
You have not said what the -debug output of condor_drain is. I would expect to see messages like:
05/29/25 13:21:17.138 Finding proper daemon name for "condor-execute"
05/29/25 13:21:17.138 Daemon name contains no '@', treating as a regular hostname
then it will either print
05/29/25 13:21:17.138 ipv6_getaddrinfo() could not look up condor-execute: ....
or it will append the default domain name to it and print either
05/29/25 13:21:17.138 Returning daemon name: "condor-execute.<default-domain>"
or
05/29/25 13:21:17.138 Failed to construct daemon name, returning NULL.
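The outcomes described by the messages above can be sketched as a small decision flow. This is only an illustration of one possible reading of those messages, not HTCondor's actual code: the function `build_daemon_name`, its parameters, and the injectable `resolve` callable are all hypothetical, and the assumption that a failed lookup ends with no name is mine.

```python
import socket


def build_daemon_name(name, default_domain="", resolve=socket.getaddrinfo):
    """Hypothetical sketch of the flow suggested by the debug messages.

    Returns a daemon name string, or None where the tool would report
    a lookup failure or "Failed to construct daemon name".
    """
    if "@" in name:
        # Already looks like a full daemon name (e.g. slot1@host); left as-is here.
        return name

    # "Daemon name contains no '@', treating as a regular hostname"
    try:
        resolve(name, None)
    except socket.gaierror:
        # Mirrors the "could not look up <name>" message (assumed to end here).
        return None

    if "." in name:
        # Already fully qualified; nothing to append.
        return name

    if default_domain:
        # Mirrors 'Returning daemon name: "<name>.<default-domain>"'
        return f"{name}.{default_domain}"

    # Mirrors "Failed to construct daemon name, returning NULL."
    return None
```

For example, with a placeholder domain, `build_daemon_name("condor-execute", "example.org")` would yield `condor-execute.example.org` provided the short hostname resolves; with an empty default domain it would yield `None`, which is the case the `DEFAULT_DOMAIN_NAME` setting is meant to address.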
-tj
From: Curtis Spencer <curtis.spencer@xxxxxxxxxxxx>
Sent: Friday, May 30, 2025 1:26 PM
To: John M Knoeller <johnkn@xxxxxxxxxxx>
Cc: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] condor_drain error

What steps can I take to fix that? I'm using the Execute Role on the machine I am trying to drain, and I am able to run jobs on it, so I'm confused about why the Collector wouldn't be able to find a STARTD with that machine's name.
condor_status shows the `condor-execute` machine:
root@condor-master:~# condor_status
Name                 OpSys Arch   State     Activity LoadAv Mem  ActvtyTime
slot1@condor-execute LINUX X86_64 Unclaimed Idle      0.000 3927 0+00:00:00

             Total Owner Claimed Unclaimed Matched Preempting Drain Backfill BkIdle
X86_64/LINUX     1     0       0         1       0          0     0        0      0
       Total     1     0       0         1       0          0     0        0      0

I've tried explicitly setting the MACHINE ClassAd to "condor-execute" and have restarted Condor but I get the same error from `condor_drain`.
Thanks,
Curtis
On Fri, May 30, 2025 at 10:27 AM John M Knoeller <johnkn@xxxxxxxxxxx> wrote: