Hi Justin,
EP is short for Execution Point. An execution point in HTCondor is a host designated to run jobs. this host runs the startd.
Yes, you could add the increased log debugging level in any configuration file like the one under your config directory (/etc/condor/config.d/xx-myconfig.config).
The format for this is <daemon>_DEBUG = D_SECURITY. So, for your case it would be COLLECTOR_DEBUG on the host with the collector and STARTD_DEBUG on the host with the new startd.
I believe that the condor_status -direct failed because it tried to get information about the startd, and that doesn't exist since the startd is failing to authenticate with the collector and subsequently not sending any ads.
I don't think you need to restart at this point, but rather add the security debugging to the collector and startd to get more information about why the authentication is failing.
-Cole Bollig
From: Justin Killebrew <jk@xxxxxxx>
Sent: Wednesday, August 23, 2023 7:43 AM To: Cole Bollig <cabollig@xxxxxxxx> Cc: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx> Subject: Re: [HTCondor-users] StartLog: Failed to authenticate Hi Cole. Thanks for suggestions but I have more novice questions:
What does EP stand for?
How do I add D_SECURITY for the collector and the EP startd? Just add them to a config file like /etc/condor/config.d/xx-myconfig.config?
On the central manager, when I run _condor_TOOL_DEBUG=“D_SECURITY” condor_status -debug -direct <hostname> I see the error:
08/23/23 08:12:03 Can't find address for startd
bench5.timehole.org
Error: Failed to locate startd
bench5.timehole.org
Can't find address for startd
bench5.timehole.org
So the central manager can’t see the execute node, bench5, but it’s in the hosts file and can ping from the command line. How is condor resolving names?
Should I start over?! Is this much configuration trouble typical for a fresh, clean install?
Thanks,
JK
On Aug 21, 2023, at 10:26 AM, Cole Bollig <cabollig@xxxxxxxx> wrote:
|