Hello Cole.
Adding TRUST_DOMAIN to the EP config file helped i.e. more successful actions in the EP StartLog but I still see errors:
I guess I can safely ignore this:
Running: /usr/bin/docker container prune -f --filter=label=org.htcondorproject=True
08/25/23 07:53:05 Failed to read results from '/usr/bin/docker container prune -f
But:
08/25/23 08:22:59 Token requested not yet approved; please ask collector
bench12.timehole.org admin to approve request ID 2446816.
And:
08/25/23 08:26:01 SECMAN: FAILED: Received "DENIED" from server for user condor_pool@ using method IDTOKENS.
08/25/23 08:26:01 ERROR: SECMAN:2010:Received "DENIED" from server for user condor_pool@ using method IDTOKENS.
08/25/23 08:26:01 Collector update failed; will try to get a token request for trust domain 192.168.1.12, identity (default).
08/25/23 08:26:01 Failed to start non-blocking update to <192.168.1.12:9618>.
08/25/23 08:26:01 Trying token request to remote host
bench12.timehole.org for user (default).
08/25/23 08:26:01 SECMAN: command 60047 DC_START_TOKEN_REQUEST to collector
bench12.timehole.org from TCP port 46127 (blocking).
08/25/23 08:26:01 SECMAN: using session bench12:1603:1692965161:95 for {<192.168.1.12:9618?alias=bench12.timehole.org>,<60047>}.
08/25/23 08:26:01 SECMAN: resume, NOT reauthenticating.
08/25/23 08:26:01 SECMAN: Server rejected our session id
08/25/23 08:26:01 SECMAN: Invalidating negotiated session rejected by peer
08/25/23 08:26:01 DC_INVALIDATE_KEY: removed key id bench12:1603:1692965161:95.
08/25/23 08:26:01 Failed to request a new token: DAEMON:1:failed to start command for token request with remote daemon at '<192.168.1.12:9618?alias=bench12.timehole.org>'.|SECMAN:2004:Server
rejected our session id
I tried to (auto) approve the token on the central manager, bench12, with:
$ condor_token_request_approve -reqid 2446816
Remote daemon did not provide information for request ID 2446816.
Or:
$ condor_token_request_auto_approve -lifetime 3600 -netblock 192.168.1.0/24
Failed to create new auto-approval rule: SECMAN:2010:Received "DENIED" from server for user
justin@xxxxxxxxxxxxxxxxxxxx using method FS.
Or sudo:
$ sudo condor_token_request_auto_approve -lifetime 3600 -netblock 192.168.1.0/24
[sudo] password for justin:
Failed to create new auto-approval rule: SECMAN:2010:Received "DENIED" from server for user condor_pool@ using method IDTOKENS.
On both the EP and central manager:
$ condor_config_val TRUST_DOMAIN
192.168.1.12
Is this TRUST_DOMAIN confusion caused by not using FQDN during installations? Which is better, names (resolved via hosts files only) or ip addresses?
Is this a normal amount of installation woes or have I steered off the rails somehow? I will have to install again to create the ârealâ cluster.
Thanks very much!
JK
|