[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] Strange issue with condor_reconfig, shared_port and condor 24.0.7



Dear all,

We are in the process of migratingÂall our resources to HTCondor 24.0.7 from 23.0.22. Everything works in general, the CMs and the main schedd are already updated, and we are migrating the startdsÂgradually to 24.0.7.Â

However, we found a strange issue. We change from the CM some options ofÂtheÂstartd using commands like:

condor_config_val -n wn-name -startd -set option=false
condor_reconfig -name wn-name -startd

But, in some cases, I obtain this error when running the condor_reconfig after an update to 24.0.7:

ERROR
SECMAN:2011:Connection closed during command authorization. Probably due to an unknown command.
Can't send Reconfig command to startdÂwn-name

And in the SharedPortLog:Â

05/13/25 15:25:16 SharedPortServer: server was busy, failed to connect startd_1082928_1cc5 as requested by TOOL on <[IP]:38373>: primary (<cookie>/startd_1082928_1cc5): Connection refused (111); alt (/var/lock/condor/daemon_sock/startd_1082928_1cc5): Connection refused (111)
[...]

After a while or some restarts (I don't know), it appears to be solved:

Sent "Reconfig" command to startd twn-test

Do you have any hints about what's happening?

Thank you very much.Â

Cheers,

Carles


--
Carles Acosta i Silva
PIC (Port d'Informacià CientÃfica)
Campus UAB, Edifici D
E-08193 Bellaterra, Barcelona
Tel: +34 93 581 33 08
Fax: +34 93 581 41 10
AvÃs - Aviso - Legal Notice: Âhttp://legal.ifae.es