Hi all,
We recently updated our HTCondor-CEs to HTC 25.0.2 /HTC-CE 25.0.1 but had to roll back since the JobRouter kept dying [0] whenever it tried to transform a CMS job. For some reason, the JobRouter tried to run as (?) a user that simply does not exist on our system, possibly the submit user on the remote machine (I assume âgfactoryâ stands for GlideIn factory).
Iâve traced this ghost user account back to the Job attribute âOsUserâ. Apparently many jobs have it, but only for CMS itâs different from the Owner that we map jobs to. Iâm not familiar with this attribute, and itâs not documented; all I could find was a commit for HTC 25.
Why does the JobRouter try to access this user? What does the OSUser attribute do? And critically, can we overwrite it to fix this?
Cheers,
Max
[0]
10/23/25 08:54:40 passwd_cache::cache_uid(): getpwnam("gfactory") failed: user not found
10/23/25 08:54:40 gfactory not in passwd file
10/23/25 08:54:40 Failed in init_user_ids(gfactory,(null))
10/23/25 08:54:40 WriteUserLog::initialize: init_user_ids(pcms02) failed!
10/23/25 08:54:40 passwd_cache::cache_uid(): getpwnam("gfactory") failed: user not found
10/23/25 08:54:40 gfactory not in passwd file
10/23/25 08:54:40 Failed in init_user_ids(gfactory,(null))
[1]
https://github.com/htcondor/htcondor/commit/34ca97e2960306ed2d75deae6710d1e77e9ef097
Attachment:
smime.p7s
Description: S/MIME cryptographic signature