Thanks for this.
Error, does seem to point to some sort of mismatch between the condorâce and condor batch setup, the batch setup has that file root: root and mode 600, and as far as I can tell condor-ce is trying to read it as
condorâ. Setting the mode to 644 gets rid of the error but changes nothing else. SELINUX is disabled.
So Iâm a bit stuck at the moment.
Chris.
On 08/05/2026, 01:49, ""ëêë"" <geonmo@xxxxxxxxxxx> wrote:
Hi Chris,
To address the identity mismatch and Job Router errors you're seeing, I would suggest verifying the following configuration points.
1. Job Router and Permissions
First, please ensure that the Job Router on your
HTCondor-CE is correctly pointing to the local Batch Schedd and has the necessary permissions to hand off jobs. Check if these are explicitly set in your
condor-ce/conf.d:
# HTCondor-CE side
JOB_ROUTER_SCHEDD2_SPOOL = /var/lib/condor/spool
JOB_ROUTER_SCHEDD2_NAME = $(FULL_HOSTNAME)
JOB_ROUTER_SCHEDD2_POOL = $(FULL_HOSTNAME):9618
And on the Local Batch (HTCondor) side, ensure the Schedd allows the CE's routing process to impersonate the job owner:
QUEUE_SUPER_USER_MAY_IMPERSONATE = .*
2. SSL/VOMS Mapping (HTCondor v23 vs. v24)
Regarding the VOMS authentication issues, it is important to note that the way HTCondor handles SSL/Certificate mapping changed significantly between
v23 and v24+.
In newer versions, the default behavior for mapping X.509 certificates has become more strict.
It often requires comparing not just the DN, but also additional attributes like VOMS roles.
This change often requires adding commas or specific formatting in your mapfiles that wasn't necessary before.
You can find the detailed requirements and the new mapping logic in this EGI documentation:
HTCondor and SSL authentication
Please check if your certificate DNs and roles are being mapped correctly under the new version's rules.
3. Authentication Method
Lastly, out of curiosity, is there a specific reason your site is prioritizing
VOMS/SSL over SCITOKENS for this setup?
Since many grid infrastructures are migrating toward tokens, knowing your requirements might help us suggest a more streamlined authentication path.
Hope this helps!
Best regards,
-- Geonmo
ââââââ ìë ëì ââââââ
ëëìë : Chris Brew - STFC UKRI via HTCondor-users <htcondor-users@xxxxxxxxxxx>
ëëìë : HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
ìì : Chris Brew - STFC UKRI <chris.brew@xxxxxxxxxx>
ëìëì : 2026-05-07 (ë) 23:07:29
ìë : [HTCondor-users] Authentication Issue between HTCondorCE Schedd and Batch Schedd
Hi,
Iââve still not got anywhere with the VOMS authentication (Iâll post some more info soon), but Token auth seems to be working in that Jobs get into the condor-ce Schedd and are visible with
condor_ce_q however they donât make it as far as the Schedd for the batch system.
I just copied the config of that from the config of the Schedd on our existing ArcCEs so itâs possibly itâs missing some necessary config for accepting Jobs from the Job_Router.
Iâve got three recurring errors. One in the /var/log/condor-ce/JobRouterLog:
05/07/26 14:44:29 Failed to commit job submission :
05/07/26 14:44:29 JobRouter failure (src="" failed to submit job
Which is matched with this one in /var/log/condor/SchedLog:
05/07/26 14:44:29 (pid:923597) (bt:ccbf:13) SetEffectiveOwner: UserRec lookup for owner condor@xxxxxxxxxxx found no match
05/07/26 14:44:29 (pid:923597) Owner condor@xxxxxxxxxxx has no JobQueueUserRec
05/07/26 14:44:29 (pid:923597) Creating pending JobQueueUserRec for owner condor@xxxxxxxxxxx
05/07/26 14:44:29 (pid:923597) Error: MakeUserRec with illegal identifiers: user=condor@xxxxxxxxxxx, os_user=condor
05/07/26 14:44:29 (pid:923597) NewCluster(): failed to create new User record for condor@xxxxxxxxxxx
And then another more frequent one every ten seconds in /var/log/condor-ce/JobRouterLog:
05/07/26 14:47:09 Failed to open /var/lib/condor/spool/job_queue.log: errno=13
Which looks to me like the JobRouter is trying to put jobs into the queue as (the illegal) user condor rather than the accounts the tokens are mapped to in the condor-ce Schedd (they show up there as the correctly mapped local user).
Does anyone have any idea where I should be looking?
Thanks,
Chris.
![]() |