[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Authentication Issue between HTCondorCE Schedd and Batch Schedd



Hi Cole,

Sorry that was on my first post but I forgot to add it to the second Iâm bleeding edge:

This is Condor CE 25.7.0 on AlmaLinux 10:

# condor_ce_version
$HTCondorCEVersion: 25.7.0 $
$CondorVersion: 25.8.2 2026-04-15 BuildID: 896299 PackageID: 25.8.2-1 GitSHA: 37a36d5f $
$CondorPlatform: x86_64_AlmaLinux10 $

Thanks,
Chris.

On 07/05/2026, 19:00, "Cole Bollig" <cabollig@xxxxxxxx> wrote:

Hi Chris,

What version of CE and backing Schedd are you running?

-Cole Bollig

From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Chris Brew - STFC UKRI via HTCondor-users <htcondor-users@xxxxxxxxxxx>
Sent: Thursday, May 7, 2026 9:00 AM
To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Cc: Chris Brew - STFC UKRI <chris.brew@xxxxxxxxxx>
Subject: [HTCondor-users] Authentication Issue between HTCondorCE Schedd and Batch Schedd
 
Hi,

Iââve still not got anywhere with the VOMS authentication (Iâll post some more info soon), but Token auth seems to be working in that Jobs get into the condor-ce Schedd and are visible with condor_ce_qâ however they donât make it as far as the Schedd for the batch system.

I just copied the config of that from the config of the Schedd on our existing ArcCEs so itâs possibly itâs missing some necessary config for accepting Jobs from the Job_Router.

Iâve got three recurring errors. One in the /var/log/condor-ce/JobRouterLog:

05/07/26 14:44:29 Failed to commit job submission :
05/07/26 14:44:29 JobRouter failure (src="" failed to submit job

Which is matched with this one in /var/log/condor/SchedLog:

05/07/26 14:44:29 (pid:923597) (bt:ccbf:13) SetEffectiveOwner: UserRec lookup for owner condor@xxxxxxxxxxx found no match
05/07/26 14:44:29 (pid:923597) Owner condor@xxxxxxxxxxx has no JobQueueUserRec
05/07/26 14:44:29 (pid:923597) Creating pending JobQueueUserRec for owner condor@xxxxxxxxxxx
05/07/26 14:44:29 (pid:923597) Error: MakeUserRec with illegal identifiers: user=condor@xxxxxxxxxxx, os_user=condor
05/07/26 14:44:29 (pid:923597) NewCluster(): failed to create new User record for condor@xxxxxxxxxxx

And then another more frequent one every ten seconds in /var/log/condor-ce/JobRouterLog:

05/07/26 14:47:09 Failed to open /var/lib/condor/spool/job_queue.log: errno=13

Which looks to me like the JobRouter is trying to put jobs into the queue as (the illegal) user condor rather than the accounts the tokens are mapped to in the condor-ce Schedd (they show up there as the correctly mapped local user).

Does anyone have any idea where I should be looking?

Thanks,
Chris.