Hi Maarten,
I messed up the flag for turning on debugging for condor_submit. It is supposed to be -debug:D_ALWAYS,D_SECURITY. The subtle difference is there is no space between the levels and the flag but rather a colon. With that in mind just setting D_SECURITY should
produce the output that is helpful.
-Cole Bollig
From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Cole Bollig via HTCondor-users <htcondor-users@xxxxxxxxxxx>
Sent: Thursday, February 13, 2025 8:15 AM To: htcondor-users@xxxxxxxxxxx <htcondor-users@xxxxxxxxxxx> Cc: Cole Bollig <cabollig@xxxxxxxx> Subject: Re: [HTCondor-users] v24.0.4 condor_submit only works sometimes
Hi Maarten,
Just to confirm you see this issue only on a V24.0.x release and things work accordingly on a v24.x and v23.x release? Do you see any tokens listed if you run condor_token_list as a user and not root? Would you be willing to try the failed submission with higher
debugging levels: -debug D_ALWAYS,D_SECURITY:2
-Cole Bollig
From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Maarten Litmaath via HTCondor-users <htcondor-users@xxxxxxxxxxx>
Sent: Tuesday, February 11, 2025 11:59 AM To: geonmo@xxxxxxxxxxx <geonmo@xxxxxxxxxxx>; htcondor-users@xxxxxxxxxxx <htcondor-users@xxxxxxxxxxx> Cc: Maarten Litmaath <Maarten.Litmaath@xxxxxxx> Subject: Re: [HTCondor-users] v24.0.4 condor_submit only works sometimes
Hi again,
FYI, I tried to replicate with v24.x what has been working OK for v23.x.
The use of "/tmp" as account home directory has some advantages,
but I have also been able to reproduce the problem for an account
with a normal home directory under "/home":
======================================================================
[mytest@htc24s-ce ~]$ condor_ping -debug -verbose -type schedd WRITE
02/11/25 18:48:12 recognized WRITE as authorization level, using command 60021.
02/11/25 18:48:12 Can't open directory "/etc/condor/passwords.d" as PRIV_ROOT, errno: 13 (Permission denied)
02/11/25 18:48:12 Can't open directory "/etc/condor/passwords.d" as PRIV_ROOT, errno: 13 (Permission denied)
Destination: local schedd
Remote Version: $CondorVersion: 24.0.4 2025-02-02 BuildID: 784178 PackageID: 24.0.4-1 GitSHA: c93a1052 $
Local Version: $CondorVersion: 24.0.4 2025-02-02 BuildID: 784178 PackageID: 24.0.4-1 GitSHA: c93a1052 $
Session ID: htc24s-ce:26121:1739296092:393
Instruction: WRITE
Command: 60021
Encryption: AES
Integrity: AES
Authenticated using: FS
All authentication methods: FS,TOKEN,KERBEROS,SCITOKENS
Remote Mapping: mytest@xxxxxxx
Authorized: TRUE
======================================================================
[mytest@htc24s-ce ~]$ for i in `seq 100`; do condor_submit -debug D_SECURITY my-test.jdl || break; echo == $i; sleep 1; done
[...]
== 75
Submitting job(s)02/11/25 18:47:47 Can't open directory "/etc/condor/passwords.d" as PRIV_ROOT, errno: 13 (Permission denied)
02/11/25 18:47:47 Can't open directory "/etc/condor/passwords.d" as PRIV_ROOT, errno: 13 (Permission denied)
.
1 job(s) submitted to cluster 136.
== 76
Submitting job(s)02/11/25 18:47:48 Can't open directory "/etc/condor/passwords.d" as PRIV_ROOT, errno: 13 (Permission denied)
02/11/25 18:47:48 Can't open directory "/etc/condor/passwords.d" as PRIV_ROOT, errno: 13 (Permission denied)
.
ERROR: Failed to commit job submission into the queue.
ERROR: Failed to create new User record for condor@xxxxxxxx
======================================================================
This problem has the "hallmark" of a race condition...
From: geonmo@xxxxxxxxxxx <geonmo@xxxxxxxxxxx> on behalf of "류건모" <geonmo@xxxxxxxxxxx>
Sent: Tuesday, February 11, 2025 7:25 AM To: htcondor-users@xxxxxxxxxxx <htcondor-users@xxxxxxxxxxx> Cc: Maarten Litmaath <Maarten.Litmaath@xxxxxxx> Subject: RE: Re: [HTCondor-users] v24.0.4 condor_submit only works sometimes Hello, Maarten.
The core of the problem seems to be that FS authentication is not working properly and the user is authenticated as “condor@xxxxxxx”.
Could you please check the condor_ping information as user alicesgm? ---- condor_ping -debug -verbose -type schedd WRITE
.... Authenticated using: FS All authentication methods: TOKEN,FS .... -------
First, check the mount option and permissions information sharing in the /tmp directory, it may be that the alicesgm account is unable to write to /tmp or SELinux issue.
If you suspect SELinux, check the information below to see if you missed anything.
[root@ui20 tmp]# semanage permissive -l
Builtin Permissive Types
condor_negotiator_t condor_master_t condor_collector_t condor_procd_t condor_startd_t condor_schedd_t
As I know, absence of condor_schedd_t can cause SELinux to fail because actions not registered with permissive can be blocked.
Also, could you check that the account have an idtokens issued as "condor@xxxxxxx"?
Similarly, you can check by doing a condor_token_list on the alicesgm account.
Regards,
-- Geonmo
────── 원본 메일 ──────
|