[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Getting Authentication issues on Execute or random Jobs



Increasing SHARED_PORT_MAX_FILE_DESCRIPTORS may help (the default in later versions was increased to 20000). But if the default of 4096 is causing these errors, it suggests thereâs some deeper problem.

These errors are not related to authentication, but to HTCondorâs machinery that allows multiple daemons (i.e. condor_master, condor_startd) to be contactable via a single TCP port. You could configure HTCondor so that each daemon binds to its own dynamic TCP port (set USE_SHARED_PORT=False), but that has drawbacks (primarily navigating firewalls).

 - Jaime

> On Feb 11, 2026, at 10:26âAM, Ram Ban <ramban046@xxxxxxxxx> wrote:
> 
> Hi,
> 
> I am seeing random Jobs getting restarted due to hit of Lease time even though other jobs are running fine on executor from same submitter, On investigating condor logs I found out these authentication errors(attached in ss)
> 
> Will these be fixed by increasing MAX_FILE_DESCRIPTORS??
> 
> Also I run all my machines in my own private network, can I remove this authentication on Execute at least to remove these random issues?
> 
> I am running on condor version 10.2.0
> Thanks and regards 
> Raman
> 
> <1000083751.png>
>