Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [HTCondor-users] Schedd not always appearing on condor_status -schedds
- Date: Thu, 7 Nov 2019 15:47:28 +0000
- From: Todd Tannenbaum <tannenba@xxxxxxxxxxx>
- Subject: Re: [HTCondor-users] Schedd not always appearing on condor_status -schedds
On 11/7/2019 8:10 AM, Stewart Martin-Haugh wrote:
> Hi,
>
> We noticed that when querying condor_status -schedds
>
On the same machine where "condor_status -schedd" gives the inconsistent
results, what does
condor_config_val -v collector_host
say? Does it list more than one central manager?
The most common reason I've seen behavior like the below is when sites
have two or more central managers configured (i.e. HAD, for high
availability), and yet the daemon in question (in this case schedd
condor-ce01) is configured to only report to one central manager instead
of both. When you do "condor_status", it will query one of the central
managers at random (load balance), resulting in 50% of the time you see
it, 50% you dont...
Hope the above helps,
Todd
> the condor-ce doesn't always appear - if you do it in quick succession I
> would say only about 50% of the time.
>
> e.g.
>
> Name                 Machine
> ÂRunningJobs  IdleJobs  HeldJobs
>
> arc-ce01................ Â Â Â Â Â Â arc-ce01................
> Â Â Â Â Â3235 Â Â Â Â617 Â Â Â Â Â0
> arc-ce02................ Â Â Â Â Â Â arc-ce02................
> Â Â Â Â Â3210 Â Â Â Â398 Â Â Â Â Â0
> arc-ce03................ Â Â Â Â Â Â arc-ce03................
> Â Â Â Â Â3372 Â Â Â Â525 Â Â Â Â Â0
> arc-ce04................ Â Â Â Â Â Â arc-ce04................
> Â Â Â Â Â2697 Â Â Â Â921 Â Â Â Â Â0
> arc-ce05................ Â Â Â Â Â Â arc-ce05................
> Â Â Â Â Â3116 Â Â Â Â743 Â Â Â Â Â0
> condor-ce01................ Â Â Â Â Âcondor-ce01................
> Â Â Â Â Â Â0 Â Â Â Â Â0 Â Â Â Â Â0
>
> vs.
> Name                 Machine
> ÂRunningJobs  IdleJobs  HeldJobs
>
> arc-ce01................ Â Â Â Â Â Â arc-ce01................
> Â Â Â Â Â3235 Â Â Â Â617 Â Â Â Â Â0
> arc-ce02................ Â Â Â Â Â Â arc-ce02................
> Â Â Â Â Â3210 Â Â Â Â398 Â Â Â Â Â0
> arc-ce03................ Â Â Â Â Â Â arc-ce03................
> Â Â Â Â Â3372 Â Â Â Â525 Â Â Â Â Â0
> arc-ce04................ Â Â Â Â Â Â arc-ce04................
> Â Â Â Â Â2697 Â Â Â Â921 Â Â Â Â Â0
> arc-ce05................ Â Â Â Â Â Â arc-ce05................
> Â Â Â Â Â3116 Â Â Â Â743 Â Â Â Â Â0
>
> Is this a known problem?
>
> Cheers,
> Stewart
>
> _______________________________________________
> HTCondor-users mailing list
> To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
> subject: Unsubscribe
> You can also unsubscribe by visiting
> https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
>
> The archives can be found at:
> https://lists.cs.wisc.edu/archive/htcondor-users/
>