Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [HTCondor-users] Consistency problems between schedd(s) view and CM?
- Date: Wed, 3 Dec 2025 17:02:45 +0100
- From: Steffen Grunewald <steffen.grunewald@xxxxxxxxxx>
- Subject: Re: [HTCondor-users] Consistency problems between schedd(s) view and CM?
On Wed, 2025-12-03 at 13:32:06 +0000, Bockelman, Brian wrote:
> Under typical conditions, the time between claimed and activated is less than a second - can be hard to catch in a busy pool.
> Under busy conditions - or if there are persistent failures in activation, the numbers diverge. That causes a slot to be claimed - but no jobs running. In the past, Iâve found a large discrepancy a fruitful place to dig for bugs or misconfiguration.
> Is this a possible explanation for what youâre seeing?
Hi Brian,
I'd be surprised if a busy cluster would result in those periods of time spanning
multiples of 10 minutes AFAICT - each of the squares in the graph grid is half an
hour wide! (But I'm willing to learn, and I'm curious about the real cause, thus
I'll be watching this space.)
What really puzzles me is that only the "other" part seems to be affected (set to
0, instead the white stuff bubbles up). There must be something to that in
particular...
Best,
Steffen