
Re: [HTCondor-users] Unexpected CumulativeSuspensionTime > RemoteWallclockTime in HTCondor 24.0.7



Hi Greg,

Thank you very much. So far, this issue has only occurred once for a single job.

Cheers,

Carles

On Thu, 7 Aug 2025 at 18:46, Greg Thain via HTCondor-users <htcondor-users@xxxxxxxxxxx> wrote:

On 7/18/25 2:55 AM, Carles Acosta wrote:
>
> We have some machines where jobs may be suspended under certain
> conditions. Today, for the first time, we noticed a job where
> CumulativeSuspensionTime is greater than RemoteWallclockTime, which
> should not be possible according to the HTCondor documentation. In
> fact the CommittedTime is also greater than the RemoteWallclockTime.
>
> # condor_history 1328280 -af JobStatus RemoteWallclockTime CumulativeSuspensionTime CommittedTime
> 4 6761.0 7142 7949
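
The comparison in the output above can be automated over many jobs. The following is only a minimal sketch, not anything provided by HTCondor: it assumes each input line has the same four columns as the `-af` query quoted above (JobStatus, RemoteWallclockTime, CumulativeSuspensionTime, CommittedTime), and the function name `find_inconsistent` is hypothetical.

```python
def find_inconsistent(lines):
    """Flag rows where suspension or committed time exceeds wall-clock time.

    Assumed column order (taken from the query quoted above):
    JobStatus RemoteWallclockTime CumulativeSuspensionTime CommittedTime
    """
    flagged = []
    for line in lines:
        fields = line.split()
        if len(fields) != 4:
            continue  # skip blank or malformed lines
        status, wall, suspended, committed = fields
        try:
            wall_s = float(wall)
            suspended_s = float(suspended)
            committed_s = float(committed)
        except ValueError:
            continue  # skip rows with non-numeric values such as "undefined"
        if suspended_s > wall_s or committed_s > wall_s:
            flagged.append({
                "status": status,
                "wallclock": wall_s,
                # Positive values mean the attribute exceeds the wall-clock time.
                "suspension_excess": suspended_s - wall_s,
                "committed_excess": committed_s - wall_s,
            })
    return flagged


if __name__ == "__main__":
    # The single row reported in this thread.
    for row in find_inconsistent(["4 6761.0 7142 7949"]):
        print(row)
```

For the row reported here, CumulativeSuspensionTime exceeds RemoteWallclockTime by 381 seconds and CommittedTime exceeds it by 1188 seconds.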


Carles:

It is not our intention that CumulativeSuspensionTime should ever be
greater than RemoteWallclockTime, but it is possible there is a bug
where we lose track of some of these values when the schedd restarts. I
will see if I can reproduce this, and if so, fix it.


-greg




--
Carles Acosta i Silva
PIC (Port d'Informació Científica)
Campus UAB, Edifici D
E-08193 Bellaterra, Barcelona
Tel: +34 93 581 33 08
Fax: +34 93 581 41 10
http://www.pic.es
Avís - Aviso - Legal Notice: http://legal.ifae.es