[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] Unexpected CumulativeSuspensionTime > RemoteWallclockTime in HTCondor 24.0.7



Dear all,

We have some machines where jobs may be suspended under certain conditions. Today, for the first time, we noticed a job where CumulativeSuspensionTime is greater than RemoteWallclockTime, which should not be possible according to the HTCondor documentation. In fact the CommitedTime is also greather than the RemoteWallclockTime.

# condor_history 1328280 -af JobStatus RemoteWallclockTime CumulativeSuspensionTime CommittedTime
4 6761.0 7142 7949

It seems to be an isolated issue, possibly caused by a reporting glitch or something specific to the machine. There was a condor_schedd restart around 40 minutes before the job completed. Could that have affected the accounting?ÂAs far as I understand, we should always expect RemoteWallclockTime to be greater than or equal to CumulativeSuspensionTime, correct?

We are running HTCondor v 24.0.7.Â

Cheers,

Carles

--
Carles Acosta i Silva
PIC (Port d'Informacià CientÃfica)
Campus UAB, Edifici D
E-08193 Bellaterra, Barcelona
Tel: +34 93 581 33 08
Fax: +34 93 581 41 10
AvÃs - Aviso - Legal Notice: Âhttp://legal.ifae.es