[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] which status for a Held DAGMAN ?



Hi Stefano,

Being held is not considered an error for DAGMan. When you run condor_hold on a running DAGMan job, the DagStatus attribute in the node status file should actually be 3 (submitted) with an inline comment saying "STATUS_SUBMITTED (held)". That being said when writing the node status file, DAGMan checks to see if the DAG is finished executing. This would take precedence over the hold and set the status to done or error.

-Cole Bollig

From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Stefano Belforte via HTCondor-users <htcondor-users@xxxxxxxxxxx>
Sent: Tuesday, May 13, 2025 4:30 PM
To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Cc: Stefano Belforte <stefano.belforte@xxxxxxx>
Subject: [HTCondor-users] which status for a Held DAGMAN ?
 
Dear experts,

with reference to the list of possible DagStatus values in the node
status file

https://urldefense.com/v3/__https://htcondor.readthedocs.io/en/latest/automated-workflows/dagman-information-files.html*current-node-status-file__;Iw!!Mak6IKo!O3TGkKL3kkvYmstgdZ0TcndQLXuR4Y5jUOia-cGFK-m33lSawslXvQQZTKtfAtbdo8BEOL1H2S4qRhXNBGPR4u1lKRd2HQ$

Am I correct in assuming that when a running DAG is held via condor_old
as per
https://urldefense.com/v3/__https://htcondor.readthedocs.io/en/latest/automated-workflows/dagman-interaction.html*suspending-a-running-dag__;Iw!!Mak6IKo!O3TGkKL3kkvYmstgdZ0TcndQLXuR4Y5jUOia-cGFK-m33lSawslXvQQZTKtfAtbdo8BEOL1H2S4qRhXNBGPR4u1ieQCeMA$

it goes into FAILED status ?

I have seen this, but before I "code around it", would like to be sure
that it reliably happens all of the times.

Thanks

Stefano

_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe

Join us in June at Throughput Computing 25: https://urldefense.com/v3/__https://osg-htc.org/htc25__;!!Mak6IKo!O3TGkKL3kkvYmstgdZ0TcndQLXuR4Y5jUOia-cGFK-m33lSawslXvQQZTKtfAtbdo8BEOL1H2S4qRhXNBGPR4u20zV-L2g$

The archives can be found at: https://www-auth.cs.wisc.edu/lists/htcondor-users/