Thanks Cole. We parse the node_status using classad.parseAds(),
so the comments are lost.
I will go the extra mile and explicitly check the job status
with condor_q.
Stefano
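
For readers following the thread, an explicit check along these lines could look like the sketch below. It is only an illustration, not the code actually used: the helper names (parse_job_status, dagman_job_status, is_held) are made up for this example, and it assumes that condor_q is on the PATH and that the cluster id of the DAGMan job is already known. The JobStatus codes are the standard HTCondor ones (5 means Held).

```python
import subprocess

# Standard HTCondor JobStatus codes.
JOB_STATUS_NAMES = {
    1: "Idle",
    2: "Running",
    3: "Removed",
    4: "Completed",
    5: "Held",
    6: "Transferring Output",
    7: "Suspended",
}


def parse_job_status(output):
    """Turn the text printed by `condor_q <id> -af JobStatus` into an int.

    Returns None when the output is empty, which is what happens once
    the job has left the queue.
    """
    text = output.strip()
    if not text:
        return None
    # Only the first printed value is used (the DAGMan job itself).
    return int(text.split()[0])


def dagman_job_status(cluster_id):
    """Hypothetical helper: ask condor_q for the JobStatus of a cluster."""
    result = subprocess.run(
        ["condor_q", str(cluster_id), "-af", "JobStatus"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_job_status(result.stdout)


def is_held(status):
    """True when the JobStatus code means Held."""
    return status == 5
```

A caller would run is_held(dagman_job_status(cluster_id)) before trusting the DagStatus value read from the node status file.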
Hi Stefano,
Being held is not considered an error for DAGMan. When you run condor_hold on a running DAGMan job, the DagStatus attribute in the node status file should actually be 3 (submitted) with an inline comment saying "STATUS_SUBMITTED (held)". That being said, when writing the node status file, DAGMan checks whether the DAG has finished executing. If it has, that takes precedence over the hold and the status is set to done or error.
-Cole Bollig
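
Since a ClassAd parser drops the inline comment, one way to still see the "(held)" marker is to scan the raw text of the node status file. The sketch below is an assumption-laden illustration: read_dag_status and looks_held are hypothetical names, and the expected line layout (DagStatus = 3; followed by a C-style comment) is inferred from the description above and the documented file format rather than verified against every HTCondor version.

```python
import re

# Matches a line such as:
#   DagStatus = 3; /* "STATUS_SUBMITTED (held)" */
# The trailing comment is optional.
_DAG_STATUS_RE = re.compile(
    r'^[ \t]*DagStatus[ \t]*=[ \t]*(\d+)[ \t]*;[ \t]*(?:/\*[ \t]*(.*?)[ \t]*\*/)?',
    re.MULTILINE,
)


def read_dag_status(text):
    """Return (code, comment) from node status file text, or None.

    `comment` is the inline comment with surrounding quotes removed,
    or an empty string when the line carries no comment.
    """
    match = _DAG_STATUS_RE.search(text)
    if match is None:
        return None
    code = int(match.group(1))
    comment = (match.group(2) or "").strip('"')
    return code, comment


def looks_held(text):
    """Assumption: a held DAG is marked by "(held)" in the comment."""
    status = read_dag_status(text)
    return status is not None and "(held)" in status[1]
```

Because a finished DAG takes precedence over the hold, a result of done or error from this scan says nothing about whether the job was held, so it complements a condor_q check rather than replacing it.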
From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Stefano Belforte via HTCondor-users <htcondor-users@xxxxxxxxxxx>
Sent: Tuesday, May 13, 2025 4:30 PM
To: HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx>
Cc: Stefano Belforte <stefano.belforte@xxxxxxx>
Subject: [HTCondor-users] which status for a Held DAGMAN ?

Dear experts,
with reference to the list of possible DagStatus values in the node
status file
https://htcondor.readthedocs.io/en/latest/automated-workflows/dagman-information-files.html#current-node-status-file
Am I correct in assuming that when a running DAG is held via condor_hold
as per
https://htcondor.readthedocs.io/en/latest/automated-workflows/dagman-interaction.html#suspending-a-running-dag
it goes into FAILED status?
I have seen this, but before I "code around it", I would like to be sure
that it happens reliably every time.
Thanks
Stefano