Mailing List Archives Authenticated access	UW Madison Computer Sciences Department Computer Systems Lab

Re: [HTCondor-users] Can a DAGMan node be an already running job?

Date: Mon, 30 Sep 2024 15:43:24 -0500 (CDT)
From: Todd L Miller <tlmiller@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] Can a DAGMan node be an already running job?

Let's say I have the following two DAGs

A -> B

A -> C
Where A is a long-running job. There's an A job running because someone has invoked the A -> B workflow. Now someone comes along and wants to invoke the A -> C workflow. I don't want to re-run A, but I can't mark it as DONE in the DAG file because then C will start running immediately.

As far as I know, there's no particular time limit on how long a DAG node's PRE script can take, so if you know which job(s) or cluster(s) you're waiting for, you can wait for them in the PRE script. That has two advantages: first, it's not stepping outside of DAGMan, meaning that as the workflow(s) involved get more complicated, you can continue to take advantage of (rather than reimplement) its features; second, it doesn't use an execute node while it's waiting.
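
A minimal sketch of such a PRE script, assuming the cluster ID of the already-running A job is known and passed as the script's only argument. The file name wait_for_cluster.py, the function names, and the 60-second polling interval are all hypothetical; the only HTCondor call is condor_q with -af (autoformat), which prints nothing once no job from that cluster is left in the queue.

```python
#!/usr/bin/env python3
"""Hypothetical PRE script: block until a given cluster has left the queue."""
import subprocess
import sys
import time


def cluster_in_queue(cluster_id):
    """Return True if condor_q still lists at least one job from the cluster."""
    result = subprocess.run(
        ["condor_q", str(cluster_id), "-af", "ClusterId"],
        capture_output=True,
        text=True,
        check=True,  # raise if condor_q itself fails, so the PRE script fails
    )
    return bool(result.stdout.strip())


def wait_for_cluster(cluster_id, in_queue=cluster_in_queue,
                     poll_seconds=60, sleep=time.sleep):
    """Poll until the cluster is gone; return how many times we had to wait."""
    polls = 0
    while in_queue(cluster_id):
        polls += 1
        sleep(poll_seconds)
    return polls


# Only act when a cluster ID was supplied on the command line.
if __name__ == "__main__" and len(sys.argv) > 1:
    wait_for_cluster(int(sys.argv[1]))
    sys.exit(0)  # exit code 0 lets DAGMan carry on with the node
```

In the A -> C DAG file this could be attached to node A with a line such as "SCRIPT PRE A wait_for_cluster.py 1234", where 1234 stands in for the real cluster ID. Note that the sketch only detects that the cluster has left the queue; it does not check whether the job succeeded or was removed, so a real script would probably also want to inspect the job's history or its output files.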

Of course, you still have to skip the job, but that's what the "skip_if_dataflow" submit command is for. (There are other ways to skip the job from the PRE script, but this one has the advantage of having HTCondor verify that the two A nodes were, in fact, identical.)


-- ToddM

References:
- [HTCondor-users] Can a DAGMan node be an already running job?
  - From: Belakovski, Nickolai
