On 1/26/2023 1:25 PM, JM wrote:
HTCondor users,
I have an exit hook to send .
job.ad to a database. However, I
noticed that in some uncertain cases, ImageSize is 1250
instead of the real ImageSize from condor_history. The jobs
run much longer than 15 seconds. I would expect startd will
update .
job.ad.
I even tried to sleep 30 seconds in exit hook to make sure the
update happens.
Does anyone have a clue why?
Hi,
I did not positively confirm this, but my guess is the .
job.ad file
sitting in the scratch directory is written at the start of job
execution, and not re-written every time the job ad is updated.Â
However, note that HTCondor will give a current/updated copy of the
job classad to your exit hook script via stdin [*]. Instead of
having your exit hook read the .
job.ad file, I suggest you use the
information passed to it via stdin. Let us know if you have any
additional problems or questions here. It would not be a big deal
for us to patch HTCondor to update the .
job.ad upon job exit (i.e.
before invoking the exit hook), but using the standard input should
do what you want today....
Hope this helps,
Todd
[*] = In the manual at link:
ÂÂ
https://htcondor.readthedocs.io/en/latest/admin-manual/hooks.html#work-fetching-hooks-invoked-by-htcondor
look for "HOOK_JOB_EXIT" and note what it says in the section
"Standard input given to the hook".
--
Todd Tannenbaum <tannenba@xxxxxxxxxxx> University of Wisconsin-Madison
Center for High Throughput Computing Department of Computer Sciences
Calendar: https://tinyurl.com/yd55mtgd 1210 W. Dayton St. Rm #4257