Hi Todd!
Il 27/03/19 19:51, Todd Tannenbaum ha
scritto:
On 3/27/2019 12:21 PM, David Rebatto wrote:Hi, I noticed that vanilla jobs restarting after preemption overwrite the output of previous execution. Is there a way to instruct them to *append* to output file instead?By "output file" above, I assume you mean the job's stdout as defined by the "output=filename" line in your submit file? Yes, see below. Are you using HTCondor's file transfer or a shared file system? File transfer, the jobs are flocking away from the submission
pool. If you are using HTCondor's file transfer, the issue is likely the output from previous executions is not being transferred back upon preemption to the submit machine from the execute machine. I think you can achieve what you want by adding the following line to your submit file: when_to_transfer_output = ON_EXIT_OR_EVICT I have it in the submit file, and it is working. Here's the
(stripped) submit file: universe = vanilla The job prints some debug information on stdout, its real output
on 'squares.txt', and a checkpoint in 'checkpoint.txt' whenever it
gets a SIGTERM. I attach the full submit file and the script, maybe I'm doing something wrong in there. Cheers, -- David Rebatto I.N.F.N. - Sezione di Milano Via Celoria, 16 - 20133 Milano ITALY tel: +39 02503.17623 e-mail: David.Rebatto@xxxxxxxxxx URL: http://www.mi.infn.it/~rebatto "Some say the world will end in fire; some say in segfaults." -- Randall Munroe |
Attachment:
test_checkpoint.tgz
Description: application/compressed-tar
Attachment:
smime.p7s
Description: Firma crittografica S/MIME