Hello,
I have the submit file below I'm using to submit a series of job (same executable, different input file and arguments read from a file). If I submit this witch condor_submit from my submission node having
gdebrecz@godzilla14linux:~/works/condor/examples$ condor_version
$CondorVersion: 8.8.0 Jan 03 2019 BuildID: Debian-8.8.0-1 PackageID: 8.8.0-1 Debian-8.8.0-1 $
$CondorPlatform: X86_64-Ubuntu_18.04 $
than everything works fine.
However if I submit this from another submission node (and submit it to the same execute node), with
gergely.debreczeni@adas191linux:/tmp/batchsubmission$ condor_version
$CondorVersion: 8.4.2 Feb 22 2016 BuildID: Debian-8.4.2~dfsg.1-1build1 Debian-8.4.2~dfsg.1-1build1 $
then the job is not running, it turns into Held state with the following error from condor_q -anal:
--------------------------------
## Executable
executable = batch.sh
universe = docker
docker_image = dani_tensorflow:v_01
## Logs
log = out/batch.$(Process).log
output = out/batch.$(Process).stdout
error = out/batch.$(Process).stderr
## File transfer
should_transfer_files = Yes
when_to_transfer_output = ON_EXIT
line = $(Row)+1
transfer_output_files = output.out
transfer_output_remaps = "output.out=out/output$INT(line).out"
#transfer_input_files = batch.sh
## Resources requested
request_cpus = 1
request_GPUs = 0
Requirements = (ResourceType == "Dedicated") && (regexp(".*dani_tensorflow.*",LocallyAv$
## Submit command
queue transfer_input_files, arguments from [0:2:1] ./filelist
This e-mail and any files transmitted with it contain confidential and may contain privileged information. If you are not the intended recipient (or have received this e-mail in error) please notify the sender immediately and delete this e-mail. Any unauthorized use, copying, disclosure or distribution of the material in this e-mail is strictly forbidden. |