Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Condor-users] Why do remote job submissions on Linux require 'should_transfer_files'
- Date: Fri, 09 Mar 2012 10:12:24 -0500 (EST)
- From: Tim St Clair <tstclair@xxxxxxxxxx>
- Subject: Re: [Condor-users] Why do remote job submissions on Linux require 'should_transfer_files'
inline
----- Original Message -----
> From: "Thomas Luff" <Thomas.Luff@xxxxxxx>
> To: condor-users@xxxxxxxxxxx
> Sent: Friday, March 9, 2012 8:27:45 AM
> Subject: [Condor-users] Why do remote job submissions on Linux require 'should_transfer_files'
>
> I'm currently toying with remote job submission for our validation
> cluster and have come across a bit of an issue.
>
> Jobs that are remotely submit to be run on Linux without specifying
> 'should_transfer_files = true' get stuck in the hold state with the
> following error:
>
> 023.000: Request is held.
> Hold reason: Error from slot1@xxxxxxxxxx: Failed to execute
> '/bin/sleep' with arguments 1m: Cannot access specified iwd
> "/work/condor/local/spool/23/0/cluster23.proc0.subproc0"
> (errno=2: 'No such file or directory')
>
> The Linux job only gets stuck when remotely submitted (condor_submit
> -remote). It executes successfully when submitted locally.
>
> A similar job can be run fine on a windows node with
> 'should_transfer_files = IF_NEEDED'.
>
> My Jobs are as follows:
> Linux:
> <pre>
> Executable = /bin/sleep
> Arguments = "1m"
>
> Requirements = (ARCH=="INTEL" || ARCH=="X86_64") &&
> OpSys=="LINUX"
>
> transfer_executable = false
> should_transfer_files = TRUE
> WhenToTransferOutput = ON_EXIT
> Run_As_Owner = True
> Queue
> </pre>
>
> Windows:
> <pre>
> Executable = \\network\path\to\bin\perl
> Arguments = "-e '$time = time+60; sleep 1 until time >
> $time'" #Sleep 60 seconds
>
> Requirements = ((ARCH=="INTEL" || ARCH=="X86_64") &&
> OpSys=="WINNT51")
>
> transfer_executable = false
> should_transfer_files = IF_NEEDED
> WhenToTransferOutput = ON_EXIT
>
> Run_As_Owner = True
>
> Queue
> </pre>
>
> My question is, Why is this needed and why is it different between
> Windows and Linux?
condor_submit fills in a series of defaults when working locally, a simple way to check the full list to compare the differences is to run `condor_q -long cluster.proc` and compare the differences.
In general: The path of least resistance for remote submission, in my experience, has been to set:
should_transfer_files = TRUE
# something sane on windows, but not needed for windows.
iwd = /tmp
>
> Thanks
>
> P.S Has anyone successfully built Condor for ARM Linux?
Not yet. Once ARM makes it's way to the data center it might become more of a priority, but right now it would just melt a cell phone.
>
>
>
>
> -- IMPORTANT NOTICE: The contents of this email and any attachments
> are confidential and may also be privileged. If you are not the
> intended recipient, please notify the sender immediately and do not
> disclose the contents to any other person, use it for any purpose,
> or store or copy the information in any medium. Thank you.
>
> _______________________________________________
> Condor-users mailing list
> To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx
> with a
> subject: Unsubscribe
> You can also unsubscribe by visiting
> https://lists.cs.wisc.edu/mailman/listinfo/condor-users
>
> The archives can be found at:
> https://lists.cs.wisc.edu/archive/condor-users/
>