Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [HTCondor-users] Condor & Bosco submissions
- Date: Fri, 1 Mar 2013 11:53:24 -0600
- From: Jaime Frey <jfrey@xxxxxxxxxxx>
- Subject: Re: [HTCondor-users] Condor & Bosco submissions
On Mar 1, 2013, at 5:00 AM, Guillermo Marco Puche <guillermo.marco@xxxxxxxxxxxxxxxxxxxxx> wrote:
> I've been trying Bosco lately and seems to work pretty well for me to submit to another lan cluster SGE cluster.
>
> For example:
>
> $ condor_q
> -- Submitter: brugal : <192.168.6.2:11000?sock=3072_dcd9_3> : brugal
> ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
> 62.0 gmarco 3/1 04:43 0+00:14:32 R 0 0.0 bwa.sh
>
>
> I was then trying to achieve the same but with my local Condor installation and not with condor pool inside Bosco. I'm having no success when trying to submit exactly the same condor job file:
>
> As root i start condor with "condor_master".
> ps -ef | grep condor
> condor 3850 1 0 05:05 ? 00:00:00 condor_master
> condor 3851 3850 0 05:05 ? 00:00:00 condor_collector -f
> condor 3853 3850 0 05:05 ? 00:00:00 condor_negotiator -f
> condor 3854 3850 0 05:05 ? 00:00:00 condor_schedd -f
> condor 3855 3850 0 05:05 ? 00:00:00 condor_startd -f
> root 3856 3854 0 05:05 ? 00:00:00 condor_procd -A /var/run/condor/procd_pipe.SCHEDD -L /var/log/condor/ProcLog.SCHEDD -R 10000000 -S 60 -C 498
> condor 3907 3855 87 05:05 ? 00:00:03 mips
> root 3924 3758 0 05:05 pts/0 00:00:00 grep condor
> I try to submit my job and holds on Idle state forever, with Bosco I don't have that problem:
>
> condor_q
>
> -- Submitter: brugal : <192.168.6.2:41257> : brugal
> ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
> 26.0 gmarco 3/1 05:07 0+00:00:00 I 0 0.0 bwa.sh
>
>
> That's my job file:
>
> universe = grid
> grid_resource = batch sge gmarco@cacique
> executable = bwa.sh
> output = bwa.out
> error = bwa.err
> log = bwa.log
> should_transfer_files = YES
> transfer_output = true
> stream_output = true
> when_to_transfer_output = ON_EXIT_OR_EVICT
> queue
Submitting jobs to a remote cluster using a regular installation of HTCondor requires some manual configuration steps, which we don't have documented currently. This is one of the advantages of Bosco. Over time, we may make this kind of job submission easier to do with a regular HTCondor installation.
Thanks and regards,
Jaime Frey
UW-Madison HTCondor Project