my file .sub is the following:
universe = vanilla
executable = sim_rebounding_DT.exe
requirements = Memory >= 128
rank = kflops
should_transfer_files = YES
when_to_transfer_output = ON_EXIT
transfer_input_files = meas.txt
error = FIR_SRA.err
log = FIR_SRA.log
output = FIR_SRA_cmeans_432.txt
arguments = 0 0
queue
output = FIR_SRA_cmeans_422.txt
arguments = 0 1
queue
when i submit this file to condor
pool the command condor_status show
it:
C:\thesis\simulation >condor_status
Name
OpSys Arch
State Activity
LoadAv Mem ActvtyTime
vm1@dtc-mvill <mailto:vm1@dtc-mvill > WINNT51
INTEL Unclaimed Idle
0.000 251
0+02:08:28
vm2@dtc-mvill <mailto:vm2@dtc-mvill > WINNT51
INTEL Unclaimed Idle
0.000 251
0+02:08:29
dtc-snaranjo. WINNT51 INTEL
Unclaimed Idle
0.030 478 0+02:03:27
dtc-vhinojosa WINNT51 INTEL
Claimed Busy
0.000 1015 0+00:02:21
id-vhinojosa. WINNT51 INTEL
Unclaimed Idle
0.840 254 0+02:05:22
Machines Owner Claimed Unclaimed Matched
Preempting
INTEL/WINNT51
5 0
1
4
0
0
Total
5 0
1
4 0
0
The the dtc-vhinojosa is running with
the job, but due to my constraint
the next machine is dtc-snaranjo, but
i don't know because it doesn't run
the job.
i use the command -analyze and the
result is the next:
C:\thesis\simulation >condor_q -analyze
-- Submitter: dtc-vhinojosa : <10.0.1.171:4685
> : dtc-vhinojosa
ID OWNER
SUBMITTED
RUN_TIME ST PRI SIZE
CMD
---
045.000: Request is being serviced
---
045.001: Run analysis summary. Of
5 machines,
0 are rejected
by your job's requirements
0 reject your
job because of their own requirements
1 match, but are
serving users with a better priority
in the pool
4 match, match,
but reject the job for unknown
reasons
0 match, but
will not currently preempt their existing
job
0 are available
to run your job
can i help me which could be the
reasons for the condor show the
message "reject the job for unknown
reasons" or where i can search
the mistake?
thanks for your help
regards,
victor
________________________________
De: condor-users-bounces@xxxxxxxxxxx en nombre de
David A. Kotz
Enviado el: Jue 01/06/2006 02:40 p.m.
Para: Condor-Users Mail List
Asunto: Re: [Condor-users] i have a
problem
Victor,
The first step is to use the
-analayze switch to condor_q. Try
using
this command on the submit node:
condor_q -analyze 26.0
and also this one (if it works in
Windows):
condor_q -better-analzye 26.0
Those commands should give you some
indication of why job 26.0 is
not
starting.
If you get nothing useful from those
commands, compare long listings of
the jobs and the machines:
condor_q -l 26.0
condor_status -l dtc-mvill
to see if you can spot
incompatibilities between the job's
requirements
and the machine's requirements.
- dave
V?tor Hinojosa wrote:
> i have a condor pool. the
summary is the following:
>
> Name
OpSys Arch
State Activity
LoadAv Mem ActvtyTime
> vm1@dtc-mvill <mailto:vm1@dtc-mvill >
WINNT51 INTEL Unclaimed
Idle 0.000
508 0+02:08:29
> vm2@dtc-mvill <mailto:vm2@dtc-mvill >
WINNT51 INTEL Unclaimed
Idle 0.330
508 0+02:08:30
> dtc-vhinojosa WINNT51
INTEL Unclaimed Idle
0.000 1015 0+00:08:06
> id-vhinojosa. WINNT51
INTEL Unclaimed Idle
0.010 254 0+02:08:09
>
Machines Owner Claimed Unclaimed Matched
Preempting
> INTEL/WINNT51
4
0 0
4
0
0
>
Total
4 0
0
4 0
0
>
> i submit a task with
condor_submit. i check the status of
my job with condor_q command.
>
> -- Submitter: dtc-vhinojosa :
<10.0.1.171:2934 > : dtc-vhinojosa
> ID OWNER
SUBMITTED RUN_TIME ST PRI
SIZE CMD
> 26.0 Victor
6/1 13:02
0+00:00:00 I 0 0.3
sim_rebounding_DT
> 26.1 Victor
6/1 13:02
0+00:00:00 I 0 0.3
sim_rebounding_DT
> 26.2 Victor
6/1 13:02
0+00:00:00 I 0 0.3
sim_rebounding_DT
> 26.3 Victor
6/1 13:02
0+00:00:00 I 0 0.3
sim_rebounding_DT
> 26.4 Victor
6/1 13:02
0+00:00:00 I 0 0.3
sim_rebounding_DT
> 26.5 Victor
6/1 13:02
0+00:00:00 I 0 0.3
sim_rebounding_DT
> 26.6 Victor
6/1 13:02
0+00:00:00 I 0 0.3
sim_rebounding_DT
> 26.7 Victor
6/1 13:02
0+00:00:00 I 0 0.3
sim_rebounding_DT
> 26.8 Victor
6/1 13:02
0+00:00:00 I 0 0.3
sim_rebounding_DT
> 26.9 Victor
6/1 13:02
0+00:00:00 I 0 0.3
sim_rebounding_DT
> 10 jobs; 10 idle, 0 running,
0 held
>
> when i install the condor pool
i set up all machines with the
option "always run Condor jobs".so i
don't know what happen. somebody can
help me or where i can search
the mistake?
>
> regards,
>
>
> victor hinojosa
>
>
>
------------------------------------------------------------------------
>
> _______________________________________________
> Condor-users mailing list
> Condor-users@xxxxxxxxxxx
> https://lists.cs.wisc.edu/mailman/listinfo/condor-users
_______________________________________________
Condor-users mailing list
Condor-users@xxxxxxxxxxx
https://lists.cs.wisc.edu/mailman/listinfo/condor-users
_______________________________________________
Condor-users mailing list
Condor-users@xxxxxxxxxxx
https://lists.cs.wisc.edu/mailman/listinfo/condor-users