my file .sub is the following:
universe = vanilla
executable = sim_rebounding_DT.exe
requirements = Memory >= 128
rank = kflops
should_transfer_files = YES
when_to_transfer_output = ON_EXIT
transfer_input_files = meas.txt
error = FIR_SRA.err
log = FIR_SRA.log
output = FIR_SRA_cmeans_432.txt
arguments = 0 0
queue
output = FIR_SRA_cmeans_422.txt
arguments = 0 1
queue
when i submit this file to condor pool the command condor_status show it:
C:\thesis\simulation>condor_status
Name OpSys Arch State Activity LoadAv Mem ActvtyTime
vm1@dtc-mvill <mailto:vm1@dtc-mvill> WINNT51 INTEL Unclaimed Idle 0.000 251 0+02:08:28
vm2@dtc-mvill <mailto:vm2@dtc-mvill> WINNT51 INTEL Unclaimed Idle 0.000 251 0+02:08:29
dtc-snaranjo. WINNT51 INTEL Unclaimed Idle 0.030 478 0+02:03:27
dtc-vhinojosa WINNT51 INTEL Claimed Busy 0.000 1015 0+00:02:21
id-vhinojosa. WINNT51 INTEL Unclaimed Idle 0.840 254 0+02:05:22
Machines Owner Claimed Unclaimed Matched Preempting
INTEL/WINNT51 5 0 1 4 0 0
Total 5 0 1 4 0 0
The the dtc-vhinojosa is running with the job, but due to my constraint the next machine is dtc-snaranjo, but i don't know because it doesn't run the job.
i use the command -analyze and the result is the next:
C:\thesis\simulation>condor_q -analyze
-- Submitter: dtc-vhinojosa : <10.0.1.171:4685> : dtc-vhinojosa
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
---
045.000: Request is being serviced
---
045.001: Run analysis summary. Of 5 machines,
0 are rejected by your job's requirements
0 reject your job because of their own requirements
1 match, but are serving users with a better priority in the pool
4 match, match, but reject the job for unknown reasons
0 match, but will not currently preempt their existing job
0 are available to run your job
can i help me which could be the reasons for the condor show the message "reject the job for unknown reasons" or where i can search the mistake?
thanks for your help
regards,
victor
________________________________
De: condor-users-bounces@xxxxxxxxxxx en nombre de David A. Kotz
Enviado el: Jue 01/06/2006 02:40 p.m.
Para: Condor-Users Mail List
Asunto: Re: [Condor-users] i have a problem
Victor,
The first step is to use the -analayze switch to condor_q. Try using
this command on the submit node:
condor_q -analyze 26.0
and also this one (if it works in Windows):
condor_q -better-analzye 26.0
Those commands should give you some indication of why job 26.0 is not
starting.
If you get nothing useful from those commands, compare long listings of
the jobs and the machines:
condor_q -l 26.0
condor_status -l dtc-mvill
to see if you can spot incompatibilities between the job's requirements
and the machine's requirements.
- dave
Víctor Hinojosa wrote:
> i have a condor pool. the summary is the following:
>
> Name OpSys Arch State Activity LoadAv Mem ActvtyTime
> vm1@dtc-mvill <mailto:vm1@dtc-mvill> WINNT51 INTEL Unclaimed Idle 0.000 508 0+02:08:29
> vm2@dtc-mvill <mailto:vm2@dtc-mvill> WINNT51 INTEL Unclaimed Idle 0.330 508 0+02:08:30
> dtc-vhinojosa WINNT51 INTEL Unclaimed Idle 0.000 1015 0+00:08:06
> id-vhinojosa. WINNT51 INTEL Unclaimed Idle 0.010 254 0+02:08:09
> Machines Owner Claimed Unclaimed Matched Preempting
> INTEL/WINNT51 4 0 0 4 0 0
> Total 4 0 0 4 0 0
>
> i submit a task with condor_submit. i check the status of my job with condor_q command.
>
> -- Submitter: dtc-vhinojosa : <10.0.1.171:2934> : dtc-vhinojosa
> ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
> 26.0 Victor 6/1 13:02 0+00:00:00 I 0 0.3 sim_rebounding_DT
> 26.1 Victor 6/1 13:02 0+00:00:00 I 0 0.3 sim_rebounding_DT
> 26.2 Victor 6/1 13:02 0+00:00:00 I 0 0.3 sim_rebounding_DT
> 26.3 Victor 6/1 13:02 0+00:00:00 I 0 0.3 sim_rebounding_DT
> 26.4 Victor 6/1 13:02 0+00:00:00 I 0 0.3 sim_rebounding_DT
> 26.5 Victor 6/1 13:02 0+00:00:00 I 0 0.3 sim_rebounding_DT
> 26.6 Victor 6/1 13:02 0+00:00:00 I 0 0.3 sim_rebounding_DT
> 26.7 Victor 6/1 13:02 0+00:00:00 I 0 0.3 sim_rebounding_DT
> 26.8 Victor 6/1 13:02 0+00:00:00 I 0 0.3 sim_rebounding_DT
> 26.9 Victor 6/1 13:02 0+00:00:00 I 0 0.3 sim_rebounding_DT
> 10 jobs; 10 idle, 0 running, 0 held
>
> when i install the condor pool i set up all machines with the option "always run Condor jobs".so i don't know what happen. somebody can help me or where i can search the mistake?
>
> regards,
>
>
> victor hinojosa
>
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> Condor-users mailing list
> Condor-users@xxxxxxxxxxx
> https://lists.cs.wisc.edu/mailman/listinfo/condor-users
_______________________________________________
Condor-users mailing list
Condor-users@xxxxxxxxxxx
https://lists.cs.wisc.edu/mailman/listinfo/condor-users
<<winmail.dat>>