Re: [Condor-devel] "condor_q -better-analyze" has always a strange requirement marked as "REMOVE" !?!
- Date: Mon, 29 Nov 2010 08:00:21 -0600
- From: "Timothy St. Clair" <tstclair@xxxxxxxxxx>
- Subject: Re: [Condor-devel] "condor_q -better-analyze" has always a strange requirement marked as "REMOVE" !?!
Rob - 
   The requirement is fictitious: condor_submit inserts it, so it is not
something you set in your submit file.  To determine the root cause of
the failed match, you will likely want to check the StartLog on the
machine in question.
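
As a rough check of the arithmetic in the reported condition, here is a minimal Python sketch. It is not Condor code: the function name is hypothetical, None stands in for the ClassAd value "undefined", and it assumes target.Memory is advertised in megabytes. The constant 2.441406250000000E-02 is exactly 25/1024, so with JobVMMemory undefined the second half of the condition is always true, and the first half holds for any machine advertising at least 1 MB.

```python
import math

def fictitious_clause(target_memory_mb, job_vm_memory=None):
    """Evaluate the condition shown by condor_q -better-analyze.

    Hypothetical helper for illustration only; None plays the role of
    the ClassAd value 'undefined'.
    """
    default_mb = 2.441406250000000E-02  # exactly 25 / 1024
    # ifThenElse(JobVMMemory isnt undefined, JobVMMemory, default)
    chosen = job_vm_memory if job_vm_memory is not None else default_mb
    request_mb = math.ceil(chosen)
    return (1024 * target_memory_mb >= 25) and (1024 * request_mb >= 25)

# The constant really is 25 KB expressed in MB.
print(2.441406250000000E-02 == 25 / 1024)

# A machine advertising 512 MB satisfies the condition.
print(fictitious_clause(512))
```

Under those assumptions the condition itself should not exclude the two machines that match on name, which is consistent with looking at the StartLog for the real reason.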
Cheers,
Tim
On Thu, 2010-11-25 at 19:14 -0800, Rob wrote:
> Hi,
> 
> This is with Condor 7.4.2 on a Linux Fedora master and with Condor 7.4.4 on the 
> Windows XP pool PCs.
> 
> Whenever I do a "condor_q -better-analyze" on a job, I always get an additional 
> requirement like:
> 
> ( ( ( 1024 * target.Memory ) >= 25 ) &&
>   ( ( 1024 * ceiling(ifThenElse(JobVMMemory isnt undefined,JobVMMemory,2.441406250000000E-02)) ) >= 25 ) )
> 
> which gives 0 (zero) machines and is thus marked as "REMOVE".
> 
> However, I did not set this requirement in the submission file, so how can 
> I remove it?
> 
> For example, today I encountered this situation:
> 
> ==========================
> $ cat helloworld.condor
> Universe   = Vanilla
> Executable = helloworld.exe
> should_transfer_files = YES
> when_to_transfer_output = ON_EXIT
> 
> Requirements = (target.Arch == "INTEL") && (target.OpSys == "WINNT51") && 
> (Machine == "SP_1F_02")
> 
> log = $(Cluster)_$(PROCESS)_helloworld.log
> output = $(Cluster)_$(PROCESS)_helloworld.out
> error  = $(Cluster)_$(PROCESS)_helloworld.err
> arguments = "helloworld"
> Queue
> 
> 
> 
> $ condor_q -better-analyze 325
> 
> -- Submitter: condor.dns.org : <115.125.120.71:60614> : condor.dns.org
> ---
> 325.000:  Run analysis summary.  Of 492 machines,
>     490 are rejected by your job's requirements
>       2 reject your job because of their own requirements
>       0 match but are serving users with a better priority in the pool
>       0 match but reject the job for unknown reasons
>       0 match but will not currently preempt their existing job
>       0 match but are currently offline
>       0 are available to run your job
>     Last successful match: Fri Nov 26 11:58:16 2010
>     Last failed match: Fri Nov 26 12:08:07 2010
>     Reason for last match failure: no match found
> 
> The Requirements expression for your job is:
> 
> ( ( target.Arch == "INTEL" ) && ( target.OpSys == "WINNT51" ) &&
> ( target.Machine == "SP_1F_02" ) ) && ( target.Disk >= DiskUsage ) &&
> ( ( ( target.Memory * 1024 ) >= ImageSize ) &&
> ( ( RequestMemory * 1024 ) >= ImageSize ) ) && ( target.HasFileTransfer )
> 
>     Condition                         Machines Matched    Suggestion
>     ---------                         ----------------    ----------
> 1   ( ( ( 1024 * target.Memory ) >= 25 ) &&
>       ( ( 1024 * ceiling(ifThenElse(JobVMMemory isnt undefined,JobVMMemory,2.441406250000000E-02)) ) >= 25 ) )
>                                       0                   REMOVE
> 2   ( target.Machine == "SP_1F_02" )  2                    
> 3   ( target.Arch == "INTEL" )        492                  
> 4   ( target.OpSys == "WINNT51" )     492                  
> 5   ( target.Disk >= 25 )             492                  
> 6   ( target.HasFileTransfer )        492                  
> ==========================
> 
> This tells me "no match found", and the only apparent reason is the 
> mysterious first requirement; otherwise at least 2 machines should be 
> available to run the job.
> 
> What should I do?
> 
> Thanks.
> 
> Rob.