Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Condor-users] execute node choice and disk parameter
- Date: Fri, 12 Mar 2010 11:40:35 -0800
- From: kristian kvilekval <kris@xxxxxxxxxxx>
- Subject: [Condor-users] execute node choice and disk parameter
I've set up a small condor test of a head node with two slave execute
nodes.
Unfortunately I can't seem to get anything scheduled on the slaves
as they are not chosen due to TARGET.Disk > 1500 parameter. See log
below.
These execute nodes are booted remotely and use a NFS root,
however they do have access to a shared NFS filesystem, and
each has a 500G scratch disk.
I have configured the following on both head and the slave
UID_DOMAIN = ece.ucsb.edu
FILESYSTEM_DOMAIN = saw.ece.ucsb.edu
COLLECTOR_NAME = CBI UCSB
USE_NFS = True
Is it possible to change how condor calculates available disk space
on the slave execute nodes?
Thanks,
Kris
--------------------------------------------------------------
$ condor_status
Name OpSys Arch State Activity LoadAv Mem ActvtyTime
slot1@b0001 LINUX X86_64 Unclaimed Idle 0.000 992 0+00:40:04
slot2@b0001 LINUX X86_64 Unclaimed Idle 0.000 992 0+16:40:28
slot3@b0001 LINUX X86_64 Unclaimed Idle 0.000 992 0+16:40:29
slot4@b0001 LINUX X86_64 Unclaimed Idle 0.000 992 0+16:40:30
slot1@b0002 LINUX X86_64 Unclaimed Idle 0.000 992 0+00:40:04
slot2@b0002 LINUX X86_64 Unclaimed Idle 0.000 992 0+16:40:28
slot3@b0002 LINUX X86_64 Unclaimed Idle 0.000 992 0+16:40:29
slot4@b0002 LINUX X86_64 Unclaimed Idle 0.000 992 0+16:40:30
slot1@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle 0.000 1007 0+00:13:08
slot2@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle 0.100 1007 0+00:13:15
Total Owner Claimed Unclaimed Matched Preempting Backfill
X86_64/LINUX 10 0 0 10 0 0 0
Total 10 0 0 10 0 0 0
kgk@claw$ condor_q
-- Submitter: claw.ece.ucsb.edu : <128.111.60.123:41670> : claw.ece.ucsb.edu
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
0 jobs; 0 idle, 0 running, 0 held
kgk@claw$ condor_submit eattime.cmd
Submitting job(s)......
Logging submit event(s)......
6 job(s) submitted to cluster 14.
kgk@claw$ condor_q -ana 14.0
-- Submitter: claw.ece.ucsb.edu : <128.111.60.123:41670> : claw.ece.ucsb.edu
---
014.000: Run analysis summary. Of 10 machines,
10 are rejected by your job's requirements
0 reject your job because of their own requirements
0 match but are serving users with a better priority in the pool
0 match but reject the job for unknown reasons
0 match but will not currently preempt their existing job
0 match but are currently offline
0 are available to run your job
WARNING: Be advised:
No resources matched request's constraints
The Requirements expression for your job is:
( TARGET.Arch == "X86_64" ) && ( TARGET.OpSys == "LINUX" ) &&
( TARGET.Disk >= DiskUsage ) && ( ( ( TARGET.Memory * 1024 ) >= ImageSize ) &&
( ( RequestMemory * 1024 ) >= ImageSize ) ) &&
( TARGET.FileSystemDomain == MY.FileSystemDomain )
Condition Machines Matched Suggestion
--------- ---------------- ----------
1 ( TARGET.Disk >= 1500 ) 2 MODIFY TO 0
2 ( TARGET.FileSystemDomain == "saw.ece.ucsb.edu" )
8
3 ( TARGET.Arch == "X86_64" ) 10
4 ( TARGET.OpSys == "LINUX" ) 10
5 ( ( ( 1024 * TARGET.Memory ) >= 1500 ) && ( ( 1024 * ceiling(ifThenElse(JobVMMemory isnt undefined,JobVMMemory,1.464843750000000E+00)) ) >= 1500 ) )
10
Conflicts:
conditions: 1, 2
kgk@claw$ condor_status
Name OpSys Arch State Activity LoadAv Mem ActvtyTime
slot1@b0001 LINUX X86_64 Unclaimed Idle 0.000 992 0+00:40:04
slot2@b0001 LINUX X86_64 Unclaimed Idle 0.000 992 0+16:40:28
slot3@b0001 LINUX X86_64 Unclaimed Idle 0.000 992 0+16:40:29
slot4@b0001 LINUX X86_64 Unclaimed Idle 0.000 992 0+16:40:30
slot1@b0002 LINUX X86_64 Unclaimed Idle 0.000 992 0+00:40:04
slot2@b0002 LINUX X86_64 Unclaimed Idle 0.000 992 0+16:40:28
slot3@b0002 LINUX X86_64 Unclaimed Idle 0.000 992 0+16:40:29
slot4@b0002 LINUX X86_64 Unclaimed Idle 0.000 992 0+16:40:30
slot1@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle 0.000 1007 0+03:50:04
slot2@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle -1.000 1007 0+19:50:20
Total Owner Claimed Unclaimed Matched Preempting Backfill
X86_64/LINUX 10 0 0 10 0 0 0
Total 10 0 0 10 0 0 0