Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[HTCondor-users] Job Submission in parallel Universe
- Date: Sat, 13 Apr 2013 16:56:15 +0500
- From: Usman Khan <muakrules@xxxxxxxx>
- Subject: [HTCondor-users] Job Submission in parallel Universe
I do add these entries in /etc/condor/condor_config.local
DedicatedScheduler = “DedicatedScheduler@muak@submit.foo.com”
muak is my user in linux
STARTD_EXPRS = $(START_EXPRS), DedicatedScheduler
And confirm with this command
$ condor_status –const ‘DedicatedScheduler =!= NULL’
But it wont show anything...
Once I tried job completely executed but didn't give output.
Here is my condor_config.local and log file
Please do check and guide me where did I do anything wrong...
I take guidence from presentation in this link
http://www.docstoc.com/docs/43721750/Condor-Parallel-Universe-European-Condor-Week-2008-Barcelona
000 (018.000.000) 04/09 23:32:14 Job submitted from host: <10.0.0.1:54389>
...
014 (018.000.000) 04/09 23:32:16 Node 0 executing on host: <10.0.0.1:55426>
...
001 (018.000.000) 04/09 23:32:16 Job executing on host: MPI_job
...
015 (018.000.000) 04/09 23:32:16 Node 0 terminated.
(1) Normal termination (return value 127)
Usr 0 00:00:00, Sys 0 00:00:00 - Run Remote Usage
Usr 0 00:00:00, Sys 0 00:00:00 - Run Local Usage
Usr 0 00:00:00, Sys 0 00:00:00 - Total Remote Usage
Usr 0 00:00:00, Sys 0 00:00:00 - Total Local Usage
0 - Run Bytes Sent By Node
0 - Run Bytes Received By Node
0 - Total Bytes Sent By Node
0 - Total Bytes Received By Node
...
005 (018.000.000) 04/09 23:32:17 Job terminated.
(1) Normal termination (return value 127)
Usr 0 00:00:00, Sys 0 00:00:00 - Run Remote Usage
Usr 0 00:00:00, Sys 0 00:00:00 - Run Local Usage
Usr 0 00:00:00, Sys 0 00:00:00 - Total Remote Usage
Usr 0 00:00:00, Sys 0 00:00:00 - Total Local Usage
0 - Run Bytes Sent By Job
0 - Run Bytes Received By Job
0 - Total Bytes Sent By Job
0 - Total Bytes Received By Job
Partitionable Resources : Usage Request
Cpus : 1
Disk (KB) : 10 10
Memory (MB) : 0 0
...
000 (019.000.000) 04/10 11:45:26 Job submitted from host: <10.0.0.1:47407>
...
## What machine is your central manager?
#CONDOR_HOST = $(FULL_HOSTNAME)
CONDOR_HOST = pucitServer.CentOSWorld.com
DedicatedScheduler = "DedicatedScheduler@muak@pucitServer.CentOSWorld.com"
STARTD_EXPRS = $(START_EXPRS), DedicatedScheduler
## Pool's short description
COLLECTOR_NAME = Personal Condor at $(FULL_HOSTNAME)
## When is this machine willing to start a job?
START = TRUE
## When to suspend a job?
SUSPEND = FALSE
## When to nicely stop a job?
## (as opposed to killing it instantaneously)
PREEMPT = FALSE
## When to instantaneously kill a preempting job
## (e.g. if a job is in the pre-empting stage for too long)
KILL = FALSE
## This macro determines what daemons the condor_master will start and keep its watchful eyes on.
## The list is a comma or space separated list of subsystem names
DAEMON_LIST = COLLECTOR, MASTER, NEGOTIATOR, SCHEDD, STARTD