Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Condor-users] New setup, take 2
- Date: Mon, 9 Aug 2004 16:06:59 -0400
- From: bfargo@xxxxxxxxxxxxxx
- Subject: Re: [Condor-users] New setup, take 2
Hello this is from the condor on manual page
located here
"
condor_ on starts up a set of the Condor daemons on a set of machines. This command assumes that the condor_ master is already running on the machine. If this is not the case, condor_ on will fail complaining that it cannot find the address of the master. The command condor_ on with no arguments will tell the condor_ master to start up the Condor daemons specified in the configuration variable DAEMON_LIST. If a daemon is specified as an argument, condor_ on starts up only that daemon.
This command cannot be used to start up the condor_ master daemon.
"
try using the command condor_master to start up the daemons in linux. While I am a new condor user myself. That is what I used to start up my RH8 box.
If I'm wrong somebody will surely correct me.
Hope this works
-Bill
>
> From: Mike Frederick <Mike@xxxxxxxxxx>
> Date: 2004/08/09 Mon PM 03:12:51 EDT
> To: "'condor-users@xxxxxxxxxxx'" <condor-users@xxxxxxxxxxx>
> Subject: [Condor-users] New setup, take 2
>
> All,
>
> Thanks for the previous help. As I said, I'm brand new to both Linux and
> Condor...
>
> I am attempting to prove that we could use Condor in our environment here.
> What I need to do is get a setup so that I can from a Windows 2000 box run
> jobs in a pool on itself, another Windows box, a RedHat box and a Solaris
> box. I've got the Condor_Master on the first Windows box up and running
> (and accepting/running jobs). Now I am working a RedHat box issue. I have
> installed the RedHat box, run condor_config and thought I had setup
> correctly. When I try to run condor_on the Red Hat 9 box I get:
>
> "Can't connect to local master"
>
> even though from the RH box I can ping the local master (the Windows box).
> The RedHat box does not show up on the master when I run condor_status.
>
> I am attaching the RedHat condor_config file. Please let me know what I can
> do to get a little further along in this test. And if you need any other
> info let me know that too. TIA!
>
> Red Hat 9 condor_config file:
>
> CONDOR_HOST = elpin.nuview.com
> RELEASE_DIR = /opt/condor-6.6.6
> LOCAL_DIR = $(RELEASE_DIR)/home
> LOCAL_CONFIG_FILE = $(LOCAL_DIR)/condor_config.local
> CONDOR_ADMIN = mike@xxxxxxxxxx <mailto:mike@xxxxxxxxxx>
> MAIL = /bin/mail
> UID_DOMAIN = $(FULL_HOSTNAME)
> FILESYSTEM_DOMAIN = $(FULL_HOSTNAME)
> FLOCK_FROM =
> FLOCK_TO =
> FLOCK_NEGOTIATOR_HOSTS = $(FLOCK_TO)
> FLOCK_COLLECTOR_HOSTS = $(FLOCK_TO)
> HOSTALLOW_ADMINISTRATOR = $(CONDOR_HOST)
> HOSTALLOW_OWNER = $(FULL_HOSTNAME), $(HOSTALLOW_ADMINISTRATOR)
> HOSTALLOW_READ = *
> HOSTALLOW_WRITE = *
> HOSTALLOW_NEGOTIATOR = $(NEGOTIATOR_HOST)
> HOSTALLOW_NEGOTIATOR_SCHEDD = $(NEGOTIATOR_HOST), $(FLOCK_NEGOTIATOR_HOSTS)
> HOSTALLOW_WRITE_COLLECTOR = $(HOSTALLOW_WRITE), $(FLOCK_FROM)
> HOSTALLOW_WRITE_STARTD = $(HOSTALLOW_WRITE), $(FLOCK_FROM)
> HOSTALLOW_READ_COLLECTOR = $(HOSTALLOW_READ), $(FLOCK_FROM)
> HOSTALLOW_READ_STARTD = $(HOSTALLOW_READ), $(FLOCK_FROM)
> LOCK = $(LOG)
> DEFAULT_DOMAIN_NAME = nuview.com
> GLIDEIN_SERVER_NAME = gridftp.cs.wisc.edu
> GLIDEIN_SERVER_DIR = /p/condor/public/binaries/glidein
> ALL_DEBUG =
> MAX_COLLECTOR_LOG = 1000000
> COLLECTOR_DEBUG =
> MAX_KBDD_LOG = 1000000
> KBDD_DEBUG =
> MAX_NEGOTIATOR_LOG = 1000000
> NEGOTIATOR_DEBUG = D_MATCH
> MAX_NEGOTIATOR_MATCH_LOG = 1000000
> MAX_SCHEDD_LOG = 1000000
> SCHEDD_DEBUG = D_COMMAND
> MAX_SHADOW_LOG = 1000000
> SHADOW_DEBUG =
> MAX_STARTD_LOG = 1000000
> STARTD_DEBUG = D_COMMAND
> MAX_STARTER_LOG = 1000000
> STARTER_DEBUG = D_NODATE
> MAX_MASTER_LOG = 1000000
> MASTER_DEBUG = D_COMMAND
> MINUTE = 60
> HOUR = (60 * $(MINUTE))
> StateTimer = (CurrentTime - EnteredCurrentState)
> ActivityTimer = (CurrentTime - EnteredCurrentActivity)
> ActivationTimer = (CurrentTime - JobStart)
> LastCkpt = (CurrentTime - LastPeriodicCheckpoint)
> STANDARD = 1
> PVM = 4
> VANILLA = 5
> MPI = 8
> IsPVM = (TARGET.JobUniverse == $(PVM))
> IsMPI = (TARGET.JobUniverse == $(MPI))
> IsVanilla = (TARGET.JobUniverse == $(VANILLA))
> IsStandard = (TARGET.JobUniverse == $(STANDARD))
> SmallJob = (TARGET.ImageSize < (15 * 1024))
> NonCondorLoadAvg = (LoadAvg - CondorLoadAvg)
> BackgroundLoad = 0.3
> HighLoad = 0.5
> StartIdleTime = 15 * $(MINUTE)
> ContinueIdleTime = 5 * $(MINUTE)
> MaxSuspendTime = 10 * $(MINUTE)
> MaxVacateTime = 10 * $(MINUTE)
> KeyboardBusy = (KeyboardIdle < $(MINUTE))
> ConsoleBusy = (ConsoleIdle < $(MINUTE))
> CPUIdle = ($(NonCondorLoadAvg) <= $(BackgroundLoad))
> CPUBusy = ($(NonCondorLoadAvg) >= $(HighLoad))
> KeyboardNotBusy = ($(KeyboardBusy) == False)
> BigJob = (TARGET.ImageSize >= (50 * 1024))
> MediumJob = (TARGET.ImageSize >= (15 * 1024) && TARGET.ImageSize < (50
> * 1024))
> SmallJob = (TARGET.ImageSize < (15 * 1024))
> JustCPU = ($(CPUBusy) && ($(KeyboardBusy) == False))
> MachineBusy = ($(CPUBusy) || $(KeyboardBusy))
> WANT_SUSPEND = $(UWCS_WANT_SUSPEND)
> WANT_VACATE = $(UWCS_WANT_VACATE)
> START = TRUE
> SUSPEND = $(UWCS_SUSPEND)
> CONTINUE = $(UWCS_CONTINUE)
> PREEMPT = $(UWCS_PREEMPT)
> KILL = $(UWCS_KILL)
> PERIODIC_CHECKPOINT = $(UWCS_PERIODIC_CHECKPOINT)
> PREEMPTION_REQUIREMENTS = $(UWCS_PREEMPTION_REQUIREMENTS)
> PREEMPTION_RANK = $(UWCS_PREEMPTION_RANK)
> UWCS_WANT_SUSPEND = ( $(SmallJob) || $(KeyboardNotBusy) \
> || $(IsPVM) || $(IsVanilla) )
> UWCS_WANT_VACATE = ( $(ActivationTimer) > 10 * $(MINUTE) \
> || $(IsPVM) || $(IsVanilla) )
> UWCS_START = ( (KeyboardIdle > $(StartIdleTime)) \
> && ( $(CPUIdle) || \
> (State != "Unclaimed" && State != "Owner")) )
> UWCS_SUSPEND = ( $(KeyboardBusy) || \
> ( (CpuBusyTime > 2 * $(MINUTE)) \
> && $(ActivationTimer) > 90 ) )
> UWCS_CONTINUE = ( $(CPUIdle) && ($(ActivityTimer) > 10) \
> && (KeyboardIdle > $(ContinueIdleTime)) )
> UWCS_PREEMPT = ( ((Activity == "Suspended") && \
> ($(ActivityTimer) > $(MaxSuspendTime))) \
> || (SUSPEND && (WANT_SUSPEND == False)) )
> UWCS_KILL = $(ActivityTimer) > $(MaxVacateTime)
> UWCS_PERIODIC_CHECKPOINT = $(LastCkpt) > (3 * $(HOUR))
> UWCS_PREEMPTION_REQUIREMENTS = $(StateTimer) > (1 * $(HOUR)) &&
> RemoteUserPrio > SubmittorPrio * 1.2
> UWCS_PREEMPTION_RANK = (RemoteUserPrio * 1000000) - TARGET.ImageSize
> TESTINGMODE_WANT_SUSPEND = False
> TESTINGMODE_WANT_VACATE = False
> TESTINGMODE_START = True
> TESTINGMODE_SUSPEND = False
> TESTINGMODE_CONTINUE = True
> TESTINGMODE_PREEMPT = False
> TESTINGMODE_KILL = False
> TESTINGMODE_PERIODIC_CHECKPOINT = False
> TESTINGMODE_PREEMPTION_REQUIREMENTS = False
> TESTINGMODE_PREEMPTION_RANK = 0
> LOG = $(LOCAL_DIR)/log
> SPOOL = $(LOCAL_DIR)/spool
> EXECUTE = $(LOCAL_DIR)/execute
> BIN = $(RELEASE_DIR)/bin
> LIB = $(RELEASE_DIR)/lib
> SBIN = $(RELEASE_DIR)/sbin
> HISTORY = $(SPOOL)/history
> COLLECTOR_LOG = $(LOG)/CollectorLog
> KBDD_LOG = $(LOG)/KbdLog
> MASTER_LOG = $(LOG)/MasterLog
> NEGOTIATOR_LOG = $(LOG)/NegotiatorLog
> NEGOTIATOR_MATCH_LOG = $(LOG)/MatchLog
> SCHEDD_LOG = $(LOG)/SchedLog
> SHADOW_LOG = $(LOG)/ShadowLog
> STARTD_LOG = $(LOG)/StartLog
> STARTER_LOG = $(LOG)/StarterLog
> SHADOW_LOCK = $(LOCK)/ShadowLock
> COLLECTOR_HOST = $(CONDOR_HOST)
> NEGOTIATOR_HOST = $(CONDOR_HOST)
> RESERVED_DISK = 5
> DAEMON_LIST = MASTER, STARTD, SCHEDD
> MASTER = $(SBIN)/condor_master
> STARTD = $(SBIN)/condor_startd
> SCHEDD = $(SBIN)/condor_schedd
> KBDD = $(SBIN)/condor_kbdd
> NEGOTIATOR = $(SBIN)/condor_negotiator
> COLLECTOR = $(SBIN)/condor_collector
> GRID_MONITOR = $(SBIN)/grid_monitor.sh
> MASTER_ADDRESS_FILE = $(LOG)/.master_address
> PREEN = $(SBIN)/condor_preen
> PREEN_ARGS = -m -r
> STARTER_LIST = STARTER, STARTER_PVM, STARTER_STANDARD
> STARTER = $(SBIN)/condor_starter
> STARTER_PVM = $(SBIN)/condor_starter.pvm
> STARTER_STANDARD = $(SBIN)/condor_starter.std
> STARTD_ADDRESS_FILE = $(LOG)/.startd_address
> BenchmarkTimer = (CurrentTime - LastBenchmark)
> RunBenchmarks : (LastBenchmark == 0 ) || ($(BenchmarkTimer) >= (4 *
> $(HOUR)))
> CONSOLE_DEVICES = mouse, console
> COLLECTOR_HOST_STRING = "$(COLLECTOR_HOST)"
> STARTD_EXPRS = COLLECTOR_HOST_STRING
> STARTD_JOB_EXPRS = ImageSize, ExecutableSize, JobUniverse, NiceUser
> SHADOW_LIST = SHADOW, SHADOW_PVM, SHADOW_STANDARD
> SHADOW = $(SBIN)/condor_shadow
> SHADOW_PVM = $(SBIN)/condor_shadow.pvm
> SHADOW_STANDARD = $(SBIN)/condor_shadow.std
> SCHEDD_ADDRESS_FILE = $(LOG)/.schedd_address
> SHADOW_SIZE_ESTIMATE = 1800
> SHADOW_RENICE_INCREMENT = 10
> PERIODIC_EXPR_INTERVAL = 60
> QUEUE_SUPER_USERS = root, condor
> PVMD = $(SBIN)/condor_pvmd
> PVMGS = $(SBIN)/condor_pvmgs
> VALID_SPOOL_FILES = job_queue.log, job_queue.log.tmp, history, \
> Accountant.log, Accountantnew.log
> INVALID_LOG_FILES = core
> JAVA =
> JAVA_MAXHEAP_ARGUMENT =
> JAVA_CLASSPATH_DEFAULT = $(LIB) $(LIB)/scimark2lib.jar .
> JAVA_CLASSPATH_ARGUMENT = -classpath
> JAVA_CLASSPATH_SEPARATOR = :
> JAVA_BENCHMARK_TIME = 2
> JAVA_EXTRA_ARGUMENTS =
> GRIDMANAGER = $(SBIN)/condor_gridmanager
> GAHP = $(SBIN)/gahp_server
> MAX_GRIDMANAGER_LOG = 1000000
> GRIDMANAGER_DEBUG = D_COMMAND
> GRIDMANAGER_LOG = /tmp/GridmanagerLog.$(USERNAME)
> CRED_MIN_TIME_LEFT = 120
>
>
>
>
_______________________________________________
Condor-users mailing list
Condor-users@xxxxxxxxxxx
http://lists.cs.wisc.edu/mailman/listinfo/condor-users