Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[condor-users] condor_starter ERROR
- Date: Fri, 31 Oct 2003 19:56:58 +0000 (GMT)
- From: Ralf Schmid <ralf@xxxxxxxxxxxx>
- Subject: [condor-users] condor_starter ERROR
Hi,
I'm getting this error in the starter logfiles:
<SNIP>
10/31 19:33:06 ******************************************************
10/31 19:33:06 ** condor_starter (CONDOR_STARTER) STARTING UP
10/31 19:33:06 ** $CondorVersion: 6.5.3 Jul 2 2003 $
10/31 19:33:06 ** $CondorPlatform: INTEL-LINUX-GLIBC23 $
10/31 19:33:06 ** PID = 9060
10/31 19:33:06 ******************************************************
10/31 19:33:07 Using config file: /users/condor/condor_config
10/31 19:33:07 Using local config files:
/users/condor/hosts/leo3/condor_config.local
10/31 19:33:07 DaemonCore: Command Socket at <192.168.0.12:35075>
10/31 19:33:07 Done setting resource limits
10/31 19:33:07 ERROR "Assertion ERROR on (result)" at line 148 in file
NTsenders.C
10/31 19:33:07 ShutdownFast all jobs.
<SNIP>
The respective StartLog ...
<SNIP>
10/31 19:03:10 DaemonCore: Command received via TCP from host
<192.168.0.39:59408>
10/31 19:03:10 DaemonCore: received command 444 (ACTIVATE_CLAIM), calling
handler (command_activate_claim)
10/31 19:03:10 vm1: Got activate_claim request from shadow
(<192.168.0.39:59408>)
10/31 19:03:10 vm1: Remote job ID is 427.0
10/31 19:03:10 vm1: Got universe "VANILLA" (5) from request classad
10/31 19:03:10 vm1: State change: claim-activation protocol successful
10/31 19:03:10 vm1: Changing activity: Idle -> Busy
10/31 19:03:10 Starter pid 24666 exited with status 4
10/31 19:03:10 vm1: State change: starter exited
10/31 19:03:10 vm1: Changing activity: Busy -> Idle
<SNIP>
The shadow logfiles in the submission machine looks OK to me :
<SNIP>
10/31 20:03:21 ******************************************************
10/31 20:03:21 ** condor_shadow (CONDOR_SHADOW) STARTING UP
10/31 20:03:21 ** $CondorVersion: 6.5.3 Jul 2 2003 $
10/31 20:03:21 ** $CondorPlatform: INTEL-LINUX-GLIBC23 $
10/31 20:03:21 ** PID = 21575
10/31 20:03:21 ******************************************************
10/31 20:03:21 Using config file: /users/condor/condor_config
10/31 20:03:21 Using local config files:
/users/condor/hosts/convoluta/condor_config.local
10/31 20:03:21 DaemonCore: Command Socket at <192.168.0.39:59092>
10/31 20:03:22 Initializing a VANILLA shadow
10/31 20:03:22 (427.5) (21575): Request to run on <192.168.0.13:32773>
was ACCEPTED
<SNIP>
so the nodes are claimed, but stay idle. Any ideas what is going wrong?
Thanks in advance,
Ralf
=======================================
Dr. Ralf Schmid
ICAPB
School of Biological Sciences
The University of Edinburgh
King's Buildings
Ashworth Building
Edinburgh EH9 3JW
Condor Support Information:
http://www.cs.wisc.edu/condor/condor-support/
To Unsubscribe, send mail to majordomo@xxxxxxxxxxx with
unsubscribe condor-users <your_email_address>