Hi,

I'm running a vanilla universe job on a Windows XP machine. The setup is quite simple: I'm running a FORTRAN program, which is given some input files via transfer_input_files. The program also generates some output files, which are supposed to be returned automatically by Condor. The program runs perfectly from a DOS window, but in Condor...

Here is a possible error, but I don't know what to do about it.
8/26 13:36:06 ******************************************************
8/26 13:36:06 Using config file: C:\Condor\condor_config
8/26 13:36:06 Using local config files: C:\Condor/condor_config.local
8/26 13:36:06 DaemonCore: Command Socket at <192.38.89.192:2627>
8/26 13:36:07 Initializing a VANILLA shadow
8/26 13:36:07 (10.0) (1012): Request to run on <192.38.89.192:1049> was ACCEPTED
8/26 13:43:11 (10.0) (1012): condor_write(): Socket closed when trying to write buffer
8/26 13:43:12 (10.0) (1012): Buf::write(): condor_write() failed
8/26 13:43:12 (10.0) (1012): SECMAN: Error sending response classad!
8/26 13:43:12 (10.0) (1012): condor_write(): Socket closed when trying to write buffer
8/26 13:43:12 (10.0) (1012): Buf::write(): condor_write() failed
8/26 13:43:12 (10.0) (1012): SECMAN: Error sending response classad!
8/26 13:43:13 (10.0) (1012): Job 10.0 terminated: exited with status -1073741515
8/26 13:43:15 (10.0) (1012): **** condor_shadow (condor_SHADOW) EXITING WITH STATUS 100

Here is my Starter Log:
8/26 13:36:07 ******************************************************
8/26 13:36:07 ** condor_starter (CONDOR_STARTER) STARTING UP
8/26 13:36:07 ** C:\Condor\bin\condor_starter.exe
8/26 13:36:07 ** $CondorVersion: 6.6.10 Jun 22 2005 $
8/26 13:36:07 ** $CondorPlatform: INTEL-WINNT50 $
8/26 13:36:07 ** PID = 2704
8/26 13:36:07 ******************************************************
8/26 13:36:07 Using config file: C:\Condor\condor_config
8/26 13:36:08 Using local config files: C:\Condor/condor_config.local
8/26 13:36:08 DaemonCore: Command Socket at <192.38.89.192:2635>
8/26 13:36:08 Setting resource limits not implemented!
8/26 13:36:08 Starter communicating with condor_shadow <192.38.89.192:2627>
8/26 13:36:08 Submitting machine is "Capec_227-224.kt.dtu.dk"
8/26 13:36:22 Suspending all jobs.
8/26 13:36:42 condor_read(): timeout reading buffer.
8/26 13:42:10 Continuing all jobs.
8/26 13:42:30 condor_read(): timeout reading buffer.
8/26 13:43:11 File transfer completed successfully.
8/26 13:43:13 Starting a VANILLA universe job with ID: 10.0
8/26 13:43:13 IWD: C:\Condor/execute\dir_2704
8/26 13:43:13 Output file: C:\Condor/execute\dir_2704\rdftest.out
8/26 13:43:13 Error file: C:\Condor/execute\dir_2704\rdftest.err
8/26 13:43:13 Renice expr "10" evaluated to 10
8/26 13:43:13 About to exec C:\Condor\execute\dir_2704\condor_exec.exe
8/26 13:43:13 Create_Process succeeded, pid=2316
8/26 13:43:13 Process exited, pid=2316, status=-1073741515
8/26 13:43:14 Got SIGQUIT. Performing fast shutdown.
8/26 13:43:14 ShutdownFast all jobs.
8/26 13:43:14 **** condor_starter (condor_STARTER) EXITING WITH STATUS 0
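
A side note on the exit status: -1073741515 in both logs looks like a signed 32-bit value, and Windows status codes are usually looked up in unsigned hex. Below is a minimal sketch in Python of that conversion. The helper names and the two-entry lookup table are my own, for illustration only, and not anything Condor provides. Reading the result as a missing DLL on the execute side is only an interpretation of the code, not something the logs above confirm.

```python
# Reinterpret a signed 32-bit process exit status as an unsigned
# Windows NTSTATUS value so it can be looked up in hex.

# Small illustrative table (hypothetical helper data, far from complete).
KNOWN_NTSTATUS = {
    0xC0000005: "STATUS_ACCESS_VIOLATION",
    0xC0000135: "STATUS_DLL_NOT_FOUND",
}


def to_ntstatus_hex(exit_status: int) -> str:
    """Return the exit status as an 8-digit unsigned hex string."""
    code = exit_status & 0xFFFFFFFF  # wrap negative values into 0..2**32-1
    return f"0x{code:08X}"


def describe_exit_status(exit_status: int) -> str:
    """Return the hex form plus a symbolic name when the code is in the table."""
    code = exit_status & 0xFFFFFFFF
    name = KNOWN_NTSTATUS.get(code, "unknown")
    return f"{to_ntstatus_hex(exit_status)} ({name})"


if __name__ == "__main__":
    # Status reported by the shadow and the starter for job 10.0
    print(describe_exit_status(-1073741515))
```

If the code does turn out to mean a DLL could not be found, one thing to check would be whether the FORTRAN runtime libraries the program needs are reachable from the execute directory, since only the files listed in transfer_input_files are copied there.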