Hi,
In the log file I get the following report:
012 (016.006.000) 08/14 07:18:57 Job was held.
The job attribute OnExitRemove expression '(ExitCode != -30) && (ExitCode != -40) && (ExitCode != -20)' evaluated to UNDEFINED
Code 5 Subcode 0
The exit codes -30, -40 and -20 are internal error codes of our program
(e.g. license error, ...). If such an error occurs, the job should be reset to idle
again instead of being held or finished.
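To make the reported behaviour easier to reason about, here is a minimal Python sketch of ClassAd-style three-valued logic applied to the expression from the log. It is an illustration only, not HTCondor code: the names UNDEFINED, not_equal, and3 and on_exit_remove are hypothetical, and the idea that the held jobs' ads had no ExitCode attribute when the expression was evaluated (for example because the job did not exit normally) is an assumption that the log does not confirm.

```python
# Illustration of ClassAd-style three-valued evaluation; not HTCondor code.
UNDEFINED = object()  # stand-in for the ClassAd UNDEFINED value


def not_equal(ad, attr, value):
    """Model `attr != value`: UNDEFINED when the attribute is missing from the ad."""
    if attr not in ad:
        return UNDEFINED
    return ad[attr] != value


def and3(*operands):
    """Model `&&`: any False gives False, otherwise UNDEFINED propagates."""
    if any(op is False for op in operands):
        return False
    if any(op is UNDEFINED for op in operands):
        return UNDEFINED
    return True


def on_exit_remove(ad):
    """(ExitCode != -30) && (ExitCode != -40) && (ExitCode != -20)"""
    return and3(*(not_equal(ad, "ExitCode", code) for code in (-30, -40, -20)))


# Normal exit: the job may leave the queue.
print(on_exit_remove({"ExitCode": 0}) is True)
# Internal error code: the expression is False, so the job is not removed.
print(on_exit_remove({"ExitCode": -30}) is False)
# Assumed case: no ExitCode in the ad, so every comparison is UNDEFINED.
print(on_exit_remove({"ExitBySignal": True}) is UNDEFINED)
```

Under that assumption, the third case is the one that would match the hold message, since no comparison is False and at least one is UNDEFINED.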
The condor submit file looks as follows (I have pasted only the global entries
and the entries for the last job, job 6):
Executable = C:\Program Files\Inpho\ApplicationsMaster 5.2\bin\cjm.exe
Requirements = (Inpho_ApplicationsMaster52_Directory =!= UNDEFINED && Inpho_ApplicationsMaster52_Installed =?= True && Inpho_ApplicationsMaster52_OrthoMaster_Installed =?= True) \
&& (OpSys == "WINNT50" || OpSys == "WINNT51" || OpSys == "WINNT52" || OpSys == "WINNT60") \
&& (Arch == "INTEL" || Arch == "IA64" || Arch == "X86_64") \
&& (Name == "slot1@xxxxxxxxxxxxxxxxxxx" || Name == "slot2@xxxxxxxxxxxxxxxxxxx" || Name == "slot3@xxxxxxxxxxxxxxxxxxx" || Name == "slot4@xxxxxxxxxxxxxxxxxxx" || Name == "slot5@xxxxxxxxxxxxxxxxxxx" || Name == "slot6@xxxxxxxxxxxxxxxxxxx" || Name == "slot7@xxxxxxxxxxxxxxxxxxx" || Name == "slot8@xxxxxxxxxxxxxxxxxxx")
Priority = 0
Universe = Vanilla
Getenv = True
Output = BLOK14_09-08-2009-rest_$(Cluster)_$(Process).out
Error = BLOK14_09-08-2009-rest_$(Cluster)_$(Process).out
Log = BLOK14_09-08-2009-rest_$(Cluster)_$(Process).log
should_transfer_files = True
when_to_transfer_output = ON_EXIT
transfer_input_files = C:\Program Files\Inpho\ApplicationsMaster 5.2\bin\QtCore4.dll, C:\Program Files\Inpho\ApplicationsMaster 5.2\bin\mapdrive.bat
Environment = ERMAPPER=dummy
...
Leave_in_queue = True
!= -30) && (ExitCode != -40) && (ExitCode != -20)
Initialdir = \\inas\data\NL2009-10cm\02-process_files\024-inpho-ortho\fullres\blok14_13-08-09-rest\job6
Arguments = " -exe OM -exedir $$(Inpho_ApplicationsMaster52_Directory) -map 'mapdrive.bat' -prj '\\inas\data\NL2009-10cm\02-process_files\024-inpho-ortho\fullres\blok14_13-08-09-rest\job6\BLOK14_09-08-2009-rest_16.prj'"
+WantIOProxy = True
+EstimatedJobTime = 100
+ElapsedJobTime = 0
Queue
Jobs 2 and 6 were held, although both had already processed some data. Any idea what is going wrong?
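The reply quoted below asks what condor_q -long shows for the held job. As a rough sketch of that check, the Python helper here parses "Attr = value" lines of the kind condor_q -long prints and lists the attributes an expression refers to that are absent from the ad. The helper names parse_long_ad and missing_attributes are hypothetical, and the sample ad text is invented for illustration; it is not real output from our pool.

```python
import re

# Words that are ClassAd literals rather than attribute names.
LITERALS = {"true", "false", "undefined", "error"}


def parse_long_ad(text):
    """Parse 'Attr = value' lines into a dict keyed by lower-cased attribute name."""
    ad = {}
    for line in text.splitlines():
        name, sep, value = line.partition(" = ")
        if sep and name.strip():
            # ClassAd attribute names are case-insensitive, so normalise the key.
            ad[name.strip().lower()] = value.strip()
    return ad


def missing_attributes(expression, ad):
    """Return attribute names used in the expression that the ad does not define.

    Simplification: any identifier-like word counts as an attribute reference,
    so string literals inside the expression are not handled.
    """
    missing = []
    for name in re.findall(r"[A-Za-z_][A-Za-z0-9_]*", expression):
        key = name.lower()
        if key not in LITERALS and key not in ad and name not in missing:
            missing.append(name)
    return missing


EXPRESSION = "(ExitCode != -30) && (ExitCode != -40) && (ExitCode != -20)"

# Invented sample ad, standing in for the text printed by condor_q -long.
SAMPLE_AD = """\
ClusterId = 16
ProcId = 6
JobStatus = 5
"""

print(missing_attributes(EXPRESSION, parse_long_ad(SAMPLE_AD)))
```

If the real ad for a held job turns out to lack ExitCode, that would be consistent with the expression evaluating to UNDEFINED; if ExitCode is present, the cause lies elsewhere.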

-----Original Message-----
> From: condor-users-bounces@xxxxxxxxxxx [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Ian Chesal
> Sent: Monday, 17 August 2009 04:40
> To: Condor-Users Mail List
> Subject: Re: [Condor-users] OnExitRemove expression evaluted to UNDEFINED(Code5 Subcode 0)
>
> > I am running into the problem that some jobs (not all) go into Hold
> > status with the following log message:
> >
> > >> The job attribute OnExitRemove expression evaluated to UNDEFINED with
> > >> Code 5 and Subcode 0
> >
> > Can anybody help me with this problem?
>
> What does your OnExitRemove expression look like? And for the job in
> question, what does condor_q -long for the job show?
> Hard to say why the expression evaluated to undefined without seeing the
> expression and the attributes that were subbed in to it at eval time.
>
> Warm regards,
> - Ian