We're running a linux pool with Condor 6.6.11 and we persistently
see a
number of vanilla jobs whose Starter keeps dying with (from the
StartLog):
7/17 08:07:03 Starter pid 16900 died on signal 11 (signal 11)
7/17 08:07:03 vm1: State change: starter exited
The StarterLog shows nothing, even with full debug turned on. The jobs
then keep resubmitting themselves to die a similar death. As far as I
can tell this is the daemon itself dying, not the application that its
running (which runs fine from the console). We're using the
dynamically
linked binaries under Debian "etch". Can anyone shed any light why
this
should be happening, and more importantly how we can fix it?