This is a known bug that was fixed in 8.4.4.
From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of 刘岚 <liulan199331@xxxxxxx>
Sent: Monday, February 29, 2016 2:57 AM To: HTCondor-Users Mail List Subject: [HTCondor-users] Condor Collector died Hi all,
When I use "condor_restart" to restart condor, the collector always got something wrong and got restarted automatically, then it can work properly.
The error displayed in MasterLog said:
The COLLECTOR died due to signal 11 (Segmentation fault)
And the information in CollectorLog said:
02/25/16 08:21:35 CollectorAd : Inserting ** "< My Pool - 10.1.1.103@xxxxxxxxxx >" 02/25/16 08:21:35 stats: Inserting new hashent for 'Collector':'My Pool - 10.1.1.103@xxxxxxxxxx':'10.1.1.103' Stack dump for process 1462089 at timestamp 1456406515 (11 frames) /usr/local/condor/sbin/../lib/libcondor_utils_8_4_3.so(dprintf_dump_stack+0x12d)[0x7ffccdb2211d] /usr/local/condor/sbin/../lib/libcondor_utils_8_4_3.so(_Z18linux_sig_coredumpi+0x40)[0x7ffccdc723c0] /lib64/libpthread.so.0(+0xf790)[0x7ffccc116790] /lib64/libc.so.6(+0x1376fa)[0x7ffccbeaa6fa] /usr/local/condor/sbin/../lib/libcondor_utils_8_4_3.so(_ZN11DCCollector10sendUpdateEiPN14compat_classad7ClassAdER22DCCollectorAdSequencesS2_b+0x292)[0x7ffccdc4d5b2] condor_collector(_ZN15CollectorDaemon15sendCollectorAdEv+0x4a4)[0x456074] /usr/local/condor/sbin/../lib/libcondor_utils_8_4_3.so(_ZN12TimerManager7TimeoutEPiPd+0x3d3)[0x7ffccdc7bb03] /usr/local/condor/sbin/../lib/libcondor_utils_8_4_3.so(_ZN10DaemonCore6DriverEv+0xc03)[0x7ffccdc8ce33] /usr/local/condor/sbin/../lib/libcondor_utils_8_4_3.so(_Z7dc_mainiPPc+0x1799)[0x7ffccdc74559] /lib64/libc.so.6(__libc_start_main+0xfd)[0x7ffccbd91d5d] condor_collector[0x40eab9]
Any help will be appreciated. Kathy
|