Re: [HTCondor-users] CREAM error: Failed to start gahp
- Date: Fri, 02 Oct 2015 09:10:03 -0500
- From: Brian Bockelman <bbockelm@xxxxxxxxxxx>
- Subject: Re: [HTCondor-users] CREAM error: Failed to start gahp
Hi Jose,
Yeah, there have been packaging problems since approximately 8.2.3.
I reported something similar here:
https://htcondor-wiki.cs.wisc.edu/index.cgi/tktview?tn=5021
(and GRIF reported it also in Dec 2014, but that doesn't appear to have made it from email to the ticket system).
Unfortunately, I didn't notice that the fix in gt5021 didn't actually fix the problem.
There is a relatively simple work-around. Try adding this file:
# cat /etc/ld.so.conf.d/condor.conf
/usr/lib64/condor
and running ldconfig. That will cause the system loader to add /usr/lib64/condor to its default library search path.
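
A minimal sketch of the work-around as a script, assuming the paths quoted in this message. The function names write_condor_ldconf and missing_libs are hypothetical helpers made up for illustration, not part of HTCondor. The target directory is a parameter only so the first function can be tried against a scratch directory before touching /etc.

```shell
#!/bin/sh
# Sketch only: helper names are hypothetical, paths are taken from the message.

# Write the loader config fragment containing the HTCondor library directory.
# With no argument it targets /etc/ld.so.conf.d, which requires root.
write_condor_ldconf() {
    conf_dir="${1:-/etc/ld.so.conf.d}"
    printf '%s\n' /usr/lib64/condor > "$conf_dir/condor.conf"
}

# Read ldd output on stdin and print the names of libraries that did not resolve.
missing_libs() {
    awk '/=> not found/ { print $1 }'
}

# Typical use, as root:
#   write_condor_ldconf && ldconfig
#   ldd /usr/sbin/nordugrid_gahp | missing_libs
```

If the second command prints nothing after ldconfig has run, the loader can now resolve every library the gahp binary links against.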
Brian
> On Oct 2, 2015, at 8:56 AM, Jose Caballero <jcaballero.hep@xxxxxxxxx> wrote:
>
> 2015-10-01 17:44 GMT-04:00 Jose Caballero <jcaballero.hep@xxxxxxxxx>:
>> 2015-10-01 16:22 GMT-04:00 Tim Theisen <tim@xxxxxxxxxxx>:
>>> Hi Jose,
>>>
>>> Please try installing the condor-all rpm. This will pull in the
>>> condor-externals that you are missing.
>>>
>>>
>>
>> Hi,
>>
>> I am at home now, so let's resume tomorrow.
>> But I can tell you I tried that this afternoon, and it was even worse.
>> There were many missing dependencies (libraries needed by
>> /usr/sbin/nordugrid_gahp).
>>
>> Cheers
>> Jose
>
>
>
> If I install condor-all on a clean VM, I get this (*)
> Anyway, I just installed condor-all on the same host I was working on,
> where I had condor installed previously, and nordugrid_gahp now has all
> its dependencies.
> New deployment looks like this (**).
> Still same behaviour (***).
>
>
> (*)
> # ldd /usr/sbin/nordugrid_gahp
> linux-vdso.so.1 => (0x00007ffff79f2000)
> libdl.so.2 => /lib64/libdl.so.2 (0x00007fb21d1c7000)
> libresolv.so.2 => /lib64/libresolv.so.2 (0x00007fb21cfad000)
> libglobus_ftp_client.so.2 => not found
> libglobus_ftp_control.so.1 => not found
> libglobus_gssapi_error.so.2 => not found
> libglobus_xio.so.0 => not found
> libglobus_io.so.3 => not found
> libglobus_rsl.so.2 => not found
> libglobus_gss_assist.so.3 => not found
> libglobus_gssapi_gsi.so.4 => not found
> libglobus_gsi_proxy_core.so.0 => not found
> libglobus_gsi_credential.so.1 => not found
> libglobus_gsi_callback.so.0 => not found
> libglobus_gsi_sysconfig.so.1 => not found
> libglobus_oldgaa.so.0 => not found
> libglobus_gsi_cert_utils.so.0 => not found
> libglobus_openssl.so.0 => not found
> libglobus_openssl_error.so.0 => not found
> libglobus_proxy_ssl.so.1 => not found
> libglobus_callout.so.0 => not found
> libglobus_common.so.0 => not found
> libltdl.so.7 => /usr/lib64/libltdl.so.7 (0x00007fb21cd9c000)
> libcrypto.so.10 => /usr/lib64/libcrypto.so.10 (0x00007fb21c9b9000)
> libldap-2.4.so.2 => /lib64/libldap-2.4.so.2 (0x00007fb21c769000)
> liblber-2.4.so.2 => /lib64/liblber-2.4.so.2 (0x00007fb21c559000)
> libstdc++.so.6 => /usr/lib64/libstdc++.so.6 (0x00007fb21c253000)
> libm.so.6 => /lib64/libm.so.6 (0x00007fb21bfcf000)
> libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fb21bdb8000)
> libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fb21bb9b000)
> libc.so.6 => /lib64/libc.so.6 (0x00007fb21b807000)
> /lib64/ld-linux-x86-64.so.2 (0x00007fb21d3d5000)
> libz.so.1 => /lib64/libz.so.1 (0x00007fb21b5f0000)
> libsasl2.so.2 => /usr/lib64/libsasl2.so.2 (0x00007fb21b3d6000)
> libssl3.so => /usr/lib64/libssl3.so (0x00007fb21b197000)
> libsmime3.so => /usr/lib64/libsmime3.so (0x00007fb21af6a000)
> libnss3.so => /usr/lib64/libnss3.so (0x00007fb21ac2b000)
> libnssutil3.so => /usr/lib64/libnssutil3.so (0x00007fb21a9ff000)
> libplds4.so => /lib64/libplds4.so (0x00007fb21a7fa000)
> libplc4.so => /lib64/libplc4.so (0x00007fb21a5f5000)
> libnspr4.so => /lib64/libnspr4.so (0x00007fb21a3b7000)
> libcrypt.so.1 => /lib64/libcrypt.so.1 (0x00007fb21a17f000)
> librt.so.1 => /lib64/librt.so.1 (0x00007fb219f77000)
> libfreebl3.so => /lib64/libfreebl3.so (0x00007fb219cfd000)
>
>
> (**)
> condor-8.4.0-1.el6.x86_64
> condor-all-8.4.0-1.el6.x86_64
> condor-externals-8.4.0-1.el6.x86_64
> condor-classads-8.4.0-1.el6.x86_64
> condor-bosco-8.4.0-1.el6.x86_64
> condor-vm-gahp-8.4.0-1.el6.x86_64
> condor-external-libs-8.4.0-1.el6.x86_64
> condor-python-8.4.0-1.el6.x86_64
> condor-kbdd-8.4.0-1.el6.x86_64
> condor-std-universe-8.4.0-1.el6.x86_64
> condor-procd-8.4.0-1.el6.x86_64
> condor-cream-gahp-8.4.0-1.el6.x86_64
>
>
> (***)
> 10/02/15 09:52:01 [109596] Found job 9487.0 --- inserting
> 10/02/15 09:52:01 [109596] gahp server not up yet, delaying ping
> 10/02/15 09:52:01 [109596] gahp server not up yet, delaying checkDelegation
> 10/02/15 09:52:01 [109596] BaseResource::DoBatchStatus: gahp server not up yet, delaying 5 seconds
> 10/02/15 09:52:01 [109596] (9487.0) doEvaluateState called: gmState GM_INIT, creamState
> 10/02/15 09:52:01 [109596] GAHP server pid = 109599
> 10/02/15 09:52:07 [109596] (9487.0) doEvaluateState called: gmState GM_DELEGATE_PROXY, creamState
> 10/02/15 09:52:08 [109596] resource https://ce403.cern.ch:8443/ce-cream/services/CREAM2 is now up
> 10/02/15 09:52:08 [109596] (9487.0) doEvaluateState called: gmState GM_SET_LEASE, creamState
> 10/02/15 09:52:08 [109596] (9487.0) doEvaluateState called: gmState GM_SET_LEASE, creamState
> 10/02/15 09:52:10 [109596] (9487.0) doEvaluateState called: gmState GM_SUBMIT, creamState
> 10/02/15 09:52:12 [109596] (9487.0) doEvaluateState called: gmState GM_SUBMIT_SAVE, creamState
> 10/02/15 09:52:12 [109596] GAHP server pid = 109984
> 10/02/15 09:52:12 [109596] GAHP command 'INITIALIZE_FROM_FILE' failed: -1
> 10/02/15 09:52:12 [109596] GAHP: Failed to initialize from file
> 10/02/15 09:52:12 [109596] (9487.0) doEvaluateState called: gmState GM_STAGE_IN, creamState
> 10/02/15 09:52:12 [109596] (9487.0) Stage-in failed: Failed to start gahp
> 10/02/15 09:52:17 [109596] No jobs left, shutting down
> 10/02/15 09:52:19 [109596] Got SIGTERM. Performing graceful shutdown.
> 10/02/15 09:52:19 [109596] **** condor_gridmanager (condor_GRIDMANAGER) pid 109596 EXITING WITH STATUS 0