I wonder if the path of your interactive shell is unusual. (are you really running commands as the user roo?)
try running this command
which gstat
What does it return?
You could try configuring the GANGLIA_GSTAT_COMMAND to have the full path to the gstat command by adding something like this to your condor configuration.
GANGLIA_GSTAT_COMMAND=/path/to/gstat --all --mpifile --gmond_ip=localhost --gmond_port=8649
-tj
From: Nagaraj Panyam <pn@xxxxxxxxxxx>
Sent: Wednesday, July 28, 2021 8:11 AM To: John M Knoeller <johnkn@xxxxxxxxxxx>; HTCondor-Users Mail List <htcondor-users@xxxxxxxxxxx> Subject: Re: [HTCondor-users] HTCondor and condor_ganglia issues Hi,
I have the following issues that I need help with.
About my setup: I have a Ganglia gmetad that handles the regular metrics (cpu, mem, etc) that are sent by gmond's on execute nodes. This part is fine. I now wish to add HTCondor to same gmetad and I need help. This gmetad is on the same host as collector and so on this
host I enabled condor_gangliad. (gmetad, collector and condor_gangliad on same host)
A) GangliadLog has the following set lines repeating. Clip is pasted below. What is the my_popenv error about ?
my_popenv: Failed to exec “gstat, errno=2 (No such file or directory)
When I run the gstat command, it shows output as below:
[roo@ce ~]# gstat --all --mpifile --gmond_ip=127.0.0.1 --gmond_port=8649 wn06.my.domain:128
B) Is condor_gangliad a routine "data source" for Ganglia's gmetad"? What should be the "data_source" declaration in gmetad.conf? I have gmond that listens on 8649 for the metrics from the execute nodes. The host running collector itself appears as "localhost" (see above). I tried to understand from this tutorial video at https://research.cs.wisc.edu/htcondor/tutorials/videos/2014/Ganglia.html but I could not read the Ganglia screen shown in the video.
Thanks Nagaraj
On 7/28/21 3:14 AM, John M Knoeller wrote:
|