Hello all.
We have a cluster in which all nodes run CentOS 7. I would like to monitor its usage with Ganglia.
I configured the Ganglia server on the HTCondor collector host. This is working: I can access the web service and see some information.
I also installed Ganglia on a few nodes and I can monitor their status (CPU usage, memory, disk, etc.), but the Condor metrics don't show up in the web interface.
In fact, what I don't know is which configuration is needed within Condor to send its metrics to Ganglia. The only thing I did was enable the GANGLIAD daemon on the Condor collector. The service is running, but I can't see anything related to Condor in the web server. I do know that there is a file /etc/condor/ganglia.d/00_default_metrics, but how do we use this file to define what shows up in Ganglia? What am I missing?
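To make the setup concrete, here is a minimal sketch of what enabling the daemon amounts to in the collector's HTCondor configuration. It is illustrative rather than a copy of our file: the second line only spells out the directory where I found the metrics file, and I am assuming that is the location condor_gangliad reads from.

```
# Illustrative sketch only, not copied verbatim from our collector.
# Add condor_gangliad to the daemons started by condor_master.
DAEMON_LIST = $(DAEMON_LIST) GANGLIAD

# Assumption: directory holding the metric definitions, i.e. where
# 00_default_metrics lives on our collector.
GANGLIAD_METRICS_CONFIG_DIR = /etc/condor/ganglia.d
```

Beyond this, I have not changed any other Ganglia-related setting in Condor.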