Great, it worked! It's just showing you all the metrics it will publish. Take off the "--test" flag and it will publish the metrics to graphite and/or influxdb (and not dump them all to stdout). Grab the Grafana dashboards from https://github.com/fifemon/dashboards [1] and you should see data!

Regards,
Kevin
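
For the data source rename described in footnote [1], a Python alternative to sed might look like the sketch below. It assumes the dashboards are plain JSON files in a local checkout and that the data source name appears as a whole string value. The names `my-graphite-datasource`, `rename_datasource` and `rewrite_dashboards` are hypothetical, not part of fifemon or Grafana. Unlike the sed substitution, which rewrites the text anywhere it occurs, this only replaces values that equal the old name exactly.

```python
import json
from pathlib import Path

OLD_NAME = "fifemon-graphite"        # name hard-coded in the exported dashboards
NEW_NAME = "my-graphite-datasource"  # hypothetical: use your own data source name


def rename_datasource(node, old=OLD_NAME, new=NEW_NAME):
    """Recursively replace string values equal to `old` in parsed JSON."""
    if isinstance(node, dict):
        return {key: rename_datasource(value, old, new) for key, value in node.items()}
    if isinstance(node, list):
        return [rename_datasource(item, old, new) for item in node]
    if isinstance(node, str) and node == old:
        return new
    return node


def rewrite_dashboards(directory, old=OLD_NAME, new=NEW_NAME):
    """Rewrite every *.json file under `directory` in place.

    Returns the number of files that were actually changed.
    """
    changed = 0
    for path in sorted(Path(directory).rglob("*.json")):
        original = json.loads(path.read_text())
        updated = rename_datasource(original, old, new)
        if updated != original:
            path.write_text(json.dumps(updated, indent=2) + "\n")
            changed += 1
    return changed
```

Calling `rewrite_dashboards("dashboards")` on a checkout of the repository would then update the files before they are imported into Grafana. Working on a copy is advisable, since the files are rewritten in place.
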
[1] Note that the dashboards are currently hard-coded to use a graphite datasource named "fifemon-graphite"; they were created before Grafana would template data sources on export. Pretty easy to change in Grafana, or with sed or editor of your choice (s/fifemon-graphite/my-graphite-datasource/).

From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Uchenna Ojiaku - NOAA Affiliate <uchenna.ojiaku@xxxxxxxx>
Sent: Wednesday, March 1, 2017 12:25 PM
To: HTCondor-Users Mail List
Subject: Re: [HTCondor-users] Grafana + HTCondor Job Metrics

Hi Kevin,
The result was quite long. Here's the head and tail of the result.
(venv2.0)[xxxxxx@xxxxxxxxxxx bin]# ./condor_probe.py --test /root/probes/etc/condor-probe.cfg
2017-03-01 11:54:57,882 [INFO] __main__ - Probe configuraion: {'delay': 30, 'graphite_host': 'xxxxxxxxx.xxxxxx.xxxxxxxxxx.xxxxxxxxxxxx', 'graphite_pickle_port': 2004, 'influxdb_db': 'xxxxx', 'influxdb_host': 'xxxxxxxxxxxxxxxxxx.xxxxxxx.xxxxxxxxx', 'influxdb_port': '8086', 'influxdb_tags': {'foo': 'bar'}, 'interval': 240, 'meta_namespace': 'probes.condor-mypool', 'namespace': 'clusters.mypool', 'once': False, 'pool': 'xxxxxxxxxxx.xxxxxx.xxxxxxx.xxxxxxxxxxxxx', 'post_pool_glideins': False, 'post_pool_jobs': True, 'post_pool_prio': False, 'post_pool_slots': True, 'post_pool_status': True, 'retries': 10, 'test': True, 'use_graphite': True, 'use_gsi_auth': False, 'use_influxdb': False, 'x509_user_cert': '', 'x509_user_key': ''}
2017-03-01 11:54:57,883 [INFO] __main__ - querying pool xxxxxxx.xxxxxx.xxxxxx.xxxx status
2017-03-01 11:54:57,905 [DEBUG] fifemon.graphite - ('clusters.mypool.collectors.My_Pool_-_xxxxxx_xxxxxx_xxxxxxx_noaa_gov-xxxxx_xxxx_xxxxx_xxxx_xxxx.RecentStatsLifetime', (1488387297.9056799, 1200L))
2017-03-01 11:54:57,905 [DEBUG] fifemon.graphite - ('clusters.mypool.collectors.My_Pool_xxxxx_xxxx_xxxxx_xxxx_xxxx-xxxxx_xxxx_xxxxx_xxxx_xxxxx.MonitorSelfTime', (1488387297.9056799, 1488387088L))
2017-03-01 11:54:57,905 [DEBUG] fifemon.graphite - ('clusters.mypool.collectors.My_Pool_xxxxx_xxxx_xxxxx_xxxx_xxxx-xxxxx_xxxx_xxxxx_xxxx_xxxxx.CurrentJobsRunningStandard', (1488387297.9056799, 0L))
2017-03-01 11:54:57,905 [DEBUG] fifemon.graphite - ('clusters.mypool.collectors.My_Pool_xxxxx_xxxx_xxxxx_xxxx_xxxx-xxxxx_xxxx_xxxxx_xxxx_xxxxx.MonitorSelfRegisteredSocketCount', (1488387297.9056799, 15L))
2017-03-01 11:54:57,905 [DEBUG] fifemon.graphite - ('clusters.mypool.collectors.My_Pool_xxxxx_xxxx_xxxxx_xxxx_xxxx-xxxxx_xxxx_xxxxx_xxxx_xxxxx.UpdatesInitial_Master', (1488387297.9056799, 5L))
2017-03-01 11:54:57,905 [DEBUG] fifemon.graphite - ('clusters.mypool.collectors.My_Pool_xxxxx_xxxx_xxxxx_xxxx_xxxx-xxxxx_xxxx_xxxxx_xxxx_xxxxx.RecentUpdatesTotal_StartdPvt', (1488387297.9056799, 18L))
2017-03-01 11:54:57,906 [DEBUG] fifemon.graphite - ('clusters.mypool.collectors.My_Pool_xxxxx_xxxx_xxxxx_xxxx_xxxx-xxxxx_xxxx_xxxxx_xxxx_xxxxx.LastHeardFrom', (1488387297.9056799, 1488387097L))
2017-03-01 11:54:57,906 [DEBUG] fifemon.graphite - ('clusters.mypool.schedds.xxxxx_xxxx_xxxxx_xxxx_xxxx.StatsLifetime', (1488387297.9056799, 3533605L))
2017-03-01 11:54:57,906 [DEBUG] fifemon.graphite - ('clusters.mypool.schedds.xxxxx_xxxx_xxxxx_xxxx_xxxx.TransferQueueNumWaitingToUpload', (1488387297.9056799, 0L))
2017-03-01 11:54:57,906 [DEBUG] fifemon.graphite - ('clusters.mypool.collectors.My_Pool_xxxxx_xxxx_xxxxx_xxxx_xxxx-xxxxx_xxxx_xxxxx_xxxx_xxxxx.MaxJobsRunningPipe', (1488387297.9056799, 0L))
2017-03-01 11:54:57,920 [DEBUG] fifemon.graphite - ('clusters.mypool.negotiators.xxxxx_xxxx_xxxxx_xxxx_xxxx.LastNegotiationCycleMatchRate0', (1488387297.9056799, 0.0))
2017-03-01 11:54:57,920 [DEBUG] fifemon.graphite - ('clusters.mypool.negotiators.xxxxx_xxxx_xxxxx_xxxx_xxxx.LastNegotiationCycleMatchRate1', (1488387297.9056799, 0.0))
2017-03-01 11:54:57,920 [INFO] __main__ - querying pool xxxxx.xxxx.xxxxx.xxxx.xxxxslots
2017-03-01 11:54:57,935 [DEBUG] fifemon.graphite - ('clusters.mypool.slots.Partitionable.totals.TotalSlotDisk', (1488387297.9352701, 15220332.0))
2017-03-01 11:54:57,935 [DEBUG] fifemon.graphite - ('clusters.mypool.slots.jobs.totals.DiskUsage', (1488387297.9352701, 0))
2017-03-01 11:54:57,935 [DEBUG] fifemon.graphite - ('clusters.mypool.slots.Partitionable.totals.TotalSlotCpus', (1488387297.9352701, 112L))
2017-03-01 11:54:57,935 [DEBUG] fifemon.graphite - ('clusters.mypool.slots.Partitionable.totals.TotalSlotMemory', (1488387297.9352701, 1258795L))
2017-03-01 11:54:57,935 [DEBUG] fifemon.graphite - ('clusters.mypool.slots.Partitionable.totals.Memory', (1488387297.9352701, 1258795L))
2017-03-01 11:54:57,935 [DEBUG] fifemon.graphite - ('clusters.mypool.slots.Partitionable.totals.TotalDisk', (1488387297.9352701, 14885972L))
2017-03-01 11:54:57,935 [DEBUG] fifemon.graphite - ('clusters.mypool.slots.Partitionable.totals.TotalCondorLoadAvg', (1488387297.9352701, 0.0))
2017-03-01 11:54:57,935 [DEBUG] fifemon.graphite - ('clusters.mypool.slots.Partitionable.totals.TotalCpus', (1488387297.9352701, 120.0))
2017-03-01 11:54:57,935 [DEBUG] fifemon.graphite - ('clusters.mypool.slots.Partitionable.totals.Cpus', (1488387297.9352701, 112L))
2017-03-01 11:54:57,935 [DEBUG] fifemon.graphite - ('clusters.mypool.slots.Partitionable.totals.TotalLoadAvg', (1488387297.9352701, 0.59000000000000008))
2017-03-01 11:54:57,935 [DEBUG] fifemon.graphite - ('clusters.mypool.slots.Partitionable.totals.Disk', (1488387297.9352701, 14885972L))
2017-03-01 11:54:57,935 [DEBUG] fifemon.graphite - ('clusters.mypool.slots.Partitionable.totals.LoadAvg', (1488387297.9352701, 0.59000000000000008))
2017-03-01 11:54:57,935 [DEBUG] fifemon.graphite - ('clusters.mypool.slots.jobs.totals.MemoryUsage', (1488387297.9352701, 0))
2017-03-01 11:54:57,936 [DEBUG] fifemon.graphite - ('clusters.mypool.slots.Partitionable.totals.TotalMemory', (1488387297.9352701, 1258795L))
2017-03-01 11:54:57,936 [INFO] __main__ - querying pool xxxxxx.xxxxxx.xxxxx.xxxxxx.xxxxxx jobs
2017-03-01 11:54:57,939 [INFO] fifemon.probe - (clusters.mypool) posted data in 0.055379152298 s
2017-03-01 11:54:57,939 [DEBUG] fifemon.graphite - ('probes.condor-mypool.update_time', (1488387297.939198, 0.055379152297973633))
2017-03-01 11:54:57,939 [INFO] fifemon.probe - (clusters.mypool) sleeping 229.944620848 s

On Wed, Mar 1, 2017 at 11:51 AM, Kevin Retzke <kretzke@xxxxxxxx> wrote: