Ok, great, the probe is working, it's just not finding the collector. Try setting the "pool" option in the "[condor]" section to your pool's central manager/collector host. See output of "condor_status -collector". Maybe setting 'pool=""' will also work if you're running on a machine within the pool; I've never tried.
Regards, Kevin
From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Uchenna Ojiaku - NOAA Affiliate <uchenna.ojiaku@xxxxxxxx>
Sent: Wednesday, March 1, 2017 10:30 AM To: HTCondor-Users Mail List Subject: Re: [HTCondor-users] Grafana + HTCondor Job Metrics Here's the rest:
Thanks,2017-03-01 11:09:21,507 [INFO] __main__ - querying pool localhost status 2017-03-01 11:09:21,513 [WARNING] condor.status - trouble getting pool localhost negotiators status, retrying in 30s. 2017-03-01 11:09:51,546 [WARNING] condor.status - trouble getting pool localhost negotiators status, retrying in 30s. 2017-03-01 11:10:21,589 [WARNING] condor.status - trouble getting pool localhost negotiators status, retrying in 30s. 2017-03-01 11:10:51,620 [WARNING] condor.status - trouble getting pool localhost negotiators status, retrying in 30s. 2017-03-01 11:11:21,654 [WARNING] condor.status - trouble getting pool localhost negotiators status, retrying in 30s. 2017-03-01 11:11:51,686 [WARNING] condor.status - trouble getting pool localhost negotiators status, retrying in 30s. 2017-03-01 11:12:21,725 [WARNING] condor.status - trouble getting pool localhost negotiators status, retrying in 30s. 2017-03-01 11:12:51,757 [WARNING] condor.status - trouble getting pool localhost negotiators status, retrying in 30s. 2017-03-01 11:13:21,802 [WARNING] condor.status - trouble getting pool localhost negotiators status, retrying in 30s. 2017-03-01 11:13:51,834 [WARNING] condor.status - trouble getting pool localhost negotiators status, retrying in 30s. 2017-03-01 11:14:21,864 [ERROR] condor.status - trouble getting pool localhost negotiators status, giving up. 2017-03-01 11:14:21,870 [WARNING] condor.status - trouble getting pool localhost schedds status, retrying in 30s. 2017-03-01 11:14:51,901 [WARNING] condor.status - trouble getting pool localhost schedds status, retrying in 30s. 2017-03-01 11:15:21,949 [WARNING] condor.status - trouble getting pool localhost schedds status, retrying in 30s. 2017-03-01 11:15:51,981 [WARNING] condor.status - trouble getting pool localhost schedds status, retrying in 30s. 2017-03-01 11:16:22,023 [WARNING] condor.status - trouble getting pool localhost schedds status, retrying in 30s. 2017-03-01 11:16:52,055 [WARNING] condor.status - trouble getting pool localhost schedds status, retrying in 30s. 2017-03-01 11:17:22,099 [WARNING] condor.status - trouble getting pool localhost schedds status, retrying in 30s. 2017-03-01 11:17:52,137 [WARNING] condor.status - trouble getting pool localhost schedds status, retrying in 30s. 2017-03-01 11:18:22,173 [WARNING] condor.status - trouble getting pool localhost schedds status, retrying in 30s. 2017-03-01 11:18:52,205 [WARNING] condor.status - trouble getting pool localhost schedds status, retrying in 30s. 2017-03-01 11:19:22,235 [ERROR] condor.status - trouble getting pool localhost schedds status, giving up. 2017-03-01 11:19:22,249 [WARNING] condor.status - trouble getting pool localhost collectors status, retrying in 30s. 2017-03-01 11:19:52,284 [WARNING] condor.status - trouble getting pool localhost collectors status, retrying in 30s. 2017-03-01 11:20:22,327 [WARNING] condor.status - trouble getting pool localhost collectors status, retrying in 30s. 2017-03-01 11:20:52,360 [WARNING] condor.status - trouble getting pool localhost collectors status, retrying in 30s. 2017-03-01 11:21:22,403 [WARNING] condor.status - trouble getting pool localhost collectors status, retrying in 30s. 2017-03-01 11:21:52,443 [WARNING] condor.status - trouble getting pool localhost collectors status, retrying in 30s. 2017-03-01 11:22:22,482 [WARNING] condor.status - trouble getting pool localhost collectors status, retrying in 30s. 2017-03-01 11:22:52,521 [WARNING] condor.status - trouble getting pool localhost collectors status, retrying in 30s. 2017-03-01 11:23:22,570 [WARNING] condor.status - trouble getting pool localhost collectors status, retrying in 30s. 2017-03-01 11:23:52,605 [WARNING] condor.status - trouble getting pool localhost collectors status, retrying in 30s. 2017-03-01 11:24:22,636 [ERROR] condor.status - trouble getting pool localhost collectors status, giving up. 2017-03-01 11:24:22,636 [INFO] __main__ - querying pool localhost slots 2017-03-01 11:24:22,653 [WARNING] condor.slots - trouble getting pool localhost startds, retrying in 30s. 2017-03-01 11:24:52,685 [WARNING] condor.slots - trouble getting pool localhost startds, retrying in 30s. 2017-03-01 11:25:22,725 [WARNING] condor.slots - trouble getting pool localhost startds, retrying in 30s. 2017-03-01 11:25:52,759 [WARNING] condor.slots - trouble getting pool localhost startds, retrying in 30s. 2017-03-01 11:26:22,798 [WARNING] condor.slots - trouble getting pool localhost startds, retrying in 30s. 2017-03-01 11:26:52,835 [WARNING] condor.slots - trouble getting pool localhost startds, retrying in 30s. 2017-03-01 11:27:22,871 [WARNING] condor.slots - trouble getting pool localhost startds, retrying in 30s. 2017-03-01 11:27:52,904 [WARNING] condor.slots - trouble getting pool localhost startds, retrying in 30s. 2017-03-01 11:28:22,943 [WARNING] condor.slots - trouble getting pool localhost startds, retrying in 30s. 2017-03-01 11:28:52,975 [WARNING] condor.slots - trouble getting pool localhost startds, retrying in 30s. 2017-03-01 11:29:23,005 [ERROR] condor.slots - trouble getting pool localhost startds, giving up. 2017-03-01 11:29:23,005 [INFO] __main__ - querying pool localhost jobs 2017-03-01 11:29:23,020 [ERROR] root - Trouble getting pool localhost schedds. 2017-03-01 11:29:23,020 [WARNING] fifemon.graphite - send_dict called with no data 2017-03-01 11:29:23,020 [INFO] fifemon.probe - (clusters.mypool) posted data in 1201.51364803 s 2017-03-01 11:29:23,020 [DEBUG] fifemon.graphite - ('probes.condor-mypool.update_time', (1488385763.020906, 1201.5136480331421)) 2017-03-01 11:29:23,020 [INFO] fifemon.probe - (clusters.mypool) sleeping 0 s On Wed, Mar 1, 2017 at 11:07 AM, Uchenna Ojiaku - NOAA Affiliate
<uchenna.ojiaku@xxxxxxxx> wrote:
|