I have a simple shell script (attached) to forward a classad from a number of clusters to a central collector/negotiator, from there to do matchmaking with Condor-G.
On the first 2 clusters I tried, it worked and I can see the classAd.
It is executing the command
condor_advertise -pool fermigrid1.fnal.gov UPDATE_STARTD_AD GCclassad.txt
and the contents of GCclassad.txt look like this:
So on the central collector/negotiator, condor_status looks like this:
Machines Owner Claimed Unclaimed Matched Preempting
INTEL/LINUX 4 0 0 4 0 0
Total 4 0 0 4 0 0
(Omitted 2 malformed ads in computed attribute totals)
-------------------------\\
If I do the following:
MyAddress = "<131.225.166.93:0>" LastHeardFrom = 1129319400 UpdatesTotal = 4 UpdatesSequenced = 0 UpdatesLost = 0 UpdatesHistory = "0x0000000000000000000000000000000
I see that the two classads which successfully are seen by the collector have a field called MyAddress appended to the classad, a field which is not in the classad file.'
There is a third node on which I am trying to run the same script. I do not see this one show up in the collector. Instead I see:
10/13 09:44:00 Got IP = '(null)' 10/13 09:44:00 No IP address in classAd 10/13 09:44:00 Error: Invalid StartAd 10/13 09:44:00 Could not make hashkey --- ignoring ad 10/13 09:44:00 Received malformed ad from command (0). Ignoring.
Any idea what the magic configuration tweak is to make it include MyAddress in the classad? Thanks for any help.
Steve Timm
-- ------------------------------------------------------------------ Steven C. Timm, Ph.D (630) 840-8525 timm@xxxxxxxx http://home.fnal.gov/~timm/ Fermilab Computing Div/Core Support Services Dept./Scientific Computing Section Assistant Group Leader, Farms and Clustered Systems Group Lead of Computing Farms Team
Attachment:
runclassad.sh
Description: Bourne shell script
Attachment:
GCclassad.sh
Description: Bourne shell script