Thanks Greg for finding the diagrams (slides 23-32) from T.T. I was looking for. The notes on slides 28-30 of the PPTX version go into finer sequential detail about the steps in Claim acquisition. Some additional questions:
- What is Q on slide 27? I understand that J is the job classad and S is the classad for the execute machine.
- Does the Shadow talk to the Startd and tell it to make a Starter?
Yes. Once the negotiator has given the schedd a slot to use, the schedd "claim"s the slot for exclusive (but time-limited) use by that schedd. Assuming that succeeds, the shadow "activate"s the claim to run a single job, which causes the startd to create the starter.
- Where does file transfer go (inbound and outbound) in these steps?
File transfer is handled by the shadow and the starter. Input transfer happens right after activation, and output transfer after the job completes; the claim stays active during both.
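As a rough illustration of that ordering, here is a toy model in Python. It is not HTCondor code: the function name and the event strings are made up for this sketch, and it only lists the steps mentioned in this thread for the first job on a freshly matched slot.

```python
# Toy model of the ordering described in this thread; not HTCondor code.
# The function name and event strings are hypothetical.

def first_job_events(job):
    """Return the ordered events for the first job on a newly matched slot."""
    return [
        "schedd claims slot",                        # exclusive, time-limited
        f"shadow activates claim for {job}",         # one activation per job
        "startd creates starter",                    # caused by the activation
        "input file transfer (shadow to starter)",   # right after activation
        f"{job} runs",
        "output file transfer (starter to shadow)",  # after the job completes
        "activation ends, claim still held",         # claim may be reused
    ]

for event in first_job_events("job 1"):
    print(event)
```

The point of the sketch is only the order: claim first, then one activation per job, with both file transfers falling inside the activation.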
- Are there additional communications between processes once a single job is completed?
Once the first job on a claim completes, if the amount of time it
took is less than CLAIM_WORKLIFE, and the schedd can find another
job that fits in the slot, it is free to launch another shadow to
reuse the existing claim, but with a new activation for the new
job.
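
The reuse decision can be sketched as a small Python function. This is a simplified toy, not the schedd's actual logic: the function and parameter names are hypothetical, `elapsed_seconds` stands for whatever time is compared against CLAIM_WORKLIFE, and "fits in the slot" is reduced to a caller-supplied predicate.

```python
# Toy model only; names are hypothetical, not HTCondor APIs.

def next_job_for_claim(elapsed_seconds, claim_worklife, idle_jobs, fits_slot):
    """Return the next job to activate on the existing claim, or None."""
    if elapsed_seconds >= claim_worklife:
        return None              # worklife used up: no new activation
    for job in idle_jobs:
        if fits_slot(job):
            return job           # reuse the claim with a new activation
    return None                  # nothing fits: the claim is not reused

# Example: a 1-CPU slot and a 1200-second worklife (assumed values).
idle = [{"id": 7, "cpus": 4}, {"id": 8, "cpus": 1}]
fits = lambda job: job["cpus"] <= 1
print(next_job_for_claim(300, 1200, idle, fits))
print(next_job_for_claim(1500, 1200, idle, fits))
```

In the first call the claim is young enough and job 8 fits, so it would get a new activation; in the second call the time limit has passed, so no further job is started on that claim.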
These charts are really nice to show how one can build a robust system from a number of disparately connected parts.
Thanks, and good luck with your talk,
-greg