[Condor-devel] starter wisdom: what happens when a job exits?
- Date: Tue, 25 Mar 2008 11:51:33 -0700
 
- From: Derek Wright <wright@xxxxxxxxxxx>
 
- Subject: [Condor-devel] starter wisdom: what happens when a job exits?
 
As per previous threads on this list, I've spent a lot of time in the  
past few months adding/changing a ton of code in the startd and  
starter so that there's now a system of hooks in place at various  
stages of a job's life-cycle on the execution machine.  One of the  
things I had to do was come to grips with what the starter actually  
does when a job exits (which was much more challenging than it  
sounds). ;)

As a public service to the Condor development community (and to help  
myself remain sane while I had to change some of it), I wrote up a  
fairly detailed but high-level explanation of what's going on in the  
starter code whenever a job exits.  The results live in the "WISDOM"  
file in the src/condor_starter.V6.1 directory (in HEAD).  The
write-up explains how things work in 7.1.* and beyond, with the
hooks in place -- I didn't bother to write up the old workflow
with the older (if you can believe it, more insane) names, etc.

If you're interested in this topic and have any questions, please ask  
me sooner rather than later so I can edit the document while a) I  
still have time, and b) it's still mostly swapped into my working set.

Cheers,
-Derek