On 06/29/2018 03:16 AM, Thomas Hartmann wrote:
Hi Greg, many thanks - turning on cgroup delegation seems to do the trick!! :) With the delegate option on for all controllers in the Condor unit's service section [1], the job slices survived all starts and restarts of other units!
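For readers following along, here is a minimal sketch of what such a setting might look like. It assumes a systemd drop-in for the Condor unit. The file path and the unit name condor.service are illustrative assumptions, and the configuration Thomas actually used is the one referenced in [1].

```ini
# Hypothetical drop-in path: /etc/systemd/system/condor.service.d/delegate.conf
[Service]
# Delegate=yes hands control of the unit's cgroup subtree to the service
# for all supported controllers, so systemd leaves the cgroups created
# underneath it alone.
Delegate=yes
```

A drop-in like this would normally take effect after systemctl daemon-reload and a restart of the unit.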
Thomas: How are you reproducing the problem? I can see it happening from time to time, but systemctl restart random_service doesn't seem to trigger it. Do you have some magic that reliably reproduces the escape?
Thanks,
-greg