I find that Condor will re-run jobs in other slots when the machine they are on fails. But I think the interval before that happens is too long: about 22 minutes in my 5-node cluster.
So how can I shorten this interval? Condor should already know that the machine is down, because new jobs are not sent to it.
By the way, condor_q always shows the jobs in the running state. Is that correct?