[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] Schedd possibly spinning on a job



Hi all,

 

Our schedd has been pegged at 100% cpu for several hours and immediately returns to that state on restart.  At D_FULLDEBUG the log floods with the message

 

   05/16/19 12:50:58 satisfyJobs: finding resources for 6092282.0

 

so it almost looks like the schedd is stuck in a loop on this job.  I’d like to remove it to see if that fixes the problem, but of course with the schedd running at 100% condor_rm can’t get through.  Any suggestions?  Also, is there any way to get more detailed information on what’s happening?  D_ALL didn’t seem to have anything useful.

 

Thanks,

 

                                                                                                - Larne