Hello,
I have a pool of computers with HTCondor 8.0.2.
As you know, is critical to have working HTCondor on the central point.
The problem is that quite often Condor gets blocked and the central point will be down.
For previous versions I used the below DOS commands for an unsupervised hard restart (the restart was done automatically once a day):
solution 1:
net stop condor
net start condor
solution 2:
taskkill /f /im condor_master.exe
net start condor /y
With Condor 8.0.2 the above solutions are no longer efficient because some of the demons remain blocked and I cannot restart Condor service.
Any other working solutions when Condor is blocked?
Best regards,
Daniel Popu