Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Condor-users] killing globus-job-managers
- Date: Thu, 20 Jul 2006 15:24:55 -0700
- From: Michael Thomas <thomas@xxxxxxxxxxxxxxx>
- Subject: Re: [Condor-users] killing globus-job-managers
Steven Timm wrote:
> On Wed, 19 Jul 2006, Michael Thomas wrote:
>>My question: Is it ok to start arbitrarily killing some of these g-j-m
>>processes? What effect will it have on the corresponding jobs?
>
>
> I usually look to see if the corresponding job has exited the queue
> already. If so, there's no harm in killing it.
> Even if the job hasn't exited, condor-g will restart another
> jobmanager when it needs to.
>
>
>>Would it be better/equivalent to condor_rm some of the g-j-m jobs (which
>>are easily identified by their command: data --dest-url=http://...)?
>>What effect will condor_rm'ing these jobs have for the user?
>>
>
> These are grid monitor jobs. they should never under any circumstance
> last more than one hour. If they do something is really wrong.
> Cancelling them it will have no effect on whether the user's jobs execute
> or not, just on what is reported to his condor-g client.
I currently have hundreds of the grid monitor jobs sitting idle in the
queue, where the corresponding process (as reported in
/var/log/messages) has gone away. I assume it's safe to kill these?
--Mike