Mailing List Archives Authenticated access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] DedicatedScheduler hogging resources

Date: Thu, 09 Mar 2006 14:59:19 -0600
From: Greg Thain <gthain@xxxxxxxxxxx>
Subject: Re: [Condor-users] DedicatedScheduler hogging resources

Rok Roskar wrote:

I'm running MPI under condor 6.6:

DedicatedScheduler holds on to resources even after all MPI jobs have been
removed from the queue - any way to fix this? Or is it an unfortunate
byproduct of mixing parallel and serial jobs on the same set of resources?

It will hold onto claims for UNUSED_CLAIM_TIMEOUT seconds after the jobleaves the queue, where UNUSED_CLAIM_TIMEOUT is a parameter in thecondor_config file. The default is 300 seconds, and you can lower thisas you like.

Also, my jobs sometimes try to start even when DedicatedScheduler doesn't
have enough resources for them. This causes infinite looping of
unsuccessful job execution, meaning that all the resource time gets
wasted. For example, my job requests 8 machines, but only 7 are available.
Somehow, Condor tries to execute the job anyway, but because there aren't
enough resources, it doesn't run. Solutions?


Does the job try to start, or do the machines just get claimed?

-greg

Follow-Ups:
- Re: [Condor-users] DedicatedScheduler hogging resources
  - From: Rok Roskar
- Re: [Condor-users] DedicatedScheduler hogging resources
  - From: Rok Roskar

References:
- [Condor-users] DedicatedScheduler hogging resources
  - From: Rok Roskar

Prev by Date: [Condor-users] Way to append to classad of already-submitted jobs
Next by Date: Re: [Condor-users] Master won't start
Previous by thread: [Condor-users] DedicatedScheduler hogging resources
Next by thread: Re: [Condor-users] DedicatedScheduler hogging resources
Index(es):
- Date
- Thread

Mailing List Archives

Authenticated access

Re: [Condor-users] DedicatedScheduler hogging resources