Dear All,
I've recently moved to Condor 7.4.2 on our central manager/submit host
running Solaris 10 and have found that the schedd seems to be using a
worrying amount of memory. For instance, at present there are only
~150 jobs in the queue, yet the schedd is taking over 900 MB. The
documentation seems to suggest that it should only need around 10 kB
per job, i.e. roughly 1.5 MB for a queue of this size! Since the usage
has been rising monotonically, seemingly since I restarted the daemons
just a few days ago, I can only assume that this is down to a leak.
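In case anyone wants to check for similar growth on their own submit host, here is a minimal sketch for sampling the schedd's resident set size over time (it assumes the daemon shows up as condor_schedd in the process table, and the 10-minute interval and 24-hour run length are arbitrary):

```shell
#!/bin/sh
# Sample the schedd's RSS periodically so the growth rate can be
# attached to a bug report. Works with Solaris 10 and Linux ps.

rss_kb() {
    # Resident set size in kB for a given pid.
    ps -o rss= -p "$1" | tr -d ' '
}

# The [d] trick stops pgrep -f from matching this script's own
# command line; adjust the pattern if your daemon is named differently.
pid=$(pgrep -f 'condor_sched[d]' | head -1)

i=0
while [ -n "$pid" ] && [ "$i" -lt 144 ]; do   # 144 samples = 24 h
    printf '%s %s kB\n' "$(date)" "$(rss_kb "$pid")"
    sleep 600
    i=$((i + 1))
done
```

Plotting the output should make it obvious whether the usage is genuinely monotonic or just settling at a high plateau.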
The net result of this is that condor_q etc. can be very slow to
respond (more than five minutes on occasion), and it is difficult to
submit more than ~1000 jobs at once, whereas previously there was no
problem with 10 000 jobs. As far as I can see the auto-clustering is
working fine, although I sometimes see messages in the schedd log
about rebuilding tables.
Has anyone else seen this on other systems? Any suggestions for a fix
or workaround?
regards,
-ian.
--------------------------------------------
Dr Ian C. Smith,
Advanced Research Computing (e-Science) Team,
The University of Liverpool,
Computing Services Department.