Mailing List Archives Authenticated access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] timeout reading buffer

Date: Wed, 1 Mar 2006 10:55:20 -0600
From: Jaime Frey <jfrey@xxxxxxxxxxx>
Subject: Re: [Condor-users] timeout reading buffer

On Feb 28, 2006, at 3:00 PM, Preston Smith wrote:

Right as our condor pools reach about 100% capacity, one of thebusiest

schedds basically stops running jobs.. almost all run down to idle..

The negotiator logs:

2/28 15:44:45     Got NO_MORE_JOBS;  done negotiating
2/28 15:44:45   Negotiating with user@xxxxxxxxxxxxxxx at
<128.211.128.11:59684>
2/28 15:45:15 condor_read(): timeout reading buffer.
2/28 15:45:15     Failed to get reply from schedd
2/28 15:45:15   Error: Ignoring schedd for this cycle


condor_q on that schedd shows:
3342 jobs; 3330 idle, 10 running, 2 held


ShadowLog on 128.211.128.11 shows:

2/28 15:48:08 (21939.0) (32200): condor_read(): timeout readingbuffer.

2/28 15:48:08 (21939.0) (32200): AUTHENTICATE: handshake failed!
2/28 15:48:08 (21939.0) (32200): Authentication Error
AUTHENTICATE:1002:Failure performing handshake


Any suggestions on troubleshooting these timeouts?
We're running 6.6.10..

The most useful information would be the schedd log of 128.211.128.11at the time of the timeout.


+--------------------------------+-----------------------------------+
|           Jaime Frey           | I used to be a heavy gambler.     |
|       jfrey@xxxxxxxxxxx        | But now I just make mental bets.  |
| http://www.cs.wisc.edu/~jfrey/ | That's how I lost my mind.        |
+--------------------------------+-----------------------------------+

Follow-Ups:
- Re: [Condor-users] timeout reading buffer
  - From: Preston Smith

Prev by Date: Re: [Condor-users] PREEMPT
Next by Date: Re: [Condor-users] Condor Daemons not able to communicate
Previous by thread: Re: [Condor-users] [Birdbath Related] Strange behaviour - 6.7.17
Next by thread: Re: [Condor-users] timeout reading buffer
Index(es):
- Date
- Thread

Mailing List Archives

Authenticated access

Re: [Condor-users] timeout reading buffer