Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [HTCondor-users] Jobs do not execute, they sit idle in the queue indefinitely
- Date: Mon, 20 May 2013 19:53:31 +0100
- From: Brian Candler <B.Candler@xxxxxxxxx>
- Subject: Re: [HTCondor-users] Jobs do not execute, they sit idle in the queue indefinitely
On Mon, May 20, 2013 at 02:38:31PM -0400, Dan Shea wrote:
> Adding STARTD to the gatekeeper node caused all jobs queued to be
> executed on the gatekeeper.
> It seems the gatekeeper machine can not see the execute-only nodes?
> I'm not sure what I have missed in the configuration to cause this
> behaviour? Network wise they all see each other just fine, hostnames
> resolved via /etc/hosts entries.
Have you set ALLOW_WRITE, if so to what?
> > SchedLog:05/17/13 13:41:21 (pid:9037) WARNING: forward resolution of
> > localhost.localdomain doesn't match 10.11.114.220!
This does look like a problem. What does "hostname" show on all the nodes?
Do you have a "localhost.localdomain" entry in /etc/hosts? Normally it would
be for 127.0.0.1, don't be tempted to set it to the external IP of your
machine.