[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] FW: Linux computers cannot join WindowsCentralManager



Hi Alex!

I may be able to help you out with that. There is a little-known utility called condor_fetchlog that allows you to retrieve Condor log files from remote machines without having to log in. To get the Collector log from the remote host, you can run the following command:

condor_fetchlog <collector-host> COLLECTOR

Likewise you can get the StartLog with this command:

condor_fetchlog <execute-host> STARTD

There's no man page, but you can refer to the Condor manual for more information:

http://www.cs.wisc.edu/condor/manual/v7.0/condor_fetchlog.html

Unfortunately the tool may not work in your case, since these machines are having trouble joining the pool to begin with. Still, it's worth a shot. Good luck!

===================================
Tony Rippy
Phone: 888.292.5320

Cycle Computing, LLC
Leader in Condor Grid Solutions
Enterprise Condor Support and Management Tools

http://www.cyclecomputing.com

On Apr 29, 2008, at 12:35 PM, Alex Alas wrote:

Tony,
Sorry for taking that long in reply, I am not sending the log files you
are requesting at this moment, I am waiting for them as well, I am not
in charge on the linux systems and I requested those logs to our Linux
engineer.
On Friday my co-worker and I were reviewing the condor_config file and
it seems like there are some name resolution issues and some paths on
that config file do not exist. Who installed condor on the linux
computer was our Unix Administrator and due to a lack of documentation
he might have specified a different path by mistake. We have now a
recurrent error showing when we try to run sudo ./condor_reconfig:
Can't find address for local master
Perhaps you need to query another pool.

When I get to the files you requested me I will forward them to you.
Thanks for your help.


Respectfully,
Alex Alas
Systems Administrator
Fugro EarthData Inc.

-----Original Message-----
From: condor-users-bounces@xxxxxxxxxxx
[mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Tony Rippy
Sent: Friday, April 18, 2008 1:59 PM
To: Condor-Users Mail List
Subject: Re: [Condor-users] FW: Linux computers cannot join
WindowsCentralManager

Hi Alex,

Can you post your CollectorLog and the StartLog from your Linux
execute host? With any luck that will contain some information about
what happens with the UDP and TCP communication when the execute host
attempts to update the collector.

===================================
Tony Rippy
Phone: 888.292.5320

Cycle Computing, LLC
Leader in Condor Grid Solutions
Enterprise Condor Support and Management Tools

http://www.cyclecomputing.com


On Apr 18, 2008, at 1:05 PM, Alex Alas wrote:

G.T.
Thank you for your suggestion, I did try it but it didn't work, before
applying it I read it is intended for wan pools our case is not all
our
computers are in the same LAN, they might be in different subnets but
they co-exist well, no dns conflicts that we are aware. If Anyone has
more ideas on how to fix this issue or could point me in the right
direction to find a solution, your help/input is well appreciated.
Thanks in advance....

Respectfully,
Alex

-----Original Message-----
From: condor-users-bounces@xxxxxxxxxxx
[mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of G.T. Chiang
Sent: Thursday, April 17, 2008 10:50 AM
To: Condor-Users Mail List
Cc: Kevin Clark
Subject: Re: [Condor-users] Linux computers cannot join Windows
CentralManager

Hi

I had similar probolem before, you probably can try use
UPDATE_COLLECTOR_WITH_TCP = True on central manager.

On Apr 17 2008, Alex Alas wrote:

We have set up a windows XP computer as a central manager with its
respective pool, we also setup another windows system as a Condor
ViewServer , joining windows systems to the pool doesn't seem to be a
problem but when we want to join Linux systems we cannot do it. We
had
reviewed all settings on the Linux systems and all seem to be
alright,
we have no errors to reports, we had reviewed all logs and we cannot
find any errors that can tell us what to troubleshoot. Windows and
Linux
systems can ping each others.

If anyone had incurred in the past with a similar situation and can
give
us a lead to fix our problem, I'd thank you in advance...



Respectfully,

Alex Alas

Systems Administrator






--
Gen-Tao Chiang
Computer Officer
NIEeS (National Institute for Environmental eScience)
Department of Earth Sciences, University of Cambridge
Downing Street, Cambridge, CB2 3EQ

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx
with
a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/
_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx
with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with
a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/
_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/