Hi, I've just got some better snippets from the logs below. I've tried looking for what might be the cause of the problems in the docs without any success. As for the various messages in the logs, searching online brings me to other users's questions but no answers. How have other users gotten these problems resolved? CollectorLog:4/13 11:09:03 Housekeeper: Ready to clean old ads CollectorLog:4/13 11:09:03 Cleaning StartdAds ... CollectorLog:4/13 11:09:03 Cleaning StartdPrivateAds ... CollectorLog:4/13 11:09:03 Cleaning QuillAds ... CollectorLog:4/13 11:09:03 Cleaning ScheddAds ... CollectorLog:4/13 11:09:03 Cleaning SubmittorAds ... CollectorLog:4/13 11:09:03 Cleaning LicenseAds ... CollectorLog:4/13 11:09:03 Cleaning MasterAds ... CollectorLog:4/13 11:09:03 Cleaning CkptServerAds ... CollectorLog:4/13 11:09:03 Cleaning CollectorAds ... CollectorLog:4/13 11:09:03 Cleaning StorageAds ... CollectorLog:4/13 11:09:03 Cleaning NegotiatorAds ... CollectorLog:4/13 11:09:03 Cleaning HadAds ... CollectorLog:4/13 11:09:03 Cleaning Generic Ads ... CollectorLog:4/13 11:09:03 Housekeeper: Done cleaning CollectorLog:4/13 11:09:03 DaemonCore: PERMISSION DENIED to unknown user from host <142.58.x.x:56057> for command 1 (UPDATE_SCHEDD_AD) CollectorLog:4/13 11:09:03 DaemonCore: PERMISSION DENIED to unknown user from host <142.58.x.x:56059> for command 11 (UPDATE_SUBMITTOR_AD) CollectorLog:4/13 11:09:03 Found StartdIpAddr CollectorLog:4/13 11:09:03 Got IP = '<142.58.y.y:49192>' CollectorLog:4/13 11:09:03 DaemonCore: PERMISSION DENIED to unknown user from host <142.58.x.x:53049> for command 48 (QUERY_ANY_ADS) CollectorLog:4/13 11:09:03 NegotiatorAd : Inserting ** "< x.irmacs.sfu.ca >" NegotiatorLog:4/13 11:09:03 ---------- Started Negotiation Cycle ---------- NegotiatorLog:4/13 11:09:03 Phase 1: Obtaining ads from collector ... NegotiatorLog:4/13 11:09:03 Getting all public ads ... NegotiatorLog:4/13 11:09:03 IO: Failed to read packet header NegotiatorLog:4/13 11:09:03 Couldn't fetch ads: communication error NegotiatorLog:4/13 11:09:03 Aborting negotiation cycle SchedLog:4/13 11:09:03 (pid:11654) Sent ad to central manager for user@xxxxxxxxxxxxx SchedLog:4/13 11:09:03 (pid:11654) Sent ad to 1 collectors for user@xxxxxxxxxxxxx On Monday 10 April 2006 11:34, Dominic Lepiane wrote: > Greetings, > > I'm taking over a Condor installation at our site and there have been some > problems recently. I am currently trying to just do a fresh install of the > latest version (6.7.18) and cannot get a sample job to run on the master. > I haven't bothered installing other machines yet and I can do that, however > the errors in the logs seem to indicate the problem is not resource > availability. > > Any advice is much appreciated. Thanks in advance! > > OS: 10.4.6 > Condor: 6.7.18 > Install command: > sudo ./condor_configure --install --install-dir=/usr/local/condor > --local-dir=/usr/local/condor/local --type=submit,execute,manager > --owner=condor --verbose > > Simple vanilla submission of a "Hello, World!" shell script, > condor_q shows job ID 1.0 is idle, > > SchedLog : > IO: Failed to read packet header > failed to send RESCHEDULE command to negotiator > IO: Failed to read packet header (many times) > Sent ad to central manager for <user> > Sent ad to 1 collectors for <user> > Sent ad to central manager for <user> > Sent ad to 1 collectors for <user> > > NegotiatorLog : > Phase 1: Obtaining ads from collector.... > Getting all public ads ... > IO: Failed to read packet header > Couldn't fetch ads: communication error > Aborting negotiation cycle > (same cycle repeats every 5 minutes) -- Dominic Lepiane Simon Fraser University/IRMACS dlepiane@xxxxxxxxx (604)268-7369
Attachment:
pgphUCn8joDOt.pgp
Description: PGP signature