Re: [Condor-users] 6.7.20 windows condor_quill problem
- Date: Mon, 03 Jul 2006 18:38:31 +0200
- From: Horvátth Szabolcs <szabolcs@xxxxxxxxxxxxx>
- Subject: Re: [Condor-users] 6.7.20 windows condor_quill problem
Hi Erik,
After fighting with the problems for hours I had to turn the quill daemon
off completely. I tried enabling it again today to save some logs for you,
and it worked nicely for a few hours. Then its memory usage started growing
(from 4 MB up to 10 MB) and then it "lost connection" with the scheduler.
Now if I query the state of the queue using condor_q I get an older
state, while condor_q -d schedd shows the "real" queue. (It's empty now
btw, so it's quite easy to spot the difference.)
The only thing I found in the log file was this:
Now in new log file C:\Condor/log/QuillLog
7/3 17:47:51 Skipping over and going to the next job queue file
7/3 17:47:51 [QUILL] Unable to open the job_queue.log file!
7/3 17:47:51 Could not open file old job queue file C:\Condor/spool/job_queue.log.22989 --- ERROR
7/3 17:47:51 Skipping over and going to the next job queue file
7/3 17:47:51 [QUILL] Unable to open the job_queue.log file!
7/3 17:47:51 Could not open file old job queue file C:\Condor/spool/job_queue.log.22990 --- ERROR
7/3 17:47:51 Skipping over and going to the next job queue file
7/3 17:47:51 [QUILL] Unable to open the job_queue.log file!
7/3 17:47:51 Could not open file old job queue file C:\Condor/spool/job_queue.log.22991 --- ERROR
7/3 17:47:51 Skipping over and going to the next job queue file
7/3 17:47:51 [QUILL] ERROR: There is no such Proc Ad[22992.0]
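To list which job queue files Quill complained about, something like the following might help. It is only a rough sketch in Python, not part of Condor: the function name and the regular expression are my own assumptions, based purely on the lines quoted above, and the real log may contain other message formats.

```python
import re

# Assumed message format, taken only from the excerpt in this mail.
# \s+ also matches a line break, so it works on mail-wrapped log lines too.
MISSING_FILE = re.compile(
    r"Could not open file old job queue file\s+(\S+)\s+--- ERROR"
)

def missing_queue_files(log_text):
    """Return the job queue file paths that could not be opened."""
    return MISSING_FILE.findall(log_text)

# Small sample: one wrapped entry and one unwrapped entry.
sample = r"""
7/3 17:47:51 [QUILL] Unable to open the job_queue.log file!
7/3 17:47:51 Could not open file old job queue file
C:\Condor/spool/job_queue.log.22989 --- ERROR
7/3 17:47:51 Skipping over and going to the next job queue file
7/3 17:47:51 Could not open file old job queue file C:\Condor/spool/job_queue.log.22990 --- ERROR
"""

for path in missing_queue_files(sample):
    print(path)
```

Running the same function over the whole QuillLog would show whether the missing files are always consecutively numbered, as they are here.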
The rest is just:
7/3 17:48:36 POLLING RESULT: NO CHANGE
7/3 17:48:36 ********* End of Probing Job Queue Log File *********
7/3 17:48:36 ++++++++ Sending schedd ad to collector ++++++++
7/3 17:48:36 ++++++++ Sent schedd ad to collector ++++++++
7/3 17:48:46 ******** Start of Probing Job Queue Log File ********
7/3 17:48:46 === Current Probing Information ===
7/3 17:48:46 fsize: 2697926 mtime: 1151941358
7/3 17:48:46 first log entry: 22992.0 WallClockCheckpoint 4711
7/3 17:48:46 POLLING RESULT: NO CHANGE
...
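The fsize/mtime line can be decoded with a few lines of Python. This is a minimal sketch that assumes mtime is a Unix timestamp in seconds, which the log does not state; parse_probe_line is a hypothetical helper, not a Condor tool.

```python
import re
from datetime import datetime, timezone

def parse_probe_line(line):
    """Extract (fsize, mtime as UTC datetime) from a probing log line."""
    match = re.search(r"fsize:\s*(\d+)\s+mtime:\s*(\d+)", line)
    if match is None:
        return None
    fsize, mtime = (int(group) for group in match.groups())
    # Assumption: mtime counts seconds since the Unix epoch.
    return fsize, datetime.fromtimestamp(mtime, tz=timezone.utc)

fsize, modified = parse_probe_line(
    "7/3 17:48:46 fsize: 2697926 mtime: 1151941358"
)
print(fsize, modified.isoformat())
# prints: 2697926 2006-07-03T15:42:38+00:00
```

If that assumption holds, job_queue.log was last modified at 15:42:38 UTC, which is 17:42:38 in my +0200 timezone, about five minutes before the errors logged at 17:47:51.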
Cheers,
Szabolcs
Erik Paulson wrote:
Could you put the full quill logfile on a webserver/ftp server and drop a
note to condor-admin with the URL?
Thanks,
-Erik