Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [HTCondor-users] Not starting Schedduler
- Date: Wed, 23 Oct 2013 07:28:32 -0400
- From: Jon Thomas <jthomas@xxxxxxxxxx>
- Subject: Re: [HTCondor-users] Not starting Schedduler
On Wed, 2013-10-23 at 07:57 +0100, daniel popu wrote:
> Hi,
>
> Lately on one of my computers from the grid the condor_schedd is no
> longer starting.
> The log show the below error.
>
> Any idea how to fix this?
>
> ******************************************************
> ** condor_schedd.exe (CONDOR_SCHEDD) STARTING UP
> ** D:\condor\bin\condor_schedd.exe
> ** SubsystemInfo: name=SCHEDD type=SCHEDD(5) class=DAEMON(1)
> ** Configuration: subsystem:SCHEDD local:<NONE> class:DAEMON
> ** $CondorVersion: 8.0.3 Sep 19 2013 BuildID: 174914 $
> ** $CondorPlatform: x86_64_Windows8 $
> ** PID = 5968
> ** Log last touched 10/23 09:50:04
> ******************************************************
> Using config source: d:\condor\condor_config
> Using local config sources:
> D:\condor/condor_config.local
> DaemonCore: command socket at <10.214.96.92:58235>
> DaemonCore: private command socket at <10.214.96.92:58235>
> History file rotation is enabled.
> Maximum history file size is: 20971520 bytes
> Number of rotated history files is: 2
> NOTE: QUEUE_ALL_USERS_TRUSTED=TRUE - all queue access checks disabled!
> WARNING: Encountered corrupt log record 7 (byte offset 177)
> Lines following corrupt log record 7 (up to 3):
> 103 356.1123 In "NUL"
> 103 356.1123 Owner "< myId >"
> 103 356.1123 User "< myEmailAddress >"
> ERROR "Error: corrupt log record 7 (byte offset 177) occurred inside
> closed transaction, recovery failed" at line 1143 in file c:\condor
> \execute\dir_21744\userdir\src\condor_utils\classad_log.cpp
> Cron: Killing all jobs
> CronJobList: Deleting all jobs
> Cron: Killing all jobs
> CronJobList: Deleting all jobs
>
If you don't mind losing the job queue ( which means any job already
submitted will not be retried), you can just remove the job_queue.log
and restart.
Save the log. Some folks monitoring this list may want to see the log to
see if it's a bug.
> All the best,
> Daniel
> _______________________________________________
> HTCondor-users mailing list
> To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
> subject: Unsubscribe
> You can also unsubscribe by visiting
> https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
>
> The archives can be found at:
> https://lists.cs.wisc.edu/archive/htcondor-users/