Mailing List Archives Authenticated access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] duplicate jobIDs in the condor_history

Date: Wed, 24 Nov 2010 16:22:40 -0600
From: Todd Tannenbaum <tannenba@xxxxxxxxxxx>
Subject: Re: [Condor-users] duplicate jobIDs in the condor_history

Ian Chesal wrote:

So the first question is:
Did you delete the $(SPOOL) directory for the scheduler or the contentsof that directory or the job_queue.log files? If so, you reset the thecluster ID counter and that's why you've got duplicates.
If you're certain you haven't wiped the job_queue.log file for thescheduler, is it possible you have multiple schedulers writing to thesame history file? If so: that's bad.

Or perhaps you have multiple schedds writing to the same job_queue.logfile?? That would also be really bad.


> Each scheduler should have its own

history file.

I would state a superset of the above: each schedd should have its ownprivate log and spool subdirectory.

In any event, i think you can reset the next job id Condor assigns byshutting down your schedd (condor_off -schedd), and append the followingto the end of the spool/job_queue.log file:

  105
  103 0.0 NextClusterNum xxxxx
  106

where xxx = the next job cluster id you want to be assigned. Then turnyour schedd back on (condor_on -schedd). Note I haven't tried thisformula, so buyer beware. And if you haven't fixed the underlyingproblem why the job ids got reused, it may happen again...


Hope the above helps
Todd

Follow-Ups:
- Re: [Condor-users] duplicate jobIDs in the condor_history
  - From: Santanu Das

References:
- [Condor-users] Trouble running multithreaded job in vanilla universe
  - From: Christopher Whelan
- Re: [Condor-users] Trouble running multithreaded job in vanilla universe
  - From: Ian Chesal
- Re: [Condor-users] Trouble running multithreaded job in vanilla universe
  - From: Christopher Whelan
- [Condor-users] duplicate jobIDs in the condor_history
  - From: Santanu Das
- Re: [Condor-users] duplicate jobIDs in the condor_history
  - From: Ian Chesal

Prev by Date: Re: [Condor-users] duplicate jobIDs in the condor_history
Next by Date: Re: [Condor-users] duplicate jobIDs in the condor_history
Previous by thread: Re: [Condor-users] duplicate jobIDs in the condor_history
Next by thread: Re: [Condor-users] duplicate jobIDs in the condor_history
Index(es):
- Date
- Thread

Mailing List Archives

Authenticated access

Re: [Condor-users] duplicate jobIDs in the condor_history