Mailing List Archives Authenticated access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] confusion around new spool in 7.5.5

Date: Fri, 18 Feb 2011 11:21:21 -0600
From: Dan Bradley <dan@xxxxxxxxxxxx>
Subject: Re: [Condor-users] confusion around new spool in 7.5.5



On 2/18/11 11:12 AM, Peter Doherty wrote:

On Feb 18, 2011, at 11:26 , Peter Doherty wrote:
I upgraded to v7.5.5 and there's one thing I'm scratching my head over.
I used to have a SPOOL directory filled with directories with nameslike:
cluster15093481.proc0.subproc0.tmp/

According to the changelog I should now have dirs in the format of:
$(SPOOL)/<#>/<#>/cluster<#>.proc<#>.subproc<#>


But the thing is, I don't have anything.
my SPOOL just has:
job_queue.log
local_univ_execute
spool_version

I've got a few thousand jobs in the queue right now.
Where are the spool files? I'm sure I'm looking in the correctdirectory. I've tried to find them, but I can't. I see a lot oflock files in $(TMP_DIR)
I believe the constant I/O of all the spool files was one of thebottlenecks of our Schedd, so if that's really been improved upon,I'm eager to see the effect, but from reading the changelog, the onlydifferent should have been subdirs for the spool to keep from hittingext3 limits.
Hmm, okay. Jobs seem to be running okay, but I see a lot of theseerrors in the Shadow Log:
02/18/11 12:09:25 (pid:649) (15101845.0) (649):Directory::setOwnerPriv() -- failed to find owner of/raid0/gwms_schedd/spool/1845/0/cluster15101845.proc0.subproc0.tmp02/18/11 12:09:25 (pid:649) (15101845.0) (649): Directory::Rewind():failed to find owner of"/raid0/gwms_schedd/spool/1845/0/cluster15101845.proc0.subproc0.tmp"
I guess that's part of the problem. I checked the perms on the spooldirectory, and then I set it to 777 and verified regular users canwrite to it, but that didn't stop the errors, or cause files to becreated there.
So I'm not really clear what's going on.

It appears that these messages are expected for jobs that do not havespool directories (see my other message). Therefore, we should fix theshadow not to generate noise in this case.


--Dan

References:
- [Condor-users] confusion around new spool in 7.5.5
  - From: Peter Doherty
- Re: [Condor-users] confusion around new spool in 7.5.5
  - From: Peter Doherty

Prev by Date: Re: [Condor-users] confusion around new spool in 7.5.5
Next by Date: Re: [Condor-users] confusion around new spool in 7.5.5
Previous by thread: Re: [Condor-users] confusion around new spool in 7.5.5
Next by thread: Re: [Condor-users] confusion around new spool in 7.5.5
Index(es):
- Date
- Thread

Mailing List Archives

Authenticated access

Re: [Condor-users] confusion around new spool in 7.5.5