HTCondor Project List Archives



[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-devel] ProcAPI Messages



On Mon, May 11, 2009 at 03:32:20PM -0500, Jim Summers wrote:
> Hello All,
> 
> I modified the procapi.h file so that all of the age, creation_tim and 
> sample_time variables use an unsigned long type.  That seems to have fixed the 
> ProcAPI errors that we were seeing.
> 
> But now we are seeing the following in SchedLog:
> 5/11 14:53:38 (fd:7) (pid:57011) In 
> DaemonCore::Create_Process(/usr/local/condor/sbin/condor_procd,...)
> 5/11 14:53:38 (fd:7) (pid:57011) PRIV_CONDOR --> PRIV_ROOT at daemon_core.cpp:6852
> 5/11 14:53:38 (fd:7) (pid:57011) PRIV_ROOT --> PRIV_CONDOR at daemon_core.cpp:6885
> 5/11 14:53:38 (fd:11) (pid:57011) Create Process: fork() failed: Resource 
> temporarily unavailable (35)
> 5/11 14:53:38 (fd:7) (pid:57011) start_procd: unable to execute the procd
> 5/11 14:53:38 (fd:5) (pid:57011) Close_Pipe(pipe_end=65536) succeeded
> 5/11 14:53:38 (fd:5) (pid:57011) Close_Pipe(pipe_end=65537) succeeded
> 5/11 14:53:38 (fd:5) (pid:57011) ERROR "unable to start the ProcD" at line 620 
> in file proc_family_proxy.cpp
> 
> I am not sure what to do at this point?
> 
> Ideas / Suggestions?

Do you have your process limit set really low for your uid?

As for the code changes, you could attach a patch to you message
and we can see if we can apply it. I'll have to scrutinize the patch
closely because even though one might think the age of a process can't
be negative, due to kernel issues a negative age actually could be
calculated, so I'd need to do some inspection.

Have you signed our code contribution agreement?

Thank you.

-pete