Mailing List Archives
	Authenticated access
	
	
     | 
    
	 
	 
     | 
    
	
	 
     | 
  
 
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Condor-users] Updated version of "Linux Scalability" Condor page
- Date: Fri, 18 Dec 2009 09:24:20 -0500
 
- From: Ian Stokes-Rees <ijstokes@xxxxxxxxxxxxxxxxxxx>
 
- Subject: [Condor-users] Updated version of "Linux Scalability" Condor page
 
Is there an updated version of this page:
http://www.cs.wisc.edu/condor/condorg/linux_scalability.html
?
In particular, I'm trying to create a 100k node DAG (flat, no 
dependencies), with MAXJOBS 6000 and I'm getting the error:
**** PANIC -- OUT OF FILE DESCRIPTORS at line 796 in dprintf.c
root # cat /proc/sys/fs/file-max
781235
ijstokes $ ulimit -n
1024
These are in 100k separate classads in 100k directories (in a 2-tier 
hierarchy groupX/nodeY, so as to avoid overloading a single directory), 
with 100k log files in each of the node directories.
It takes about 1 hour for the DAG to be submitted.  I've bumped up 
ulmits to a level which should get rid of the problem, but it isn't 
clear if I need to re-submit the DAG, restart Condor, logout/login, or 
even reboot the machine to have these changes come into effect.  Any 
advice kindly appreciated.
Regards,
Ian
$ ulimit -H -a
file size               (blocks, -f) unlimited
pending signals                 (-i) 69632
open files                      (-n) 40000
max user processes              (-u) 20000
--
Ian Stokes-Rees, Research Associate
SBGrid, Harvard Medical School
http://sbgrid.org