[OpenAFS] 1.4.x, select() and recent RHEL kernels beware

Jack Neely jjneely@pams.ncsu.edu
Mon, 26 Nov 2012 11:30:04 -0500


On Thu, Nov 08, 2012 at 06:20:18PM +0100, Stephan Wiesand wrote:
> Hi Dan,
> 
> On Nov 8, 2012, at 16:41 , Dan Van Der Ster wrote:
> 
> [...]
> 
> > All of the nasty details of this incident here:
> >    https://afs.web.cern.ch/afs/reports/html/afs200SegFaults.html
> > 
> > We're now running with a workaround,
> >  ulimit -Hn 1024; ulimit -Sn 1024
> > in our init scripts until we manage to upgrade to 1.6.
> > 
> > Hope this saves someone the effort of troubleshooting this again.
> 
> Great work (again). Thanks a lot for sharing this!
> 
> Cheers,
> 	Stephan
> 

We've had this issue occur at NCSU as well.  I'm trying to figure out if
1.6.2 will be out soon enough to wait for it, or have multiple outages
for installing the ulimits and then upgrading to 1.6.2 when its
available.  (Or spend weeks moving volumes.)  There are another fix or
two in 1.6.2 I'd like to apply to our servers.

Jack Neely

-- 
Jack Neely <jjneely@ncsu.edu>
Linux Czar, OIT Campus Linux Services
Office of Information Technology, NC State University
GPG Fingerprint: 1917 5AC1 E828 9337 7AA4  EA6B 213B 765F 3B6A 5B89