[OpenAFS] Re: volserver crashing

Eric Chris Garrison ecgarris@iupui.edu
Tue, 12 Apr 2011 12:36:14 -0400


On 4/12/11 11:43 AM, Andrew Deason <adeason@sinenomine.net> wrote:
> On Tue, 12 Apr 2011 09:27:58 -0400
> Eric Chris Garrison<ecgarris@iupui.edu>  wrote:
>> >  I've recently upgraded all my servers to openafs-1.4.14-1.1.1 and
> Is this RHEL4/5, or ... ? I assume these are binaries/RPMs from
> openafs.org?

We compiled from source on RHEL 5.5 to make RPM packages to distribute 
on our RHEL servers.  One machine is still at RHEL 4, and it had its 
RPMs compiled separately.

>> >  Then I moved on to the "project" volumes, which have a much higher
>> >  quota.  One (383GB in size) seems to cause problems when I try to move
>> >  it.   It moves a LOT faster (more like 300-400 Mbit/s), but at some
>> >  point, the volserver on the receiving end crashes and all volume moves
>> >  abort:
>> >  
>> >  Apr 10 12:52:41 rfsb2 kernel: volserver[25425]: segfault at
>> >  000000009cf1fdd8 rip 0000003b9ce79a30 rsp 0000000042b42208 error 4
> Do you get a core in /usr/afs/logs ? If you can get a backtrace in gdb
> (just run 'bt'), we could tell you what this is.
>
> If you don't get a core, make sure you're running bosserver with
> 'ulimit -c unlimited'
Yeah, I don't have a core, RHEL sets ulimit to such that none are 
produced.  I'm changing that for the next one.  Once I'm done with some 
long volume transfers, I'll try that problem volume again to see if it 
coughs up a core.

Dumb question:  Would I have to restart bosserver?  Can I do so without 
being disruptive (i.e. restarting the fileserver process) to my users?

Thanks,

Chris