[OpenAFS] Re: Linux client connection timed out after server failure

Kim Kimball dhk@ccre.com
Thu, 16 Aug 2012 09:10:40 -0600


I assume the previous file server no longer appears in VLDB, at all?

Kim
________________
dhk@ccre.com
970-215-6359


On Aug 16, 2012, at 9:03 AM, Andrew Deason <adeason@sinenomine.net> wrote:

> On Thu, 16 Aug 2012 10:22:27 -0400
> Bob Hoffman <hoffman@cs.pitt.edu> wrote:
> 
>> A number of clients persist in saying "Connection timed out" even after
>> the volumes were brought on-line on the new server.
> 
> Clients cache volume location information for 2 hours. They will
> continue to think that the volumes are on the old server until you
> invalidate the cache, or they receive a certain type of error. "The
> server is not responding" is not one of those errors.
> 
>> Here is what I've tried so far with no effect whatsoever:
>> 
>> fs flushmount /afs/cs.pitt.edu/projects/cast
>> fs flushmount /afs/cs.pitt.edu/projects
>> fs flushmount /afs/.cs.pitt.edu/projects/cast
>> fs flushmount /afs/.cs.pitt.edu/projects
>> fs flushvolume /afs/cs.pitt.edu/projects/cast
>> fs flushvolume /afs/cs.pitt.edu/projects
>> fs flushvolume /afs/.cs.pitt.edu/projects/cast
>> fs flushvolume /afs/.cs.pitt.edu/projects
>> vos release projects
>> ls -l /afs/cs.pitt.edu/projects
>> ls -l /afs/.cs.pitt.edu/projects
> 
> Try 'fs checkvolumes'.
> 
>> Is there anything I can do, short of a client reboot, to fix this?
>> Shouldn't AFS have a more graceful recovery in this kind of situation?
>> Why doesn't the client see that the volume has moved to a new server?
> 
> It could in theory recheck the vldb in this scenario, but there are
> other issues with doing that, since the majority of the time such errors
> are encountered when the volume hasn't moved or anything.
> 
> If this situation lasted for more than 2 hours and/or survived an 'fs
> checkv', that's a problem. For that, you can capture some debug data
> like so:
> 
> fstrace clear cm
> fstrace setlog cmfx -buffers 1024
> fstrace sets cm -active
> ls /afs/cs.pitt.edu/projects/cast &
> echo $!
> wait
> fstrace dump cm > /tmp/fstrace.log
> fstrace sets cm -inactive
> 
> -- 
> Andrew Deason
> adeason@sinenomine.net
> 
> _______________________________________________
> OpenAFS-info mailing list
> OpenAFS-info@openafs.org
> https://lists.openafs.org/mailman/listinfo/openafs-info