[OpenAFS] Problems on AFS Unix clients after AFS fileserver moves
Rich Sudlow
rich@nd.edu
Tue, 09 Aug 2005 16:21:57 -0500
Dexter 'Kim' Kimball wrote:
> fs checkv will cause the client to discard what it remembers about volumes.
> Did you try that?
No - That worked!
Thanks
Rich
>
> Kim
>
>
> -----Original Message-----
> From: openafs-info-admin@openafs.org
> [mailto:openafs-info-admin@openafs.org] On Behalf Of Rich Sudlow
> Sent: Tuesday, August 09, 2005 9:58 AM
> To: openafs
> Subject: [OpenAFS] Problems on AFS Unix clients after AFS
> fileserver moves
>
>
> We've been having problems with our cell for the last couple
> years with AFS clients after fileservers are taken out of service.
> Before that things seemed to work ok when doing fileserver
> moves and
> rebuilding. All data was moved off the fileserver but the clients
> still seem to have some need to talk to it. In the past the AFS
> admins have left the fileservers up and empty for a number of
> days to try to resolve this issue - but it doesn't resolve the
> issue.
>
> For example a recent example:
>
> The fileserver reno.helios.nd.edu was shutdown after all data
> moved off of it. However the client still can't get to
> a number of AFS files.
>
> [root@xeon109 root]# fs checkservers
> These servers unavailable due to network or server problems:
> reno.helios.nd.edu.
> [root@xeon109 root]# cmdebug reno.helios.nd.edu -long
> cmdebug: error checking locks: server or network not responding
> cmdebug: failed to get cache entry 0 (server or network
> not responding)
> [root@xeon109 root]# cmdebug reno.helios.nd.edu
> cmdebug: error checking locks: server or network not responding
> cmdebug: failed to get cache entry 0 (server or network
> not responding)
> [root@xeon109 root]#
>
> [root@xeon109 root]# vos listvldb -server reno.helios.nd.edu
> VLDB entries for server reno.helios.nd.edu
>
> Total entries: 0
> [root@xeon109 root]#
>
> on the client:
> rxdebug localhost 7001 -version
> Trying 127.0.0.1 (port 7001):
> AFS version: OpenAFS 1.2.11 built 2004-01-11
>
>
> This is a linux 2.4 client and I don't have kdump - have
> also had these
> problems on sun4x_58 clients too.
>
> I should mention that we've seen some correlation
> to this happening on machines with "busy" AFS caches -
> which makes it
> even more frustrating as it seems to affect machines which
> depend on
> AFS the most. We've tried lots of fs flush* * -
> So far we've ended up rebooting which does fix the
> problem.
>
> Does anyone have any clues what the problem is or what a workaround
> might be?
>
> Thanks
>
> Rich
>
> --
> Rich Sudlow
> University of Notre Dame
> Office of Information Technologies
> 321 Information Technologies Center
> PO Box 539
> Notre Dame, IN 46556-0539
>
> (574) 631-7258 office phone
> (574) 631-9283 office fax
>
> _______________________________________________
> OpenAFS-info mailing list
> OpenAFS-info@openafs.org
> https://lists.openafs.org/mailman/listinfo/openafs-info
>
>
>
> _______________________________________________
> OpenAFS-info mailing list
> OpenAFS-info@openafs.org
> https://lists.openafs.org/mailman/listinfo/openafs-info
--
Rich Sudlow
University of Notre Dame
Office of Information Technologies
321 Information Technologies Center
PO Box 539
Notre Dame, IN 46556-0539
(574) 631-7258 office phone
(574) 631-9283 office fax