[OpenAFS] Re: Connection Timed Out errors occasionally when accessing openafs drive

Derrick Brashear shadow@gmail.com
Mon, 8 Jun 2009 00:51:14 -0400


On Sun, Jun 7, 2009 at 5:02 PM, Russ Allbery<rra@stanford.edu> wrote:
> Jim Rowan <jmr@qualcomm.com> writes:
>> On Jun 7, 2009, at 1:32 PM, Russ Allbery wrote:
>
>>> We tracked a similar problem down to user programs that were trying to
>>> access directories to which they didn't have permission (in our case
>>> because their tokens had expired).  Sufficient pounding on such
>>> directories will trigger the Rx backoff handling in the file server
>>> and start delaying Rx calls from that client, which can result in the
>>> client deciding the file server is down or no longer responding.
>
>> How did you track that down?
>
> Correlating system kernel messages with error messages from the web
> server.

A simple way is to watch tcpdump for aborts coming from the fileserver
with decreasing frequency over time while, at the same time, the client
keeps retransmitting.
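
If you would rather not eyeball the capture, the check can be sketched
roughly as below. This is only an illustration, not anything from the
OpenAFS tree: it assumes the usual 28-byte Rx header with the packet type
in the byte at offset 20 and type 4 meaning abort, and it assumes you have
already pulled (timestamp, UDP payload) pairs for packets sent by the
fileserver out of the capture (for example from something like
"tcpdump -n udp port 7000"). The function names and the "at least three
gaps" threshold are made up for the example.

```python
import struct

# Assumed Rx header layout: epoch, cid, callNumber, seq, serial (4 bytes each),
# type, flags, userStatus, securityIndex (1 byte each), spare, serviceId (2 each).
RX_HEADER = struct.Struct("!IIIIIBBBBHH")  # 28 bytes, network byte order
RX_TYPE_ABORT = 4  # assumed value of the Rx abort packet type


def rx_packet_type(payload):
    """Return the Rx packet type of a UDP payload, or None if it is too short."""
    if len(payload) < RX_HEADER.size:
        return None
    return RX_HEADER.unpack_from(payload)[5]


def abort_gaps(events):
    """Given (timestamp, payload) pairs sent by the fileserver, in capture
    order, return the time gaps between consecutive abort packets."""
    times = [t for t, payload in events if rx_packet_type(payload) == RX_TYPE_ABORT]
    return [later - earlier for earlier, later in zip(times, times[1:])]


def looks_like_backoff(gaps, min_gaps=3):
    """Hypothetical heuristic: aborts arrive less and less often, i.e. the
    gaps never shrink and the last gap is longer than the first."""
    if len(gaps) < min_gaps:
        return False
    never_shrinks = all(b >= a for a, b in zip(gaps, gaps[1:]))
    return never_shrinks and gaps[-1] > gaps[0]
```

If looks_like_backoff(abort_gaps(events)) comes back true while the same
capture shows the client resending its calls, that matches the pattern
described above. Treat it as a hint to go look at the trace, not as proof.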


-- 
Derrick