[OpenAFS] afs: Lost contact with file server xxx.xxx.xxx.x

Derrick Brashear shadow@gmail.com
Sat, 19 Apr 2008 09:06:12 -0400


On Fri, Apr 18, 2008 at 8:23 PM, Derek Atkins <warlord@mit.edu> wrote:
> Jeffrey Altman <jaltman@secure-endpoints.com> writes:
>
>  > This was fixed in the Windows cache manager by always retrying RPCs
>  > sent on an existing RX connection that timed out once with a new RX
>  > connection.  I am not sure that a similar change was ever made to the
>  > UNIX cache manager.
>
>  For the record, it still bugs me deeply that so much code was forked
>  for the Windows CM instead of trying to reuse a lot of the cross-platform
>  CM code on Windows.  It leads to issues like these, where the Windows
>  system behaves significantly different than everywhere else.  While
>  it might seem in the short term expedient to just "do it the windows
>  way" I suspect that a good API could still be extremely cross-platform,
>  even in the kernel.  Doing this refactoring work would, IMHO, greatly
>  improve both the windows and non-windows OpenAFS solutions.
>
>  Unfortunately I cannot implement (or fund) this work...  So I'm sure
>  I'll be ignored.
>

I actually have the change in a sandbox waiting to test, and no time
to do so yet.