[OpenAFS-devel] Re: delays and lost contact with fileserver with 1.3.84 and higher

Niklas Edmundsson Niklas.Edmundsson@hpc2n.umu.se
Tue, 8 Nov 2005 22:19:16 +0100 (MET)


On Tue, 8 Nov 2005, Jeffrey Altman wrote:

> Alexander Bergolth wrote:
>> Btw. I've found out that clients on AIX (5.2., oslevel 07) also show the
>> same problem.
>> However Windows Clients (Version 1.4.0008) don't suffer from the delays.
>
> I find this quite interesting.   This provides a bit more weight to my
> theory that the actual bug is not in the RX patches that were applied
> between .82 and .84.  I believe there is a race condition in the RX
> library on non-Windows platforms.  The patches to RX to use thread local
> queues and statistics gathering increases the parallelism and therefore
> the likelihood that the race condition will be triggered.

This feels like the same bug that I mailed about a while ago. I 
noticed that decreasing the chunksize makes it trigger more often, and 
while transferring a large file (DVD iso) it occurs more often at the 
end than in the beginning. This goes for both my Linux test client 
(UP) and my AIX 5.3 client (SMP).


/Nikke
-- 
-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-
  Niklas Edmundsson, Admin @ {acc,hpc2n}.umu.se     |    nikke@hpc2n.umu.se
---------------------------------------------------------------------------
  You made a living doing this? - Guinan
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=