[OpenAFS-devel] delays and lost contact with fileserver with 1.3.84 and higher

Alexander Bergolth leo@strike.wu-wien.ac.at
Tue, 01 Nov 2005 21:06:34 +0100


On 11/01/05 19:13, Tom Keiser wrote:
 > On 10/31/05, Alexander Bergolth <leo@strike.wu-wien.ac.at> wrote:
 >>It is this patch that causes the stalls on my system:
 >>
 >>http://www.openafs.org/cgi-bin/cvsweb.cgi/openafs/src/rx/rx.c#rev1.58.2.19
 >>
 >>- -------------------- snipp! --------------------
 >>DELTA STABLE14-rx-fpq-bulk-free-20050529
 >>AUTHOR tkeiser@psu.edu
[...]
 >>
 >>http://www.openafs.org/cgi-bin/cvsweb.cgi/openafs/src/rx/rx.c.diff?r1=1.58.2.18&r2=1.58.2.19
 >>
 >>I reverted it in 1.3.84 and now the delays are gone.
 >
 > Could you try the attached patch on a pristine >= 1.3.84 tree?  I
 > can't reproduce your bug, but if you're hitting this code path, then
 > the math error this patch fixes may help your problem.

No, sorry. It didn't fix it.

Any other info I could provide? (The stack traces during the stalls 
don't contain any useful information?)

Is there a way to trace function calls inside the openafs-kernel-module 
on Linux 2.6 without using the linux trace toolkit? (And thus having to 
recompile the kernel.)

Cheers,
--leo
-- 
-----------------------------------------------------------------------
Alexander.Bergolth@wu-wien.ac.at                Fax: +43-1-31336-906050
Zentrum fuer Informatikdienste - Wirtschaftsuniversitaet Wien - Austria