[OpenAFS-port-darwin] startup cache scan hang

Derrick J Brashear shadow@dementia.org
Tue, 29 May 2007 13:17:07 -0400 (EDT)


On Sun, 27 May 2007, Nicholas Riley wrote:

> Hi,
>
> Has anyone seen OpenAFS hang on startup, seemingly during a cache
> scan?  I experienced hangs on over 50% of our machines after both
> recent security updates - even after waiting 15 minutes or more, /afs
> doesn't mount.  This is OpenAFS 1.4.4 on both Intel and PowerPC
> machines (the problem seems a bit more prevalent on PowerPC).  We
> don't have any similar problems on Linux or Solaris.
>
> Here's what the system log says:
>
> May 27 14:42:24 bender kernel[0]: Starting AFS cache scan...
> [...]
> May 27 14:42:28 bender kernel[0]: [256] waiting for afs_osi_ctxtp
> May 27 14:42:33 bender kernel[0]: [256] waiting for afs_osi_ctxtp
>
> And there seem to be a bunch of zombie afsds around.  The below
> transcribed from the screen since my console/SSH connections hung
> entirely shortly thereafter, so there may be a few errors in it.
>
> USER    PID %CPU %MEM   VSZ   RSS  TT  STAT STARTED    TIME  COMMAND
> root    237  0.0  0.4  27692  2028  ??  U    2:42PM  0:00.24 /usr/sbin/afsd -afsdb -stat 10000 -dcache 2500 -daemons 5 -volumes 70 -dynroot -fakestat-all
> root    255  0.0  0.0      0     0  ??  Z   31Dec69  0:00.00 (afsd)
> root    256  0.0  0.0  27692  2028  ??  Us   2:42PM  0:00.00 /usr/sbin/afsd -afsdb -stat 10000 -dcache 2500 -daemons 5 -volumes 70 -dynroot -fakestat-all
> root    252  0.0  0.0      0     0  ??  Z   31Dec69  0:00.00 (afsd)
> root    253  0.0  0.0      0     0  ??  Z   31Dec69  0:00.00 (afsd)
> root    254  0.0  0.0      0     0  ??  Z   31Dec69  0:00.00 (afsd)

> I've been thinking of simply blowing away the cache directory before
> starting AFS - would that be likely to help?  Is there any other info
> that's useful in diagnosing the problem?

kernel backtrace would be of tremendous help.