[OpenAFS-port-darwin] startup cache scan hang
Derrick J Brashear
shadow@dementia.org
Tue, 29 May 2007 13:17:07 -0400 (EDT)
On Sun, 27 May 2007, Nicholas Riley wrote:
> Hi,
>
> Has anyone seen OpenAFS hang on startup, seemingly during a cache
> scan? I experienced hangs on over 50% of our machines after both
> recent security updates - even after waiting 15 minutes or more, /afs
> doesn't mount. This is OpenAFS 1.4.4 on both Intel and PowerPC
> machines (the problem seems a bit more prevalent on PowerPC). We
> don't have any similar problems on Linux or Solaris.
>
> Here's what the system log says:
>
> May 27 14:42:24 bender kernel[0]: Starting AFS cache scan...
> [...]
> May 27 14:42:28 bender kernel[0]: [256] waiting for afs_osi_ctxtp
> May 27 14:42:33 bender kernel[0]: [256] waiting for afs_osi_ctxtp
>
> And there seem to be a bunch of zombie afsds around. The below
> transcribed from the screen since my console/SSH connections hung
> entirely shortly thereafter, so there may be a few errors in it.
>
> USER PID %CPU %MEM VSZ RSS TT STAT STARTED TIME COMMAND
> root 237 0.0 0.4 27692 2028 ?? U 2:42PM 0:00.24 /usr/sbin/afsd -afsdb -stat 10000 -dcache 2500 -daemons 5 -volumes 70 -dynroot -fakestat-all
> root 255 0.0 0.0 0 0 ?? Z 31Dec69 0:00.00 (afsd)
> root 256 0.0 0.0 27692 2028 ?? Us 2:42PM 0:00.00 /usr/sbin/afsd -afsdb -stat 10000 -dcache 2500 -daemons 5 -volumes 70 -dynroot -fakestat-all
> root 252 0.0 0.0 0 0 ?? Z 31Dec69 0:00.00 (afsd)
> root 253 0.0 0.0 0 0 ?? Z 31Dec69 0:00.00 (afsd)
> root 254 0.0 0.0 0 0 ?? Z 31Dec69 0:00.00 (afsd)
> I've been thinking of simply blowing away the cache directory before
> starting AFS - would that be likely to help? Is there any other info
> that's useful in diagnosing the problem?
kernel backtrace would be of tremendous help.