[OpenAFS] Delete on large directory tree causes client lockup

Derrick Brashear shadow@gmail.com
Sun, 6 Apr 2008 20:51:41 -0400


On Sun, Apr 6, 2008 at 5:03 PM, Moritz Bechler <mbechler@eenterphace.org> wrote:
> Hi,
>
>  when deleting a somewhat larger (the reproduce case I use at the moment is
> an openafs source tree but that already happened on considerably smaller
> trees as well) directory from afs we experience lockups of the complete afs
> client (all other fs calls on the client start to block) - which is kind of
> bad as our home directories are stored on /afs.
>
>  We are using openafs-1.5.33 on Gentoo Linux (Kernel 2.6.22) at the moment
> but used 1.4.6 some time before (and hoped to fix that problem by
> upgrading). I've not tested it recently, but when testing the Windows client
> it seemed that the same or a similar problem existed there too.
>
>  - cmdebug -long <host> seems to be unable to communicate with the
> cachemanager
>  - we can't get the client to produce debug output (-debug/-logfile does not
> seem to do anything)
>  - strace on rm shows unlinkat() blocking

While the client is hung, run (as root)

  echo "t" > /proc/sysrq-trigger

and collect the resulting task dump from the messages file; then open a
bug in OpenAFS RT and attach that output.

If you don't have magic sysrq turned on, fix that first: the kernel needs
CONFIG_MAGIC_SYSRQ, and /proc/sys/kernel/sysrq should be set to 1.
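
For reference, the whole procedure could be scripted roughly as below. This is a sketch only, not a tested recipe for this particular setup. The two procfs paths are the standard Linux ones, but the function names and the output file are made up for illustration. It assumes a root shell, and that dmesg still holds the dump (on a busy box the ring buffer may wrap, in which case use the messages file as described above).

```shell
#!/bin/sh
# Sketch only: function names are hypothetical, run as root.
# The paths default to the standard procfs locations and can be overridden.
SYSRQ_ENABLE="${SYSRQ_ENABLE:-/proc/sys/kernel/sysrq}"
SYSRQ_TRIGGER="${SYSRQ_TRIGGER:-/proc/sysrq-trigger}"

# Turn on all magic SysRq functions (1 = enable everything).
enable_sysrq() {
    echo 1 > "$SYSRQ_ENABLE"
}

# Ask the kernel to dump the state of every task to the kernel log.
# Do this while the AFS client is hung so the blocked tasks show up.
dump_tasks() {
    echo t > "$SYSRQ_TRIGGER"
}

# Save the kernel ring buffer to a file that can be attached to the bug.
save_kernel_log() {
    dmesg > "$1"
}
```

Usage, once rm is stuck in unlinkat():

  enable_sysrq && dump_tasks && save_kernel_log /tmp/afs-hang-sysrq-t.txt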