[OpenAFS-devel] Linux deadlocks (possibly fixed in IBM-AFS)

Alf Wachsmann alfw@SLAC.Stanford.EDU
Wed, 03 Jul 2002 08:00:02 -0700 (PDT)


On Wed, 3 Jul 2002, Broughton, Travis V wrote:
> We've been running into some bugs in 1.2.5 that are causing deadlocks and
> hangs on the Linux client.  Unlike most AFS deadlocks I've seen, the system
> load average goes to zero rather than steadily increasing.

We see this problem on a nearly daily basis on some of our heavily used
Linux machines (only with Red Hat kernel 2.4.18 though).
See my posting "Kernel hangs on RH Linux systems" from Thu, 13 Jun 2002.


> We believe this behavior to have been fixed in the most recent IBM-AFS
> release, namely by the following deltas:
>
> 	srikanth-IY31752-afs3.6-race-condition-in-afs-buffer-cache 1.2
> 	srikanth-12885-afs3.6-race.condition.in.linux.event.handling 1.5

We are testing an inofficial IBM/Transarc client release with these
fixes right now. We havn't done enough testing yet but the problem seems
to have gone away with this version.

-- Alf.

-----------------------------------------------------------------------
  Alf Wachsmann                       | e-mail: alfw@slac.stanford.edu
  SLAC Computing Service              | Phone:  +1-650-926-4802
  2575 Sand Hill Road, M/S 97         | FAX:    +1-650-926-3329
  Menlo Park, CA 94025, USA           | Office: Bldg. 50/323
-----------------------------------------------------------------------
                http://www.slac.stanford.edu/~alfw (PGP)
-----------------------------------------------------------------------