[OpenAFS] series of 1.4.2 fileserver crashes in rxi_FreeDataBufsTSFPQ (rx_packet.c)
Rainer Toebbicke
rtb@pclella.cern.ch
Tue, 05 Dec 2006 09:51:25 +0100
We're having a series of fileserver crashes in rxi_FreeDataBufsTSFPQ,
about one a day, on a particular server running 1.4.2.
I looked at 6 dumps. The traceback is not always the same, but so far
the common clue is that call->conn->peer->host points to old AFS 3.4
clients!
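For anyone checking the same field in their own dumps, here is a minimal sketch of turning a raw host value into a readable address. It assumes peer->host is a 32-bit IPv4 address kept in network byte order, and that the value has been copied out of the debugger by hand. The helper name host_to_dotted is hypothetical and is not part of rx.

```c
#include <stdint.h>
#include <stdio.h>
#include <string.h>

/*
 * Hypothetical helper (not part of rx): format a 32-bit IPv4 address
 * that is stored in network byte order as dotted-quad text.
 * Assumption: the value was read from memory unchanged, so its bytes
 * are already in a.b.c.d order regardless of host endianness.
 */
static const char *host_to_dotted(uint32_t host_net, char *buf, size_t len)
{
    unsigned char b[4];

    /* Look at the bytes as they sit in memory instead of shifting,
     * so the result does not depend on the host's endianness. */
    memcpy(b, &host_net, sizeof b);
    snprintf(buf, len, "%u.%u.%u.%u",
             (unsigned)b[0], (unsigned)b[1], (unsigned)b[2], (unsigned)b[3]);
    return buf;
}
```

Given a 16-byte buffer, host_to_dotted(value, buf, sizeof buf) returns the address as text, which can then be matched against the list of known old clients.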
I never noticed this on 1.4.1, so a recent change might be responsible.
There have been a few deltas in rx, one of them dealing with congestion?!
Obviously I'll dig further into this, but as this is getting really
ugly now, any quick idea is welcome (hence the double-post, sorry!).
Downgrading to 1.4.1 has other ill effects and is therefore only a last
resort, should identifying the volume(s) involved and moving them to an
old server fail.
--
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
Rainer Toebbicke
European Laboratory for Particle Physics(CERN) - Geneva, Switzerland
Phone: +41 22 767 8985 Fax: +41 22 767 7155