[OpenAFS] fileserver goes down overnight

david l goodrich dlg@dsrw.org
Tue, 24 Mar 2009 14:52:05 -0500


On Tue, Mar 24, 2009 at 08:02:00PM +0100, Anders Magnusson wrote:
> david l goodrich wrote:
> > On Tue, Mar 24, 2009 at 02:27:35PM -0400, Steven Jenkins wrote:
> >
> >> On Tue, Mar 24, 2009 at 2:13 PM, david l goodrich <dlg@dsrw.org> wrote:
> >> ...
> >>
> >>>>> sprawl# ps auxw | grep /openafs/
> >>>>> root   376  0.0  0.0 2316    4 ?     DW    5:33PM 0:00.83 /usr/pkg/libexec/openafs/volserver
> >>>>> root   727  0.0  0.0 8664 2384 ?     IW<a  5:33PM 0:18.29 /usr/pkg/libexec/openafs/fileserver
> >>>>>
> >>>>>
> >> ...
> >>
> >> Can you get a pstack and lsof of the volserver process?  (You may not
> >> be able to even get that much info..).
> >>
> > lsof, yes[1].  pstack, no, it's a NetBSD box and I can't find
> > pstack for it.
> >
> Please run ps axl and ktrace -p 376 to see what you get.  It might be a
> Xen bug.

ps axl is here[1]

sprawl# kdump ktrace.out
   376 volserver EMUL  "netbsd"
   376 volserver CALL  gettimeofday(0x80c0fc0,0)
   376 volserver RET   gettimeofday 0
   376 volserver CALL  gettimeofday(0x80c0fc0,0)
   376 volserver RET   gettimeofday 0
   376 volserver CALL  select(5,0x80a0860,0,0,0x80a0844)
sprawl#
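
In the trace above, the final select() CALL has no matching RET, which
suggests the volserver was still inside select() when the trace was
dumped. A minimal sketch of doing that check mechanically is below. The
function name pending_calls and the regular expression are illustrative
assumptions, not part of any OpenAFS or NetBSD tool, and the parser only
handles the simple "pid command CALL/RET name" layout shown here.

```python
import re

# Matches kdump lines such as:
#    376 volserver CALL  select(5,0x80a0860,0,0,0x80a0844)
#    376 volserver RET   gettimeofday 0
# Assumption: no extra columns (for example LWP ids or timestamps).
LINE_RE = re.compile(r"^\s*(\d+)\s+(\S+)\s+(CALL|RET)\s+(\w+)")


def pending_calls(kdump_text):
    """Return {pid: syscall} for each CALL that has no matching RET yet."""
    pending = {}
    for line in kdump_text.splitlines():
        m = LINE_RE.match(line)
        if not m:
            continue  # EMUL records, shell prompts, blank lines
        pid, _command, kind, name = m.groups()
        pid = int(pid)
        if kind == "CALL":
            pending[pid] = name
        elif pending.get(pid) == name:
            # RET for the outstanding call: the syscall completed.
            del pending[pid]
    return pending


TRACE = """\
   376 volserver EMUL  "netbsd"
   376 volserver CALL  gettimeofday(0x80c0fc0,0)
   376 volserver RET   gettimeofday 0
   376 volserver CALL  gettimeofday(0x80c0fc0,0)
   376 volserver RET   gettimeofday 0
   376 volserver CALL  select(5,0x80a0860,0,0,0x80a0844)
sprawl#
"""

print(pending_calls(TRACE))
```

Keying on the pid alone is a simplification: it is enough for a
single-threaded trace like this one, but a multi-threaded process would
need the thread id as part of the key.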

  --david

1. http://www.dsrw.org/~dlg/ps_axl
>
> -- Ragge
