[OpenAFS] Volume corruption on Solaris 2.7.

Larry W. Cashdollar lwc@vapid.ath.cx
Wed, 28 Jan 2004 17:07:01 -0500 (EST)


I have an issue with recovering volumes after a system has crashed.  It
appears after my system comes back online, the salvager and the fileserver
core dump.   They core dump until I remove /vicepa/*.vol.  I verified that
I  have the correct fsck and changes to /sbin/mountall.  What I am
forgetting?  I know it is something I am doing wrong, but cant figure out
what.

here is the gdb output of the fileserver core file.

#0  0xff219c30 in syscall () from /usr/lib/libc.so.1
#1  0x6eb40 in iopen ()
#2  0x68f48 in ih_open (ihP=0x4fd108) at ihandle.c:283
#3  0x64d68 in ReadHeader (ec=0xffbefb7c, h=0x4fd108, to=0x49e5e0 "",
    size=552, magic=2023862981, version=1) at volume.c:396
#4  0x65484 in attach2 (ec=0xffbefb7c,
    path=0xffbef950 "/vicepa/V0536871044.vol", header=0xffbef990,
    partp=0x376c70, isbusy=0) at volume.c:711
#5  0x651bc in VAttachVolumeByName_r (ec=0xffbefb7c,
    partition=0xc7d28 "/vicepa", name=0x7 <Address 0x7 out of bounds>,
mode=3)
    at volume.c:606
#6  0x64f5c in VAttachVolumeByName (ec=0xffbefb7c,
    partition=0xc7d28 "/vicepa", name=0x4fccaa "V0536871044.vol", mode=3)
    at volume.c:524
#7  0x64ab8 in VInitVolumePackage (pt=4294967295, nLargeVnodes=3632240,
    nSmallVnodes=5229624, connect=0, volcache=650240) at volume.c:282
#8  0x2069c in main ()

Salvager core:

(gdb) where
#0  0xff219c30 in syscall () from /usr/lib/libc.so.1
#1  0x2b748 in iopen ()
#2  0x29264 in ih_open ()
#3  0x1c740 in SalvageHeader ()
#4  0x1c310 in SalvageVolumeHeaderFile ()
#5  0x1bc00 in DoSalvageVolumeGroup ()
#6  0x1a754 in SalvageFileSys1 ()
#7  0x1a088 in SalvageFileSysParallel ()
#8  0x19804 in _start ()
#9  0x21f14 in cmd_Dispatch ()
#10 0x19ba0 in main ()

Any help would be wonderful..

-- Larry C$