[OpenAFS] Oopses (SuSE 7.0, 2.2.16)

Cees de Groot cg@cdegroot.com
21 Nov 2001 10:15:23 +0100


Yesterday, I installed 1.2.2 on a couple of our machines to see whether
OpenAFS is ready for prime-time (I want to get rid of NFS). The installation
worked smoothly, until I started doing some load testing on the first machine
by copying ~1.5Gig of binaries into a volume (copying the i386_linux22/dest
directory into a volume worked fine, so I thought "let's start doing some real
work"). The copying was done from an NFS-mounted directory on the same machine
where the volume physically resides.

After a while (copying around 100Mb), I got an oops (ksymoops(1) output below),
the copy process started to spew error messages, and 'ls /afs/' first
generated an error message, then I went to bed ;-), and now it shows an empty
directory. 

Before going to bed, I decided to shut down the client on a different box
because it was filling the log with complaints that the server couldn't be
reached, but it oops'ed as well, also with an IPUT Bad refCount 0 on inode.

I'll probably need to reboot the server to get things going, but I'd like to
know what's causing this - I'm sort of hesitant of putting software into
production that produces oops'es :-)

Starting AFS cache scan...found 0 non-empty cache files (0%).
IPUT Bad refCount 0 on inode 0xfcb33000
Unable to handle kernel paging request at virtual address ffffffff
current->tss.cr3 = 14874000, %cr3 = 14874000
*pde = 002ed067
Oops: 0002
CPU:    0
EIP:    0010:[<fca96c3d>]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010296
eax: 00000028   ebx: fcb33000   ecx: 01000000   edx: 00000202
esi: fcabb6d4   edi: 00000006   ebp: fffffffe   esp: ccec1e3c
ds: 0018   es: 0018   ss: 0018
Process umount (pid: 3042, process nr: 131, stackpage=ccec1000)
Stack: fcb33000 fcaa0bb2 fcb332f0 fcabb6d4 c8b5d000 fcaa0cd0 fcaba040 00000000
       fcb33000 fcaa0bb2 c3035480 fcb33000 00001000 00000087 0000021d d1414000
       40014000 fcaa2fd0 fcb33000 c8b5d000 00000006 fffffffe 00000006 c8b5d000
Call Trace: [<00000000>] [<00001000>] [<00000087>] [<0000021d>] [<00000006>]
[<00000006>] [<00000006>]
       [<c0138fd7>] [<00000000>] [<00006713>] [<c012c804>] [<00000006>]
[<00000001>] [<c012fffa>] [<00000006>]
       [<00000006>] [<00000006>] [<c01300b4>] [<00000006>] [<c01300f1>]
[<c013a7a4>] [<00000006>] [<00000006>]
       [<00000006>] [<00000003>] [<c01301fb>] [<00000006>] [<00000000>]
[<00000000>] [<c01301aa>] [<00000000>]
       [<00000000>] [<00000000>] [<c0130325>] [<00000006>] [<00000000>]
[<00001000>] [<c013035a>] [<00000000>]
       [<00000000>] [<c010957d>] [<c0109444>] [<00000016>] [<0000002b>]
[<0000002b>] [<00000016>] [<00000023>]
       [<00000202>] [<0000002b>]
Code: c6 05 ff ff ff ff 2a 83 c4 10 83 c4 0c c3 90 55 57 56 53 8b

>>EIP; fca96c3d <[libafs-2.2.16-SMP.mp]osi_Panic+25/34>   <=====
Trace; 00000000 Before first symbol
Trace; 00001000 Before first symbol
Trace; 00000087 Before first symbol
Trace; 0000021d Before first symbol
Trace; 00000006 Before first symbol
Trace; 00000006 Before first symbol
Trace; 00000006 Before first symbol
Trace; c0138fd7 <dput+a3/160>
Trace; 00000000 Before first symbol
Trace; 00006713 Before first symbol
Trace; c012c804 <fsync_dev+44/4c>
Trace; 00000006 Before first symbol
Trace; 00000001 Before first symbol
Trace; c012fffa <d_umount+4e/58>
Trace; 00000006 Before first symbol
Trace; 00000006 Before first symbol
Trace; 00000006 Before first symbol
Trace; c01300b4 <do_umount+80/164>
Trace; 00000006 Before first symbol
Trace; c01300f1 <do_umount+bd/164>
Trace; c013a7a4 <get_empty_inode+a0/bc>
Trace; 00000006 Before first symbol
Trace; 00000006 Before first symbol
Trace; 00000006 Before first symbol
Trace; 00000003 Before first symbol
Trace; c01301fb <umount_dev+63/c4>
Trace; 00000006 Before first symbol
Trace; 00000000 Before first symbol
Trace; 00000000 Before first symbol
Trace; c01301aa <umount_dev+12/c4>
Trace; 00000000 Before first symbol
Trace; 00000000 Before first symbol
Trace; 00000000 Before first symbol
Trace; c0130325 <sys_umount+c9/ec>
Trace; 00000006 Before first symbol
Trace; 00000000 Before first symbol
Trace; 00001000 Before first symbol
Trace; c013035a <sys_oldumount+12/1c>
Trace; 00000000 Before first symbol
Trace; 00000000 Before first symbol
Trace; c010957d <error_code+2d/40>
Trace; c0109444 <system_call+34/40>
Trace; 00000016 Before first symbol
Trace; 0000002b Before first symbol
Trace; 0000002b Before first symbol
Trace; 00000016 Before first symbol
Trace; 00000023 Before first symbol
Trace; 00000202 Before first symbol
Trace; 0000002b Before first symbol
Code;  fca96c3d <[libafs-2.2.16-SMP.mp]osi_Panic+25/34>
00000000 <_EIP>:
Code;  fca96c3d <[libafs-2.2.16-SMP.mp]osi_Panic+25/34>   <=====
   0:   c6 05 ff ff ff ff 2a      movb   $0x2a,0xffffffff   <=====
Code;  fca96c44 <[libafs-2.2.16-SMP.mp]osi_Panic+2c/34>
   7:   83 c4 10                  add    $0x10,%esp
Code;  fca96c47 <[libafs-2.2.16-SMP.mp]osi_Panic+2f/34>
   a:   83 c4 0c                  add    $0xc,%esp
Code;  fca96c4a <[libafs-2.2.16-SMP.mp]osi_Panic+32/34>
   d:   c3                        ret
Code;  fca96c4b <[libafs-2.2.16-SMP.mp]osi_Panic+33/34>
   e:   90                        nop
Code;  fca96c4c <[libafs-2.2.16-SMP.mp]osi_utoa+0/ac>
   f:   55                        push   %ebp
Code;  fca96c4d <[libafs-2.2.16-SMP.mp]osi_utoa+1/ac>
  10:   57                        push   %edi
Code;  fca96c4e <[libafs-2.2.16-SMP.mp]osi_utoa+2/ac>
  11:   56                        push   %esi
Code;  fca96c4f <[libafs-2.2.16-SMP.mp]osi_utoa+3/ac>
  12:   53                        push   %ebx
Code;  fca96c50 <[libafs-2.2.16-SMP.mp]osi_utoa+4/ac>
  13:   8b 00                     mov    (%eax),%eax

-- 
Cees de Groot               http://www.cdegroot.com     <cg@cdegroot.com>
GnuPG 1024D/E0989E8B 0016 F679 F38D 5946 4ECD  1986 F303 937F E098 9E8B