[OpenAFS] AFS client crash

Ido Levy IDOL@il.ibm.com
Mon, 10 Apr 2006 18:46:09 +0300


Hello all,

I am experiencing major problems with OpenAFS version 1.2.13-rhel3 running
on a system with the following details:

> uname -a
2.4.21-32.0.1.ELsmp #1 SMP Tue May 17 17:46:36 EDT 2005 x86_64 x86_64
x86_64 GNU/Linux

> cat /etc/redhat-release
Red Hat Enterprise Linux AS release 3 (Taroon Update 7)

Attached is the crash dump from the machine:

| msg
+--------------------------------------------------------------------------------------------------------------------------------+
| [...network console startup...]
| audit_intercept: error 38, killing task
| [<ffffffff8013c2f0>]{__get_user_pages+624}
[<ffffffff801baf02>]{load_elf32_binary+4722}
| Unable to handle kernel paging request at virtual address 00000001977d6948
| [<ffffffff801bbe69>]{load_elf32_binary+8665}
[<ffffffff8016a2f7>]{do_coredump+631}
| [<ffffffff801310e9>]{__dequeue_signal+393}
[<ffffffff801331e1>]{get_signal_to_deliver+1089}
| [<ffffffff80110061>]{do_signal+97} [<ffffffff80243f86>]{sock_read+118}
| [<ffffffff80110b5f>]{error_signal_test+0}
| Process db3_wrapper (pid: 32130, stackpage=101303e9000)
| Stack: 00000101303e91a8 0000000000000000 ffffffffa01459d7
0000000000000000
| 00000101303e8000 0000000000000000 0000000000000000 0000000000000000
| 0000000000000020 0000000000000000 00000101303e8000 00000100dd5f88b0
| 00000100dd5f88b0 0000000000000000 00000000000f3066 00000100dd5f8860
| 00000100dd5f88a8 00000100dd5f8800 00000100dd5f8860 00000100dd5f8830
| 0000000000000004 00000100dd5f8860 ffffffffa01434b1 000001000de71bc0
| printing rip:
| 00000004a0140bef 00000101303e92f4 ffffffffa014f4e7 ffffff0000cbc048
| 0000000000000020 0000000000000002 00000101303e9548 00000100dd5f8800
| 00000100dd5f8860 00000101303e92f4 0000000000000000 000001014f662000
| ffffffffa01439f0 00000101303e9548 00000101303e9378 00000100dd5f8800
| Call Trace:
[<ffffffffa01459d7>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_cv_wait+247}
| [<ffffffffa01434b1>]{:libafs-2.4.21-32.0.1.ELsmp.mp:rxi_ReadProc+241}
|
[<ffffffffa014f4e7>]{:libafs-2.4.21-32.0.1.ELsmp.mp:rxkad_PreparePacket+167}

| [<ffffffffa01439f0>]{:libafs-2.4.21-32.0.1.ELsmp.mp:rx_ReadProc32+208}
| [<ffffffffa01497aa>]{:libafs-2.4.21-32.0.1.ELsmp.mp:xdrrx_getint32+26}
| [<ffffffffa013585c>]{:libafs-2.4.21-32.0.1.ELsmp.mp:xdr_afs_uint32+44}
|
[<ffffffffa0133bc9>]{:libafs-2.4.21-32.0.1.ELsmp.mp:xdr_AFSFetchStatus+25}
|
[<ffffffffa0136093>]{:libafs-2.4.21-32.0.1.ELsmp.mp:EndRXAFS_StoreData+51}
| ffffffffa0145ad1
| [<ffffffffa0193a20>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_global_lock+0}
|
[<ffffffffa01085d7>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_UFSCacheStoreProc+407}

| [<ffffffffa01935a0>]{:libafs-2.4.21-32.0.1.ELsmp.mp:xdrrx_ops+0}
|
[<ffffffffa019f2f8>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_stats_cmfullperf+2584}

|
[<ffffffffa0115bdf>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_StoreAllSegments+5775}

| [<ffffffff80146306>]{do_generic_file_write+662}
[<ffffffffa019ebd8>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_stats_cmfullperf+760}

| PML4 155e22067 PGD 0
| [<ffffffff8017a4b3>]{iput+99}
[<ffffffffa0151b38>]{:libafs-2.4.21-32.0.1.ELsmp.mp:osi_UFSClose+88}
| [<ffffffffa0120b4a>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_CopyOutAttrs+330}
| [<ffffffffa0154753>]{:libafs-2.4.21-32.0.1.ELsmp.mp:vcache2inode+35}
|
[<ffffffffa0158b31>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_linux_writepage_sync+705}

| [<ffffffff80155be8>]{__alloc_pages+152}
[<ffffffff801462a5>]{do_generic_file_write+565}
| Oops: 0002
| [<ffffffff801466e8>]{generic_file_write+296}
[<ffffffffa0193a20>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_global_lock+0}
| [<ffffffffa01554da>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_linux_write+1642}
| CPU 1
| Pid: 32130, comm: db3_wrapper Tainted: PF
|
| Code: f0 fe 88 08 08 00 00 0f 88 0d 02 00 00 65 48 8b 04 25 18 00
| RIP: 0010:[<ffffffffa0145ad1>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_cv_wait+497}
| RSP: 0000:00000101303e91a8  EFLAGS: 00010046
| RAX: 00000001977d6140 RBX: 0000000000000000 RCX: 00000100dd5f88b0
| RDX: 0000000000007d82 RSI: 00000101303e91d8 RDI: 00000100dd5f8860
| RBP: 00000100dd5f88a8 R08: 0000000000000000 R09: 0000000000000001
| R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
| R13: 00000101303e91d8 R14: 00000100dd5f8860 R15: 00000100dd5f8860
| FS:  00000000082e4638(0000) GS:ffffffff805e3a80(005b)
knlGS:00000000402628e0
| CS:  0010 DS: 002b ES: 002b CR0: 000000008005003b
| CR2: 00000001977d6948 CR3: 000000000ddfe000 CR4: 00000000000006e0
| CPU#0 is frozen.
| CPU#1 is executing netdump.
| < netdump activated - performing handshake with the server. >
| WARM shutting down of: CB... afs... BkG... CTrunc... AFSDB... RxEvent...
UnmaskRxkSignals... RxListener...
| Found system call table at 0xffffffff805e3280 (scan: close+chdir+write)
| Found 32-bit system call table at 0xffffffff8043d5c0 (exported)
| Starting AFS cache scan...found 0 non-empty cache files (0%).
| [<ffffffff80110061>]{do_signal+97}
[<ffffffff801398c8>]{compat_sys_futex+216}
| Process rulebase_edl (pid: 8974, stackpage=100735cd000)
| Stack: 00000100735cd1a8 0000000000000018 ffffffffa020d9d7
0000000000000000
| 00000100735cc000 0000000000000000 0000000000000000 0000000000000000
| 0000000000000020 0000000000000000 00000100735cc000 00000100df2ad8b0
| 00000100df2ad8b0 0000000000000000 0000000000055338 00000100df2ad860
| 00000100df2ad8a8 00000100df2ad800 00000100df2ad860 00000100df2ad830
| 0000000000000004 00000100df2ad860 ffffffffa020b4b1 00000101f7dc1480
| Unable to handle kernel paging request at virtual address 0000000137d42a48
| 00000004a0208bef 00000100735cd2f4 ffffffffa02174e7 ffffff0000c68048
| 0000000000000020 0000000000000002 00000100735cd548 00000100df2ad800
| 00000100df2ad860 00000100735cd2f4 0000000000000000 000001018c673000
| ffffffffa020b9f0 00000100735cd548 00000100735cd378 00000100df2ad800
| Call Trace:
[<ffffffffa020d9d7>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_cv_wait+247}
| [<ffffffffa020b4b1>]{:libafs-2.4.21-32.0.1.ELsmp.mp:rxi_ReadProc+241}
|
[<ffffffffa02174e7>]{:libafs-2.4.21-32.0.1.ELsmp.mp:rxkad_PreparePacket+167}

| [<ffffffffa020b9f0>]{:libafs-2.4.21-32.0.1.ELsmp.mp:rx_ReadProc32+208}
| [<ffffffffa02117aa>]{:libafs-2.4.21-32.0.1.ELsmp.mp:xdrrx_getint32+26}
| [<ffffffffa01fd85c>]{:libafs-2.4.21-32.0.1.ELsmp.mp:xdr_afs_uint32+44}
|
[<ffffffffa01fbbc9>]{:libafs-2.4.21-32.0.1.ELsmp.mp:xdr_AFSFetchStatus+25}
|
[<ffffffffa01fe093>]{:libafs-2.4.21-32.0.1.ELsmp.mp:EndRXAFS_StoreData+51}
| [<ffffffffa025ba20>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_global_lock+0}
|
[<ffffffffa01d05d7>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_UFSCacheStoreProc+407}

| [<ffffffffa025b5a0>]{:libafs-2.4.21-32.0.1.ELsmp.mp:xdrrx_ops+0}
|
[<ffffffffa02672f8>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_stats_cmfullperf+2584}

|
[<ffffffffa01ddbdf>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_StoreAllSegments+5775}

| [<ffffffff80146306>]{do_generic_file_write+662}
[<ffffffffa0266bd8>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_stats_cmfullperf+760}

| [<ffffffff8017a4b3>]{iput+99}
[<ffffffffa0219b38>]{:libafs-2.4.21-32.0.1.ELsmp.mp:osi_UFSClose+88}
| [<ffffffffa01e8b4a>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_CopyOutAttrs+330}
| [<ffffffffa021c753>]{:libafs-2.4.21-32.0.1.ELsmp.mp:vcache2inode+35}
|
[<ffffffffa0220b31>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_linux_writepage_sync+705}

| ffffffffa020dad1
| [<ffffffff801466e8>]{generic_file_write+296}
[<ffffffffa025ba20>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_global_lock+0}
| [<ffffffffa021d4da>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_linux_write+1642}
| PML4 7311b067 PGD 0
| Pid: 8974, comm: rulebase_edl Tainted: PF
| RIP: 0010:[<ffffffffa020dad1>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_cv_wait+497}
| RSP: 0018:00000100735cd1a8  EFLAGS: 00013046
| RAX: 0000000137d42240 RBX: 0000000000000000 RCX: 00000100df2ad8b0
| RDX: 000000000000230e RSI: 00000100735cd1d8 RDI: 00000100df2ad860
| RBP: 00000100df2ad8a8 R08: 0000000000000000 R09: 0000000000000001
| R13: 00000100735cd1d8 R14: 00000100df2ad860 R15: 00000100df2ad860
| FS:  0000002a95f7c4c0(0000) GS:ffffffff805e3a80(005b)
knlGS:0000000040262940
| CR2: 0000000137d42a48 CR3: 000000000ddfe000 CR4: 00000000000006e0
| Unable to handle kernel paging request at virtual address 00000001f708dd48
| [<ffffffff801a8f60>]{pmd_huge+0}
[<ffffffff8013c2f0>]{__get_user_pages+624}
| [<ffffffff801baf02>]{load_elf32_binary+4722}
[<ffffffff801bbe69>]{load_elf32_binary+8665}
| [<ffffffffa020f8ba>]{:libafs-2.4.21-32.0.1.ELsmp.mp:rxi_FreePacket+42}
| [<ffffffffa0207f77>]{:libafs-2.4.21-32.0.1.ELsmp.mp:rxi_SendAck+951}
|
[<ffffffffa0207056>]{:libafs-2.4.21-32.0.1.ELsmp.mp:rxi_SendDelayedAck+70}
| [<ffffffffa02040c9>]{:libafs-2.4.21-32.0.1.ELsmp.mp:rx_EndCall+409}
| [<ffffffffa01fe3f2>]{:libafs-2.4.21-32.0.1.ELsmp.mp:RXAFS_StoreStatus+98}
| [<ffffffffa01cc127>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_GetCellStale+55}
| [<ffffffffa01c56ac>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_Analyze+172}
|
[<ffffffffa0267060>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_stats_cmfullperf+1920}

| [<ffffffffa01e4eb8>]{:libafs-2.4.21-32.0.1.ELsmp.mp:afs_WriteVCache+1144}
| [<ffffffff8017b2f0>]{notify_change+640}
[<ffffffff8016a2f7>]{do_coredump+631}
| [<ffffffff80110061>]{do_signal+97} [<ffffffff8011009e>]{do_signal+158}
| [<ffffffff801398c8>]{compat_sys_futex+216}
[<ffffffff80110b5f>]{error_signal_test+0}
| Process rulebase_edl (pid: 5717, stackpage=101f29e7000)
| Stack: 00000101f29e71a8 0000000000000000 ffffffffa020d9d7
0000000000000000
| 00000101f29e6000 0000000000000000 0000000000000000 0000000000000000
| 0000000000000013 0000000000000000 00000101f29e6000 00000101fb8e62b0
| 00000101fb8e62b0 0000000000000000 00000000000b76f4 00000101fb8e6260
| 00000101fb8e62a8 00000101fb8e6200 00000101fb8e6260 00000101fb8e6230
| 0000000000000004 00000101fb8e6260 ffffffffa020b4b1 00000101fefce180
| 00000004a0208bef 00000101f29e72f4 ffffffffa02174e7 ffffff0000c9f048
| 0000000000000013 0000000000000002 00000101f29e7548 00000101fb8e6200
| 00000101fb8e6260 00000101f29e72f4 0000000000000000 0000010198ce1000
| ffffffffa020b9f0 00000101f29e7548 00000101f29e7378 00000101fb8e6200
| PML4 d2b70067 PGD 0
| Pid: 5717, comm: rulebase_edl Tainted: PF
| RSP: 0000:00000101f29e71a8  EFLAGS: 00010046
| RAX: 00000001f708d540 RBX: 0000000000000000 RCX: 00000101fb8e62b0
| RDX: 0000000000001655 RSI: 00000101f29e71d8 RDI: 00000101fb8e6260
| RBP: 00000101fb8e62a8 R08: 0000000000000000 R09: 0000000000000001
| R13: 00000101f29e71d8 R14: 00000101fb8e6260 R15: 00000101fb8e6260
| FS:  0000000008218490(0000) GS:ffffffff805e3a80(005b)
knlGS:0000000040664bb0
| CR2: 00000001f708dd48 CR3: 000000000ddfe000 CR4: 00000000000006e0
+--------------------------------------------------------------------------------------------------------------------------------+

I would appreciate your advice on this issue.

Best Regards,

Ido Levy