[OpenAFS] client kernel panic on EL6

Stephan Wiesand stephan.wiesand@desy.de
Mon, 21 Oct 2013 16:33:13 +0200


On Oct 21, 2013, at 16:18, Stephen Quinney <stephen@jadevine.org.uk> wrote:

> Has anyone else seen a kernel panic like this on EL6 with 1.6.5 and kernel
> 2.6.32-358.14.1.el6? Or does anyone have any suggestions as to what might
> have caused the problem?

We're running all our EL6 systems with this client, many if not all on
-358.* kernels, at least some actually on 358.14.1, and x86_64 like your
system. I believe we haven't seen this happen.

> afs: disk cache read error in CacheItems slot 353815 off 28305220/36284420
> code -4/80
> openafs: assertion failed: tdc, file:
> /builddir/build/BUILD/openafs-1.6.5/src/libafs/MODLOAD-2.6.32-358.14.1.el6.x86_64-SP/afs_dcache.c, line: 1512
> 2013-10-20T19:27------------[ cut here ]------------
> :13.739622+01:00kernel BUG at
> /builddir/build/BUILD/openafs-1.6.5/src/libafs/MODLOAD-2.6.32-358.14.1.el6.x86_64-SP/afs_dcache.c:1512!
> kubelik kernel:invalid opcode: 0000 [#1]  openafs: assertSMP ion failed:
> tdc,
> file: /builddirlast sysfs file:
> /sys/devices/pci0000:00/0000:00:02.0/0000:06:00.0/0000:07:00.0/0000:08:00.0/0000:09:00.0/net/eth1/ifalias
> /build/BUILD/openafs-1.6.5/src/lCPU 1 ibafs/MODLOAD-2.
> Modules linked in:6.32-358.14.1.el openafs(P)6.x86_64-SP/afs_(U)dcache.c,
> line:  bridge1512
> 2013-10-20T ipmi_devintf19:27:13.811101+ pcspkr01:00 kubelik ke nfsrnel:
> kernel BUG lockd at /builddir/bu fscach
> eild/BUILD/openaf auth_rpcgsss-1.6.5/src/liba nfs_aclfs/MODLOAD-2.6.3
> sunrpc2-358.14.1.el6.x p4_clockmod86_64-SP
> /afs_dca freq_tableche.c:1512!
> speedstep_lib bonding 8021q garp stp llc ipv6 ses enclosure bnx2 microcode
> dcdbas serio_raw sg iTCO_wdt iTCO_vendor_support i5000_edac edac_core
> i5k_amb shpchp ext3 jbd mbcache sd_mod crc_t10dif megaraid_sas sr_mod cdrom
> pata_acpi ata_generic ata_piix radeon ttm drm_kms_helper drm i2c_algo_bit
> i2c_core dm_mirror dm_region_hash dm_log dm_mod [last unloaded: mperf]
> Pid: 21990, comm: du Tainted: P           ---------------
> 2.6.32-358.14.1.el6.x86_64 #1 Dell Inc. PowerEdge 1950/0UR033
> RIP: 0010:[<ffffffffa058a46e>]  [<ffffffffa058a46e>]
> afs_AllocDCache+0x41e/0x4b0 [openafs]
> RSP: 0018:ffff88001382fc48  EFLAGS: 00010292
> RAX: 000000000000009a RBX: 0000000000000000 RCX: 0000000000005d54
> RDX: 0000000000000000 RSI: 0000000000000046 RDI: 0000000000000246
> RBP: ffff88001382fc88 R08: 0000000000000000 R09: 0000000000000000
> R10: 0000000000000000 R11: 0000000000002000 R12: 0000000000000000
> R13: ffff88007bb8a000 R14: 0000000000000000 R15: 00000000000681d0
> FS:  00007fade9acd700(0000) GS:ffff880002240000(0000) knlGS:0000000000000000
> CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
> CR2: 0000000000ed3108 CR3: 00000000166d1000 CR4: 00000000000007e0
> DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> Process du (pid: 21990, threadinfo ffff88001382e000, task ffff88001384e040)
> Stack:
> ffff88000a2825c0 0000000000000001 ffff880027183e80 ffff88007bb8a000
> <d> 00000000001a0740 00000000000681d0 00000000000681d0 00000000000681d0
> <d> ffff88001382fda8 ffffffffa058eefd 000001b50b146005 0000000420002d55
> Call Trace:
> [<ffffffffa058eefd>] afs_GetDCache+0x1c9d/0x22e0 [openafs]
> [<ffffffffa05ae1fa>] ? afs_CopyOutAttrs+0xba/0x300 [openafs]
> [<ffffffffa05d6225>] afs_linux_readdir+0x175/0xa80 [openafs]
> [<ffffffff81196290>] ? filldir+0x0/0xe0
> [<ffffffff81196290>] ? filldir+0x0/0xe0
> [<ffffffff81196510>] vfs_readdir+0xc0/0xe0
> [<ffffffff81196699>] sys_getdents+0x89/0xf0
> [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
> Code: 58 00 00 00 00 e9 46 fe ff ff b9 e8 05 00 00 48 c7 c2 18 f5 5e a0 48
> c7 c6 50 be 5e a0 48 c7 c7 b0 f5 5e a0 31 c0 e8 c2 32 f8 e0 <0f> 0b eb fe
> b9 e9 05 00 00 48 c7 c2 18 f5 5e a0 48 c7 c6 54 be
> RIP  [<ffffffffa058a46e>] afs_AllocDCache+0x41e/0x4b0 [openafs]
> RSP <ffff88001382fc48>
> 2013-10-20T19:27:14.500338+01:00 kubelik kernel: RIP  [<ffffffffa058a46e>]
> afs_AllocDCache+0x41e---[ end trace b87d32f01b9f2d88 ]---
> /0x4b0 [openafs]Kernel panic - not syncing: Fatal exception
> Pid: 21990, comm: du Tainted: P      D    ---------------
> 2.6.32-358.14.1.el6.x86_64 #1
> Call Trace:
> [<ffffffff8150d668>] ? panic+0xa7/0x16f
> [<ffffffff81511894>] ? oops_end+0xe4/0x100
> [<ffffffff8100f19b>] ? die+0x5b/0x90
> [<ffffffff815110d4>] ? do_trap+0xc4/0x160
> [<ffffffff8100cdb5>] ? do_invalid_op+0x95/0xb0
> [<ffffffffa058a46e>] ? afs_AllocDCache+0x41e/0x4b0 [openafs]
> [<ffffffff8106f261>] ? vprintk+0x251/0x560
> [<ffffffffa0564d98>] ? afs_warn+0x58/0x60 [openafs]
> [<ffffffff8100be5b>] ? invalid_op+0x1b/0x20
> [<ffffffffa058a46e>] ? afs_AllocDCache+0x41e/0x4b0 [openafs]
> [<ffffffffa058a46e>] ? afs_AllocDCache+0x41e/0x4b0 [openafs]
> [<ffffffffa058eefd>] ? afs_GetDCache+0x1c9d/0x22e0 [openafs]
> [<ffffffffa05ae1fa>] ? afs_CopyOutAttrs+0xba/0x300 [openafs]
> [<ffffffffa05d6225>] ? afs_linux_readdir+0x175/0xa80 [openafs]
> [<ffffffff81196290>] ? filldir+0x0/0xe0
> [<ffffffff81196290>] ? filldir+0x0/0xe0
> [<ffffffff81196510>] ? vfs_readdir+0xc0/0xe0
> [<ffffffff81196699>] ? sys_getdents+0x89/0xf0
> [<ffffffff8100b072>] ? system_call_fastpath+0x16/0x1b
> panic occurred, switching back to text console
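
As a rough sanity check on the first quoted log line ("CacheItems slot
353815 off 28305220/36284420 code -4/80"), the numbers can be tested for
internal consistency. This is only a sketch under assumptions that are
not confirmed anywhere in this thread: that "off" is the byte offset of
the slot followed by the length of the CacheItems file, that "code" is
the return value of the read followed by the expected entry size, and
that a negative return value is a negated Linux errno. The variable
names are made up for illustration.

```python
import errno

# Values copied from the quoted log line:
#   "CacheItems slot 353815 off 28305220/36284420 code -4/80"
slot, offset, file_len = 353815, 28305220, 36284420
code, entry_size = -4, 80

# Assumption: offset = header_size + slot * entry_size.
header_size = offset - slot * entry_size

# Assumption: the file is one header followed by fixed-size entries.
entries, remainder = divmod(file_len - header_size, entry_size)

# Assumption: a negative read result is a negated errno value.
errno_name = errno.errorcode.get(-code, "unknown")

print("implied header size:", header_size)
print("entries in file:", entries, "(remainder %d)" % remainder)
print("slot inside file:", 0 <= slot < entries)
print("errno name for code:", errno_name)
```

Under those assumptions the implied header is 20 bytes and the file holds
a whole number of entries (453555), so the slot lies inside the file and
the offset itself looks sane. That would point at the read failing
rather than at a bad slot number, but it is only an inference from the
log text.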

-- 
Stephan Wiesand
DESY - DV -
Platanenallee 6
15732 Zeuthen, Germany