[OpenAFS] another kernel panic running 1.8.4 on Linux 3.10.0-1062.7.1.el7.x86_64

Chris Cooke cc@inf.ed.ac.uk
Thu, 13 Feb 2020 14:04:18 +0000


Hi,

We have just suffered a kernel panic which may remind you of the one report=
ed in January by my colleague Neil Brown - however, his problem was on a ma=
chine running openafs server 1.8.4, whereas mine uses openafs client 1.8.4.
Can anyone help? The machine upgraded to kernel 3.10.0-1062.7.1.el7.x86_64,=
 then just under 20 hours later it suffered a kernel panic, as follows:

2020-02-13T10:26[69724.289924] ------------[ cut here ]------------
:30.989824+00:00[69724.297908] kernel BUG at /builddir/build/BUILD/openafs-=
1.8.4/src/libafs/MODLOAD-3.10.0-1062.7.1.el7.x86_64-SP/afs_segments.c:556!
 archlute kernel: openafs: afs_I[69724.312303] invalid opcode: 0000 [#1] SMP
nvalidateAllSegm[69724.317921] Modules linked in: openafs(POE)ents tdc count
 btrfs raid6_pq xor msdos xfs libcrc32c nfsv4 dns_resolver nfs fscache nfsd=
 nfs_acl lockd grace fuse bonding bnep bluetooth rfkill drbg ansi_cprng dm_=
crypt dm_mod vfat fat skx_edac intel_powerclamp coretemp intel_rapl iosf_mb=
i iTCO_wdt crc32_pclmul iTCO_vendor_support dell_smbios ghash_clmulni_intel=
 dell_wmi_descriptor dcdbas aesni_intel lrw gf128mul glue_helper ablk_helpe=
r cryptd i2c_i801 lpc_ich sg pcspkr mei_me wmi mei pcc_cpufreq acpi_pad acp=
i_power_meter auth_rpcgss sunrpc ip_tables ext4 mbcache jbd2 sd_mod crc_t10=
dif crct10dif_generic mgag200 i2c_algo_bit drm_kms_helper syscopyarea sysfi=
llrect sysimgblt fb_sys_fops ttm crct10dif_pclmul ahci crct10dif_common lib=
ahci drm tg3 crc32c_intel ptp libata megaraid_sas ipmi_si drm_panel_orienta=
tion_quirks pps_core nfit ipmi_devintf ipmi_msghandler libnvdimm
[69724.405687] CPU: 10 PID: 108817 Comm: BgFileSaver Tainted: P           O=
E  ------------   3.10.0-1062.7.1.el7.x86_64 #1
[69724.417750] Hardware name: Dell Inc. PowerEdge R440/08CYF7, BIOS 2.4.8 1=
1/27/2019
[69724.426511] task: ffff8b85f6ae1070 ti: ffff8b87b4688000 task.ti: ffff8b8=
7b4688000
[69724.435254] RIP: 0010:[<ffffffffc0c1ccff>]  [<ffffffffc0c1ccff>] afs_Inv=
alidateAllSegments+0x4bf/0x4d0 [openafs]
[69724.446719] RSP: 0018:ffff8b87b468bd90  EFLAGS: 00010246
[69724.453292] RAX: 000000000000002c RBX: 000000000000069d RCX: 00000000000=
02619
[69724.461663] RDX: 0000000000000000 RSI: 0000000000000246 RDI: 00000000000=
00246
[69724.470025] RBP: ffff8b87b468bdd0 R08: 0000000000000000 R09: 00000000fff=
fffff
[69724.478376] R10: 0000000000002619 R11: 0000000000000001 R12: 00000000000=
00504
[69724.486691] R13: ffff8b85f6ae1070 R14: 0000000000071866 R15: ffff8ba1faa=
c2a80
[69724.494981] FS:  00002b0843a40700(0000) GS:ffff8b917e540000(0000) knlGS:=
0000000000000000
[69724.504250] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[69724.511323] CR2: 00002b080ee689e0 CR3: 0000003920fe4000 CR4: 00000000007=
607e0
[69724.519628] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 00000000000=
00000
[69724.527889] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 00000000000=
00400
[69724.536216] PKRU: 55555554
[69724.540020] Call Trace:
[69724.543553]  [<ffffffffc0c1d4ba>] afs_StoreAllSegments+0x7aa/0xc40 [open=
afs]
[69724.551830]  [<ffffffffc0c72987>] afs_linux_flush+0x4b7/0x560 [openafs]
[69724.559619]  [<ffffffffa5047dc7>] filp_close+0x37/0x90
[69724.565911]  [<ffffffffa506b58c>] __close_fd+0x8c/0xb0
[69724.572193]  [<ffffffffa50498e3>] SyS_close+0x23/0x50
[69724.578349]  [<ffffffffa558dede>] system_call_fastpath+0x25/0x2a
[69724.585435] Code: ff ff 48 c7 45 c8 00 00 00 00 31 db e9 c6 fc ff ff 48 =
c7 c7 80 f8 c8 c0 e8 ad 7d 95 e4 0f 0b 48 c7 c7 50 f8 c8 c0 e8 9f 7d 95 e4 =
<0f> 0b 0f 1f 44 00 00 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00
[69724.608153] RIP  [<ffffffffc0c1ccff>] afs_InvalidateAllSegments+0x4bf/0x=
4d0 [openafs]
[69724.617080]  RSP <ffff8b87b468bd90>
[69724.624593] ---[ end trace abbf328c8e36046b ]---
2020-02-13T10:26:31.324934+00:00 archlute kernel: kernel BUG at /builddir/b=
uild/BUILD/openafs-1.8.4/src/libafs/MODLOAD-3.10.0-1062.7.1.el7.x86_64-SP/a=
fs_segments.c:556!
[69724.697823] Kernel panic - not syncing: Fatal exception

Chris Cooke

School of Informatics, University of Edinburgh.






--=20
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.