[OpenAFS] [ openafs-devel ] kernel panic

Stephan Wiesand stephan.wiesand@desy.de
Wed, 27 Nov 2013 17:08:17 +0100


Hello Nicolas,

thanks again for the reports, but more information will be needed.

- What platform? (distribution, architecture)
- afsd parameters?
- How to reproduce?
- How reproducible is this? (=3D how confident are you that =
3.11.7+1.6.5.1 is ok?)
- Any chance you could try 3.11.7 with 1.6.6pre1 and/or 3.11.8 with =
1.6.5.1?
- Can you reproduce the problem with an ext4 cache?

Regards,
	Stephan

On 2013-11-18, at 16:34, nicolas prochazka <prochazka.nicolas@gmail.com> =
wrote:

> I've never seen this problem with previous version, but i do some
> update in same time
>=20
> kernel 3.11.7 + openafs 1.6.5.1   : no problem
> kernel 3.11.8 + openafs 1.6.6pre : problem
>=20
>=20
> I've test with  tmpfs , --memory , and zfs cache.   ( same issue with
> tmpfs and zfs  , with --memory, sometime the copy of file is stopped
> and cache does not grow anymore )  for cache system.  backing
> filesystem ( vicea ) is zfs.
>=20
> this problem appears to random way,
> it seems more frequents when i acces to afs mount , very soon after
> afsd is starting .
>=20
> Regards,
>=20
>=20
>=20
>=20
>=20
> 2013/11/18 Stephan Wiesand <stephan.wiesand@desy.de>:
>> Hello Nicolas,
>>=20
>> thanks for testing 1.6.6pre1 before it's even announced!
>>=20
>> Could you give us more details about the client platform, cache =
configuration and backing filesystem, and how you reproduce this crash?
>>=20
>> Are you sure you can't provoke this problem on 1.6.5.1, 1.6.5 or =
1.6.3?
>>=20
>> Regards,
>>        Stephan
>>=20
>> On 2013-11-18, at 14:33, nicolas prochazka =
<prochazka.nicolas@gmail.com> wrote:
>>=20
>>> hello again, and sorry for the spam.
>>> After 30m of copy, the bug persist with 1.6.6pre1 module / afsd
>>>=20
>>> Regards,
>>> Nicolas Prochazka
>>>=20
>>> 2013/11/18 nicolas prochazka <prochazka.nicolas@gmail.com>:
>>>> Hello again,
>>>> some tests after,
>>>> it seems to be a bad configuration in my side
>>>> : use of openafs 1.6.6pre1 kernel module with openafs 1.6.5.1 afsd =
cache daemons
>>>>=20
>>>> Regards,
>>>> Nicolas Prochazka
>>>>=20
>>>> 2013/11/18 nicolas prochazka <prochazka.nicolas@gmail.com>:
>>>>> hello,
>>>>>=20
>>>>> setup  :
>>>>> openafs : OpenAFS 1.6.6pre1
>>>>> kernel 3.11.8
>>>>>=20
>>>>> when i start afsd ,
>>>>> [  231.083287] openafs: Can't get dentry
>>>>> [  231.083323] ------------[ cut here ]------------
>>>>> [  231.083326] kernel BUG at
>>>>> /tmp/openafs-d294e9c/src/libafs/MODLOAD-3.11.8-MP/osi_file.c:53!
>>>>> [  231.083328] invalid opcode: 0000 [#1] SMP
>>>>> [  231.083337] Modules linked in: libafs(PO) zfs(PO) zunicode(PO)
>>>>> zavl(PO) zcommon(PO) znvpair(PO) spl(O) [last unloaded: libafs]
>>>>> [  231.083347] CPU: 0 PID: 16582 Comm: ndvClusterManag Tainted: P
>>>>>    O 3.11.8 #1
>>>>> [  231.083348] Hardware name: innotek GmbH VirtualBox/VirtualBox, =
BIOS
>>>>> VirtualBox 12/01/2006
>>>>> [  231.083350] task: ffff8800a5267000 ti: ffff8800a1b20000 =
task.ti:
>>>>> ffff8800a1b20000
>>>>> [  231.083351] RIP: 0010:[<ffffffffa02c3db6>]  =
[<ffffffffa02c3db6>]
>>>>> afs_linux_raw_open+0x96/0xf0 [libafs]
>>>>> [  231.083372] RSP: 0018:ffff8800a1b21788  EFLAGS: 00010296
>>>>> [  231.083373] RAX: 0000000000000019 RBX: 0000000000000000 RCX: =
ffffffff8277c488
>>>>> [  231.083374] RDX: 000000000000001f RSI: 0000000000000082 RDI: =
0000000000000246
>>>>> [  231.083375] RBP: ffff8800a1b217a8 R08: 0000000000000400 R09: =
ffffffff8277c488
>>>>> [  231.083376] R10: 0000000000000266 R11: 0000000000000265 R12: =
ffff8800a945a000
>>>>> [  231.083377] R13: ffff8800a5267000 R14: 0000000000000000 R15: =
000000003b9ac9ff
>>>>> [  231.083387] FS:  00007f3b75f51700(0000) =
GS:ffff88011fc00000(0000)
>>>>> knlGS:0000000000000000
>>>>> [  231.083389] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>>>>> [  231.083390] CR2: 0000000000458dc0 CR3: 00000000a1af8000 CR4: =
00000000000006f0
>>>>> [  231.083395] Stack:
>>>>> [  231.083396]  000000d0a5267000 ffff88011fff0000 ffffc9000f5d6cc8
>>>>> ffff8800a945a000
>>>>> [  231.083399]  ffff8800a1b217d8 ffffffffa02c3ec4 ffff8800b8ed6000
>>>>> ffff8800b8ed6000
>>>>> [  231.083401]  ffffc9000f5d6c00 0000000000000000 ffff8800a1b21908
>>>>> ffffffffa027f5f1
>>>>> [  231.083403] Call Trace:
>>>>> [  231.083412]  [<ffffffffa02c3ec4>] osi_UFSOpen+0xb4/0x190 =
[libafs]
>>>>> [  231.083422]  [<ffffffffa027f5f1>] afs_GetDCache+0x901/0x2380 =
[libafs]
>>>>> [  231.083439]  [<ffffffff8116159b>] ? =
__mem_cgroup_commit_charge+0xab/0x310
>>>>> [  231.083449]  [<ffffffffa029f433>] ? afs_AccessOK+0x113/0x1e0 =
[libafs]
>>>>> [  231.083457]  [<ffffffffa02a975d>] afs_lookup+0x38d/0x1c30 =
[libafs]
>>>>> [  231.083465]  [<ffffffffa029d3e4>] ? afs_FindVCache+0x354/0x680 =
[libafs]
>>>>> [  231.083474]  [<ffffffffa029e18e>] ? afs_GetVCache+0x7e/0x5d0 =
[libafs]
>>>>> [  231.083482]  [<ffffffffa02a6edc>] ? =
afs_EvalFakeStat_int+0x32c/0x4e0 [libafs]
>>>>> [  231.083491]  [<ffffffff81f6890e>] ? _raw_spin_lock+0xe/0x20
>>>>> [  231.083498]  [<ffffffffa02cc27b>]
>>>>> afs_linux_dentry_revalidate+0x18b/0x450 [libafs]
>>>>> [  231.083506]  [<ffffffffa029f433>] ? afs_AccessOK+0x113/0x1e0 =
[libafs]
>>>>> [  231.083514]  [<ffffffffa029bf89>] ? afs_PutVCache+0x79/0x140 =
[libafs]
>>>>> [  231.083522]  [<ffffffffa029f67b>] ? afs_access+0x17b/0x7d0 =
[libafs]
>>>>> [  231.083526]  [<ffffffff81010101>] ? =
compat_arch_ptrace+0x191/0x220
>>>>> [  231.083529]  [<ffffffff81177325>] lookup_fast+0x245/0x2f0
>>>>> [  231.083532]  [<ffffffff81177759>] ? =
__inode_permission+0x69/0xc0
>>>>> [  231.083534]  [<ffffffff81177c12>] link_path_walk+0x1a2/0x8e0
>>>>> [  231.083536]  [<ffffffff81178ba8>] path_lookupat+0x58/0x7b0
>>>>> [  231.083540]  [<ffffffff811553f7>] ? kmem_cache_alloc+0x67/0x1c0
>>>>> [  231.083543]  [<ffffffff81179333>] =
filename_lookup.isra.48+0x33/0x80
>>>>> [  231.083545]  [<ffffffff8117b889>] user_path_at_empty+0x59/0xa0
>>>>> [  231.083548]  [<ffffffff8103f5f8>] ? __do_page_fault+0x1e8/0x4c0
>>>>> [  231.083550]  [<ffffffff8117b8e1>] user_path_at+0x11/0x20
>>>>> [  231.083553]  [<ffffffff811712a1>] vfs_fstatat+0x51/0xb0
>>>>> [  231.083555]  [<ffffffff8117133b>] vfs_stat+0x1b/0x20
>>>>> [  231.083558]  [<ffffffff8117148a>] SyS_newstat+0x1a/0x40
>>>>> [  231.083560]  [<ffffffff8103f90e>] ? do_page_fault+0xe/0x10
>>>>> [  231.083563]  [<ffffffff8106d318>] ? =
SyS_rt_sigprocmask+0x88/0xb0
>>>>> [  231.083566]  [<ffffffff81f699d9>] =
system_call_fastpath+0x16/0x1b
>>>>> [  231.083567] Code: 00 f0 ff ff 49 89 c4 77 24 48 89 df e8 74 e6 =
eb
>>>>> e0 48 83 c4 10 4c 89 e0 5b 41 5c 5d c3 48 c7 c7 04 35 2e a0 31 c0 =
e8
>>>>> cc 57 c9 e1 <0f> 0b 65 48 8b 04 25 80 c7 00 00 48 8b 90 90 03 00 =
00 be
>>>>> 02 00
>>>>> [  231.083588] RIP  [<ffffffffa02c3db6>] =
afs_linux_raw_open+0x96/0xf0 [libafs]
>>>>> [  231.083595]  RSP <ffff8800a1b21788>
>>>>> [  231.083598] ---[ end trace 3a0f610d92191038 ]---
>>>>>=20
>>>>>=20
>>>>> with kernel 3.11.7 i do not have this issues,
>>>>> but I must re test this case.

--=20
Stephan Wiesand
DESY - DV -
Platanenallee 6
15738 Zeuthen, Germany