[OpenAFS] -stat crash (openafs 1.1.1, kernel 2.4.3)

Jan Hrabe hrabe@balrog.aecom.yu.edu
Mon, 24 Sep 2001 10:28:06 -0400


> Message: 1
> Date: Fri, 21 Sep 2001 18:23:42 -0400 (EDT)
> From: <Warren.Yenson@morganstanley.com>
> To: <openafs-info@openafs.org>
> Cc: <dboyes@sinenomine.net>, <kvanhees@sinenomine.net>,
> 	Eric Hagberg <Eric.Hagberg@morganstanley.com>
> Subject: [OpenAFS] afsd causes crash for openafs1.2 and kernel 2.4.7 (fwd)
>
> We have seen that we can repeatedly crash a Linux box running 2.4.7 and
> OpenAFS 1.2 by doing operations in /afs that open a large number of files
> or directories (e.g. du -sk).
>
> The message on the screen is:
>
>   Sep 20 12:58:07 saloon1 kernel: Increase -stat parameter of afsd(VLRU
> cycle?)<1> Unable to handle kernel paging request at virtual address
> fffffff
>

We saw the same crash for the first time on our server a couple of days ago.
The machine has a single processor and runs Red Hat 7.1 with an updated
Red Hat kernel (2.4.3-12) and OpenAFS 1.1.1, compiled here with gcc 2.96.
The cacheinfo is /afs:/usr/vice/cache:900000 and the cache is on a separate
partition of ~1.1GB. The LARGE option is set in /etc/sysconfig/afs. We use the
same setup with Transarc AFS 3.6 on RH 6.2 (clients only, but heavily used for
processing large numbers of MRI images) and have never seen any problem there.
Here is the message log from the crashed machine; hopefully someone
can make some sense of it.
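
For anyone comparing cache settings: the cacheinfo line has three
colon-separated fields (AFS mount point, cache directory, cache size in
1 KB blocks). A minimal Python sketch that splits the line above and
converts the size follows. The function names are made up for
illustration and are not part of OpenAFS.

```python
def parse_cacheinfo(line):
    """Split a cacheinfo line into (mount point, cache dir, size in 1 KB blocks)."""
    mount, cache_dir, blocks = line.strip().split(":")
    return mount, cache_dir, int(blocks)


def cache_mib(blocks):
    """Convert a cache size given in 1 KB blocks to MiB."""
    return blocks / 1024


if __name__ == "__main__":
    mount, cache_dir, blocks = parse_cacheinfo("/afs:/usr/vice/cache:900000")
    print(f"{mount} cached in {cache_dir}: {cache_mib(blocks):.1f} MiB")
```

So the configured cache is somewhat under 900 MiB on the ~1.1GB partition.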

Thanks a lot.
Honza

Sep 21 20:00:33 claymore kernel: Increase -stat parameter of afsd(VLRU 
cycle?)<1>Unable to handle kernel paging request at virtual address ffffffff
Sep 21 20:00:33 claymore kernel:  printing eip:
Sep 21 20:00:33 claymore kernel: c89103ac
Sep 21 20:00:33 claymore kernel: pgd entry c06ddffc: 0000000000002063
Sep 21 20:00:33 claymore kernel: pmd entry c06ddffc: 0000000000002063
Sep 21 20:00:33 claymore kernel: pte entry c0002ffc: 0000000000000000
Sep 21 20:00:33 claymore kernel: ... pte not present!
Sep 21 20:00:33 claymore kernel: Oops: 0002
Sep 21 20:00:33 claymore kernel: CPU:    0
Sep 21 20:00:33 claymore kernel: EIP:    
0010:[usbcore:__insmod_usbcore_S.bss_L96+685868/197370836]
Sep 21 20:00:33 claymore kernel: EIP:    0010:[<c89103ac>]
Sep 21 20:00:33 claymore kernel: EFLAGS: 00010296
Sep 21 20:00:33 claymore kernel: eax: 0000002d   ebx: c5aa0400   ecx: 
00000009   edx: 00000000
Sep 21 20:00:33 claymore kernel: esi: c8934e84   edi: 00000000   ebp: 
c5aa0510   esp: c52d9d14
Sep 21 20:00:33 claymore kernel: ds: 0018   es: 0018   ss: 0018
Sep 21 20:00:33 claymore kernel: Process du (pid: 23059, stackpage=c52d9000)
Sep 21 20:00:33 claymore kernel: Stack: c88f15d5 c8927420 00000000 00000010 
00000246 00000001 000038ab 00000002
Sep 21 20:00:33 claymore kernel:        00000000 000015e0 0000000f c89a3110 
c89a3110 00000005 000015e2 c52d9e78
Sep 21 20:00:33 claymore kernel:        c52d9e78 c52d9e58 00000000 c52d9e78 
c88f2e3e c52d9e78 00000000 00000001
Sep 21 20:00:33 claymore kernel: Call Trace: 
[usbcore:__insmod_usbcore_S.bss_L96+559445/197497259] 
[usbcore:__insmod_usbcore_S.bss_L96+780192/197276512] 
[usbcore:__insmod_usbcore_S.bss_L96+1287312/196769392] 
[usbcore:__insmod_usbcore_S.bss_L96+1287312/196769392] 
[usbcore:__insmod_usbcore_S.bss_L96+565694/197491010]
Sep 21 20:00:33 claymore kernel: Call Trace: [<c88f15d5>] [<c8927420>] 
[<c89a3110>] [<c89a3110>] [<c88f2e3e>]
Sep 21 20:00:33 claymore kernel:    
[usbcore:__insmod_usbcore_S.bss_L96+523836/197532868] 
[usbcore:__insmod_usbcore_S.bss_L96+524138/197532566] 
[usbcore:__insmod_usbcore_S.bss_L96+523307/197533397] 
[usbcore:__insmod_usbcore_S.bss_L96+5039168/193017536] 
[usbcore:__insmod_usbcore_S.bss_L96+602822/197453882] 
[usbcore:__insmod_usbcore_S.bss_L96+724590/197332114]
Sep 21 20:00:33 claymore kernel:    [<c88e8abc>] [<c88e8bea>] [<c88e88ab>] 
[<c8d370c0>] [<c88fbf46>] [<c8919aee>]
Sep 21 20:00:33 claymore kernel:    
[usbcore:__insmod_usbcore_S.bss_L96+576362/197480342] 
[usbcore:__insmod_usbcore_S.bss_L96+728637/197328067] 
[usbcore:__insmod_usbcore_S.bss_L96+725626/197331078] 
[usbcore:__insmod_usbcore_S.bss_L96+724590/197332114] [d_alloc+22/336] 
[real_lookup+79/192]
Sep 21 20:00:33 claymore kernel:    [<c88f57ea>] [<c891aabd>] [<c8919efa>] 
[<c8919aee>] [<c0144026>] [<c013c11f>]
Sep 21 20:00:33 claymore kernel:    [path_walk+1382/1952] [__user_walk+58/96] 
[sys_lstat64+19/112] [system_call+51/56]
Sep 21 20:00:33 claymore kernel:    [<c013c7e6>] [<c013cd7a>] [<c0139df3>] 
[<c0106d2b>]
Sep 21 20:00:33 claymore kernel:
Sep 21 20:00:33 claymore kernel: Code: c6 05 ff ff ff ff 2a c3 55 57 56 53 56 
8b 7c 24 1c 83 ff 01
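
A note on reading the "Oops: 0002" line, in case it helps: on i386 the
low three bits of the page-fault error code give the cause (bit 0),
the access type (bit 1) and the CPU mode (bit 2). The sketch below
decodes them; decode_oops is only an illustrative helper name, not a
kernel or OpenAFS tool.

```python
def decode_oops(code):
    """Decode the low three bits of an i386 page-fault error code."""
    return {
        "cause": "protection fault" if code & 0x1 else "page not present",
        "access": "write" if code & 0x2 else "read",
        "mode": "user" if code & 0x4 else "kernel",
    }


if __name__ == "__main__":
    # Value taken from the "Oops: 0002" line in the log above.
    for key, value in decode_oops(0x0002).items():
        print(f"{key}: {value}")
```

For 0002 this gives a kernel-mode write to a page that is not present,
which agrees with the "pte not present" line and the faulting address
ffffffff. The leading Code bytes (c6 05 ff ff ff ff 2a) also decode as
a one-byte store of 0x2a to address 0xffffffff.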
