[OpenAFS-devel] Re: linux 2.6.x (FC2) SMP oops: it's not CONFIG_REGPARM
Matthew Miller
mattdm@mattdm.org
Thu, 24 Jun 2004 15:21:19 -0400
On Thu, Jun 24, 2004 at 02:52:35PM -0400, Derrick J Brashear wrote:
> Nono, widereply, like, why are you CCing me on this message? I read
> openafs-devel.
Note -- actual openafs content below this next paragraph of digression. :)
Oh, sorry. The default for the list is to reply to sender, and I must have
group reply to send to the list. I should properly set up openafs-devel as a
list in mutt and use the special list-reply key, but that's kind of annoying
to have to do (and easy to forget). The list could be switched over to have
reply-to-list as the default (which I think is better for this exact
reason), or alternately, Mailman has a feature called "avoid duplicate
copies of messages", which intelligently doesn't send you the list message
if you're on the CC or To lines already. This doesn't seem to be an option
on the openafs Mailman page, though -- either it's been made not available
as user-selectable, or the version of mailman used is really old.
Hey, wait. You weren't CC'd on that message anyway. It went to Chas, with a
CC to Jeffrey and OpenAFS-devel. Point taken that it'd be nicer for them to
have it just go once, though.
> > > Is that a fedora kernel?
> > Yes, except with the single change to the config file of turning
> > CONFIG_REGPARM off.
> It worked before you did that?
No. It hasn't worked on SMP at all ever, as I've been posting to the list.
However, Chas Williams and Jeffrey Hutzelman suggested that CONFIG_REGPARM
might be related to the problem, and suggested trying a kernel with it off.
Without CONFIG_REGPARM (so, stock FC2 2.6.6-1.435smp), it has similar
behavior (much oopsing), but different oopses:
Unable to handle kernel NULL pointer dereference at virtual address 00000001
printing eip:
00000001
*pde = 00003001
Oops: 0000 [#1]
SMP
Modules linked in: libafs 3c59x microcode ohci1394 ieee1394 pcspkr dm_mod uhci_hcd ipv6 ext3 jbd aic7xxx sd_mod scsi_mod
CPU: 1
EIP: 0060:[<00000001>] Tainted: P
EFLAGS: 00010097 (2.6.6-1.435smp)
EIP is at 0x1
eax: 34e63e3c ebx: 34e63e3c ecx: 00000000 edx: 00000003
esi: 00000001 edi: 34e63e48 ebp: 35ef4fc0 esp: 35ef4fa0
ds: 007b es: 007b ss: 0068
Process tart (pid: 10463, threadinfo=35ef4000 task=40ba3890)
Stack: 0211d66f 00000000 41f84c00 34e63e00 00000003 34e63e44 00000216 00000000
35ef4fd8 0211d744 00000000 00000000 35ef4000 34e63dfc 00000000 43b20845
40ba3af2 43b2df56 43b205a7 00000000 021051f1 34e63dfc 00000000 00000000
Call Trace:
[<0211d66f>] __wake_up_common+0x36/0x5b
[<0211d744>] complete+0x2c/0x3c
[<43b20845>] afsd_thread+0x29e/0x2ff [libafs]
[<43b205a7>] afsd_thread+0x0/0x2ff [libafs]
[<021051f1>] kernel_thread_helper+0x5/0xb
Code: Bad EIP value.
<1>general protection fault: 0000 [#2]
SMP
Modules linked in: libafs 3c59x microcode ohci1394 ieee1394 pcspkr dm_mod uhci_hcd ipv6 ext3 jbd aic7xxx sd_mod scsi_mod
CPU: 1
EIP: 0060:[<02121f99>] Tainted: P
EFLAGS: 00010246 (2.6.6-1.435smp)
EIP is at put_files_struct+0x6/0xaa
eax: ffffffff ebx: ffffffff ecx: 0000000b edx: ffffffff
esi: 022bc47c edi: 40ba3890 ebp: 0000000b esp: 35ef4e60
ds: 007b es: 007b ss: 0068
Process tart (pid: 10463, threadinfo=35ef4000 task=40ba3890)
Stack: 00000000 022bc47c 40ba3890 0000000b 02122b29 35ef4000 022bc47c 00000000
00000000 02107727 0000000b 34e63e3c 00003000 02003000 02119f73 022bc46b
00003001 022c03f2 00000001 022bc458 022bc43d 00000001 00000000 00000000
Call Trace:
[<02122b29>] do_exit+0x211/0x390
[<02107727>] do_divide_error+0x0/0xaa
[<02119f73>] do_page_fault+0x33f/0x4f6
[<0211b6d7>] recalc_task_prio+0x128/0x133
[<0211b768>] activate_task+0x86/0x93
[<02119c34>] do_page_fault+0x0/0x4f6
[<0211d66f>] __wake_up_common+0x36/0x5b
[<0211d744>] complete+0x2c/0x3c
[<43b20845>] afsd_thread+0x29e/0x2ff [libafs]
[<43b205a7>] afsd_thread+0x0/0x2ff [libafs]
[<021051f1>] kernel_thread_helper+0x5/0xb
Code: f0 ff 08 0f 94 c0 84 c0 0f 84 91 00 00 00 31 ed 89 e9 c1 e1
<1>general protection fault: 0000 [#3]
SMP
Modules linked in: libafs 3c59x microcode ohci1394 ieee1394 pcspkr dm_mod uhci_hcd ipv6 ext3 jbd aic7xxx sd_mod scsi_mod
CPU: 1
EIP: 0060:[<02122bfc>] Tainted: P
EFLAGS: 00010246 (2.6.6-1.435smp)
EIP is at do_exit+0x2e4/0x390
eax: 40ba3800 ebx: 022f2000 ecx: 0000000b edx: ffffffff
esi: 022b568f edi: 40ba3890 ebp: 0000000b esp: 35ef4dfc
ds: 007b es: 007b ss: 0068
Process tart (pid: 10463, threadinfo=35ef4000 task=40ba3890)
Stack: 35ef4000 022b568f 00000000 0000000b 02107727 0000000b ffffffff 00000000
02107ba0 fffebea8 35ef4e2c 00000000 ffffffff 0000000b ffffffff 022bc47c
40ba3890 0000000b ffffffff 022b007b 0000007b ffffffff 02121f99 00000060
Call Trace:
[<02107727>] <1>Unable to handle kernel NULL pointer dereference at virtual address 00000246
printing eip:
022a59ed
*pde = 00003001
do_divide_error+0x0/0xaa
[<02107ba0>] do_general_protection+0x0/0x23b
[<02121f99>] put_files_struct+0x6/0xaa
[<02122b29>] do_exit+0x211/0x390
[<02107727>] do_divide_error+0x0/0xaa
[<02119f73>] do_page_fault+0x33f/0x4f6
[<0211b6d7>] recalc_task_prio+0x128/0x133
[<0211b768>] activate_task+0x86/0x93
[<02119c34>] do_page_fault+0x0/0x4f6
[<0211d66f>] __wake_up_common+0x36/0x5b
[<0211d744>] complete+0x2c/0x3c
[<43b20845>] afsd_thread+0x29e/0x2ff [libafs]
[<43b205a7>] afsd_thread+0x0/0x2ff [libafs]
[<021051f1>] kernel_thread_helper+0x5/0xb
Code: f0 ff 0a 0f 94 c0 84 c0 74 07 89 d0 e8 80 7b 04 00 e8 58 26
<1>Oops: 0002 [#4]
SMP
Modules linked in: libafs 3c59x microcode ohci1394 ieee1394 pcspkr dm_mod uhci_hcd ipv6 ext3 jbd aic7xxx sd_mod scsi_mod
CPU: 0
EIP: 0060:[<022a59ed>] Tainted: P
EFLAGS: 00010002 (2.6.6-1.435smp)
EIP is at wait_for_completion+0x6d/0xc5
eax: 2f225e4c ebx: 2f225e44 ecx: 2f225dd4 edx: 00000246
esi: 2f225dbc edi: 2f225ddc ebp: 2f225df4 esp: 2f225da8
ds: 007b es: 007b ss: 0068
Process afsd (pid: 10464, threadinfo=2f225000 task=415351d0)
Stack: 00000000 415351d0 0211d62d 00000000 00000000 2f225dd4 0211d6b1 00000000
00000001 415351d0 0211d62d 2f225e4c 00000000 00000246 03966000 00000001
2f225e44 00000000 fef01cf8 ffffffe7 43b20989 00000000 2f225e44 00000001
Call Trace:
[<0211d62d>] default_wake_function+0x0/0xc
[<0211d6b1>] __wake_up+0x1d/0x2c
[<0211d62d>] default_wake_function+0x0/0xc
[<43b20989>] afs_DaemonOp+0xc2/0xc9 [libafs]
[<43b208a6>] afsd_launcher+0x0/0x21 [libafs]
[<43b209fa>] afs_syscall_call+0x6a/0x7d9 [libafs]
[<0214fa0e>] rw_vm+0x242/0x26b
[<43b21239>] afs_syscall+0xaa/0x1a6 [libafs]
[<43b1d858>] afs_ioctl+0x41/0x4d [libafs]
[<021513ba>] filp_open+0x36/0x3c
[<02160d37>] file_ioctl+0x167/0x17b
[<02160f34>] sys_ioctl+0x1e9/0x243
Code: 89 0a 89 55 e4 b8 00 f0 ff ff 21 e0 8b 00 c7 00 02 00 00 00
<1>Unable to handle kernel NULL pointer dereference at virtual address 00000001
printing eip:
00000001
*pde = 00003001
Oops: 0000 [#5]
SMP
Modules linked in: libafs 3c59x microcode ohci1394 ieee1394 pcspkr dm_mod uhci_hcd ipv6 ext3 jbd aic7xxx sd_mod scsi_mod
CPU: 0
EIP: 0060:[<00000001>] Tainted: P
EFLAGS: 00010097 (2.6.6-1.435smp)
EIP is at 0x1
eax: 2f225e3c ebx: 2f225e3c ecx: 00000000 edx: 00000003
esi: 00000001 edi: 2f225e48 ebp: 3481afc0 esp: 3481afa0
ds: 007b es: 007b ss: 0068
Process art (pid: 10466, threadinfo=3481a000 task=40db8750)
Stack: 0211d66f 00000000 00100011 2f225e46 00000003 2f225e44 00000216 00000000
3481afd8 0211d744 00000000 00000000 00000000 2f225dfc 00000000 43b205ee
40db89b2 43b2decf 43b205a7 00000000 021051f1 2f225dfc 00000000 <1>Unable to handle kernel NULL pointer dereference at virtual address 00000246
printing eip:
022a59ed
*pde = 00003001
00000000
Call Trace:
[<0211d66f>] __wake_up_common+0x36/0x5b
[<0211d744>] complete+0x2c/0x3c
[<43b205ee>] afsd_thread+0x47/0x2ff [libafs]
[<43b205a7>] afsd_thread+0x0/0x2ff [libafs]
[<021051f1>] kernel_thread_helper+0x5/0xb
Code: Bad EIP value.
<1>Oops: 0002 [#6]
SMP
Modules linked in: libafs 3c59x microcode ohci1394 ieee1394 pcspkr dm_mod uhci_hcd ipv6 ext3 jbd aic7xxx sd_mod scsi_mod
CPU: 1
EIP: 0060:[<022a59ed>] Tainted: P
EFLAGS: 00010002 (2.6.6-1.435smp)
EIP is at wait_for_completion+0x6d/0xc5
eax: 35ef4e4c ebx: 35ef4e44 ecx: 35ef4dd4 edx: 00000246
esi: 35ef4dbc edi: 35ef4ddc ebp: 35ef4df4 esp: 35ef4da8
ds: 007b es: 007b ss: 0068
Process afsd (pid: 10465, threadinfo=35ef4000 task=40ba3890)
Stack: 00000000 40ba3890 0211d62d 00000000 00000000 35ef4dd4 0211d6b1 00000000
00000001 40ba3890 0211d62d 35ef4e4c 00000000 00000202 03966000 00000001
35ef4e44 00000013 fffffff6 ffffffe7 43b20989 00000013 35ef4e44 00000001
Call Trace:
[<0211d62d>] default_wake_function+0x0/0xc
[<0211d6b1>] __wake_up+0x1d/0x2c
[<0211d62d>] default_wake_function+0x0/0xc
[<43b20989>] afs_DaemonOp+0xc2/0xc9 [libafs]
[<43b208a6>] afsd_launcher+0x0/0x21 [libafs]
[<43b209fa>] afs_syscall_call+0x6a/0x7d9 [libafs]
[<0214fa0e>] rw_vm+0x242/0x26b
[<43b21239>] afs_syscall+0xaa/0x1a6 [libafs]
[<43b1d858>] afs_ioctl+0x41/0x4d [libafs]
[<021513ba>] filp_open+0x36/0x3c
[<02160d37>] file_ioctl+0x167/0x17b
[<02160f34>] sys_ioctl+0x1e9/0x243
Code: 89 0a 89 55 e4 b8 00 f0 ff ff 21 e0 8b 00 c7 00 02 00 00 00
<1>general protection fault: 0000 [#7]
SMP
Modules linked in: libafs 3c59x microcode ohci1394 ieee1394 pcspkr dm_mod uhci_hcd ipv6 ext3 jbd aic7xxx sd_mod scsi_mod
CPU: 0
EIP: 0060:[<02121f99>] Tainted: P
EFLAGS: 00010246 (2.6.6-1.435smp)
Unable to handle kernel NULL pointer dereference at virtual address 00000001
printing eip:
00000001
*pde = 00003001
EIP is at put_files_struct+0x6/0xaa
eax: ffffffff ebx: ffffffff ecx: 0000000b edx: ffffffff
esi: 022bc47c edi: 40db8750 ebp: 0000000b esp: 3481ae60
ds: 007b es: 007b ss: 0068
Process art (pid: 10466, threadinfo=3481a000 task=40db8750)
Stack: 00000000 022bc47c 40db8750 0000000b 02122b29 3481a000 022bc47c 00000000
00000000 02107727 0000000b 2f225e3c 00003000 02003000 02119f73 022bc46b
00003001 022c03f2 00000001 022bc458 022bc43d 00000001 00000000 00000000
Call Trace:
[<02122b29>] do_exit+0x211/0x390
[<02107727>] do_divide_error+0x0/0xaa
[<02119f73>] do_page_fault+0x33f/0x4f6
[<0211b6d7>] recalc_task_prio+0x128/0x133
[<0211b768>] activate_task+0x86/0x93
[<02119c34>] do_page_fault+0x0/0x4f6
[<0211d66f>] __wake_up_common+0x36/0x5b
[<0211d744>] complete+0x2c/0x3c
[<43b205ee>] afsd_thread+0x47/0x2ff [libafs]
[<43b205a7>] afsd_thread+0x0/0x2ff [libafs]
[<021051f1>] kernel_thread_helper+0x5/0xb
Code: f0 ff 08 0f 94 c0 84 c0 0f 84 91 00 00 00 31 ed 89 e9 c1 e1
<1>Oops: 0000 [#8]
SMP
Modules linked in: libafs 3c59x microcode ohci1394 ieee1394 pcspkr dm_mod uhci_hcd ipv6 ext3 jbd aic7xxx sd_mod scsi_mod
CPU: 1
EIP: 0060:[<00000001>] Tainted: P
EFLAGS: 00010097 (2.6.6-1.435smp)
EIP is at 0x1
eax: 35ef4e3c ebx: 35ef4e3c ecx: 00000000 edx: 00000003
esi: 00000001 edi: 35ef4e48 ebp: 3521ffc0 esp: 3521ffa0
ds: 007b es: 007b ss: 0068
Process tart (pid: 10467, threadinfo=3521f000 task=40efa730)
Stack: 0211d66f 00000000 00000000 35ef4e46 00000003 35ef4e44 00000216 00000000
3521ffd8 0211d744 00000000 00000000 00000013 35ef4dfc 00000000 43b207e6
40efa992 43b2df3d 43b205a7 00000000 021051f1 35ef4dfc 00000000 00000000
Call Trace:
[<0211d66f>] __wake_up_common+0x36/0x5b
[<0211d744>] complete+0x2c/0x3c
[<43b207e6>] afsd_thread+0x23f/0x2ff [libafs]
[<43b205a7>] afsd_thread+0x0/0x2ff [libafs]
[<021051f1>] kernel_thread_helper+0x5/0xb
Code: Bad EIP value.
<1>general protection fault: 0000 [#9]
SMP
Modules linked in: libafs 3c59x microcode ohci1394 ieee1394 pcspkr dm_mod uhci_hcd ipv6 ext3 jbd aic7xxx sd_mod scsi_mod
CPU: 0
EIP: 0060:[<02122bfc>] Tainted: P
EFLAGS: 00010246 (2.6.6-1.435smp)
EIP is at do_exit+0x2e4/0x390
eax: 40db8700 ebx: 022f2000 ecx: 0000000b edx: ffffffff
esi: 022b568f edi: 40db8750 ebp: 0000000b esp: 3481adfc
ds: 007b es: 007b ss: 0068
Process art (pid: 10466, threadinfo=3481a000 task=40db8750)
Stack: 3481a000 022b568f 00000000 0000000b 02107727 0000000b ffffffff 00000000
02107ba0 fffebea8 3481ae2c 00000000 ffffffff 0000000b ffffffff 022bc47c
40db8750 0000000b ffffffff 022b007b 0000007b ffffffff 02121f99 00000060
Call Trace:
[<02107727>] do_divide_error+0x0/0xaa
[<02107ba0>] do_general_protection+0x0/0x23b
[<02121f99>] put_files_struct+0x6/0xaa
[<02122b29>] do_exit+0x211/0x390
[<02107727>] do_divide_error+0x0/0xaa
[<02119f73>] do_page_fault+0x33f/0x4f6
[<0211b6d7>] recalc_task_prio+0x128/0x133
[<0211b768>] activate_task+0x86/0x93
[<02119c34>] do_page_fault+0x0/0x4f6
[<0211d66f>] __wake_up_common+0x36/0x5b
[<0211d744>] complete+0x2c/0x3c
[<43b205ee>] afsd_thread+0x47/0x2ff [libafs]
[<43b205a7>] afsd_thread+0x0/0x2ff [libafs]
[<021051f1>] kernel_thread_helper+0x5/0xb
Code: f0 ff 0a 0f 94 c0 84 c0 74 07 89 d0 e8 80 7b 04 00 e8 58 26
<1>general protection fault: 0000 [#10]
SMP
Modules linked in: libafs 3c59x microcode ohci1394 ieee1394 pcspkr dm_mod uhci_hcd ipv6 ext3 jbd aic7xxx sd_mod scsi_mod
CPU: 1
EIP: 0060:[<02121f99>] Tainted: P
EFLAGS: 00010246 (2.6.6-1.435smp)
EIP is at put_files_struct+0x6/0xaa
eax: ffffffff ebx: ffffffff ecx: 0000000b edx: ffffffff
esi: 022bc47c edi: 40efa730 ebp: 0000000b esp: 3521fe60
ds: 007b es: 007b ss: 0068
Process tart (pid: 10467, threadinfo=3521f000 task=40efa730)
Stack: 00000000 022bc47c 40efa730 0000000b 02122b29 3521f000 022bc47c 00000000
00000000 02107727 0000000b 35ef4e3c 00003000 02003000 02119f73 022bc46b
00003001 022c03f2 00000001 022bc458 022bc43d 00000001 00000000 00000000
Call Trace:
[<02122b29>] do_exit+0x211/0x390
[<02107727>] Starting AFS cache scan...do_divide_error+0x0/0xaa
[<02119f73>] do_page_fault+0x33f/0x4f6
[<0211b6d7>] recalc_task_prio+0x128/0x133
[<0211b768>] activate_task+0x86/0x93
[<02119c34>] do_page_fault+0x0/0x4f6
[<0211d66f>] __wake_up_common+0x36/0x5b
[<0211d744>] complete+0x2c/0x3c
[<43b207e6>] afsd_thread+0x23f/0x2ff [libafs]
[<43b205a7>] afsd_thread+0x0/0x2ff [libafs]
[<021051f1>] kernel_thread_helper+0x5/0xb
Code: f0 ff 08 0f 94 c0 84 c0 0f 84 91 00 00 00 31 ed 89 e9 c1 e1
<1>general protection fault: 0000 [#11]
SMP
Modules linked in: libafs 3c59x microcode ohci1394 ieee1394 pcspkr dm_mod uhci_hcd ipv6 ext3 jbd aic7xxx sd_mod scsi_mod
CPU: 1
EIP: 0060:[<02122bfc>] Tainted: P
EFLAGS: 00010246 (2.6.6-1.435smp)
EIP is at do_exit+0x2e4/0x390
eax: 40efa700 ebx: 022f2000 ecx: 0000000b edx: ffffffff
esi: 022b568f edi: 40efa730 ebp: 0000000b esp: 3521fdfc
ds: 007b es: 007b ss: 0068
Process tart (pid: 10467, threadinfo=3521f000 task=40efa730)
Stack: 3521f000 022b568f 00000000 0000000b 02107727 0000000b ffffffff 00000000
02107ba0 fffebea8 3521fe2c 00000000 ffffffff 0000000b ffffffff 022bc47c
40efa730 0000000b ffffffff 022b007b 0000007b ffffffff 02121f99 00000060
Call Trace:
[<02107727>] do_divide_error+0x0/0xaa
[<02107ba0>] do_general_protection+0x0/0x23b
[<02121f99>] put_files_struct+0x6/0xaa
[<02122b29>] do_exit+0x211/0x390
[<02107727>] do_divide_error+0x0/0xaa
[<02119f73>] do_page_fault+0x33f/0x4f6
[<0211b6d7>] recalc_task_prio+0x128/0x133
[<0211b768>] activate_task+0x86/0x93
[<02119c34>] do_page_fault+0x0/0x4f6
[<0211d66f>] __wake_up_common+0x36/0x5b
[<0211d744>] complete+0x2c/0x3c
[<43b207e6>] afsd_thread+0x23f/0x2ff [libafs]
[<43b205a7>] afsd_thread+0x0/0x2ff [libafs]
[<021051f1>] kernel_thread_helper+0x5/0xb
Code: f0 ff 0a 0f 94 c0 84 c0 74 07 89 d0 e8 80 7b 04 00 e8 58 26
<1>Unable to handle kernel NULL pointer dereference at virtual address 00000019
printing eip:
43b1d241
*pde = 00003001
Oops: 0000 [#12]
SMP
Modules linked in: libafs 3c59x microcode ohci1394 ieee1394 pcspkr dm_mod uhci_hcd ipv6 ext3 jbd aic7xxx sd_mod scsi_mod
CPU: 1
EIP: 0060:[<43b1d241>] Tainted: P
EFLAGS: 00010246 (2.6.6-1.435smp)
EIP is at osi_InitCacheInfo+0x23/0x6b [libafs]
eax: 2fe54d18 ebx: 39fa7d58 ecx: 00000000 edx: 00000001
esi: 39fa7d58 edi: 39fa7d72 ebp: ffffffe7 esp: 3531ee40
ds: 007b es: 007b ss: 0068
Process afsd (pid: 10461, threadinfo=3531e000 task=41534c30)
Stack: 2fe54d18 43aed751 00000100 0806dd60 39fa7d58 ffffffe7 0214fcc9 00000000
00000007 39fa7d58 00000007 43b20e06 00000000 000318c9 00000000 00000000
00000000 fef01cf8 0806dd60 00000000 00000010 00000010 00000890 3531eefc
Call Trace:
[<43aed751>] afs_InitCacheInfo+0x26/0xce [libafs]
[<0214fcc9>] get_user_size+0x2e/0x55
[<43b20e06>] afs_syscall_call+0x476/0x7d9 [libafs]
[<0214fa0e>] rw_vm+0x242/0x26b
[<43b21239>] afs_syscall+0xaa/0x1a6 [libafs]
[<43b1d858>] afs_ioctl+0x41/0x4d [libafs]
[<021513ba>] filp_open+0x36/0x3c
[<02160d37>] file_ioctl+0x167/0x17b
[<02160f34>] sys_ioctl+0x1e9/0x243
Code: 8b 52 18 89 15 bc 0e b4 43 8b 50 08 8b 92 8c 00 00 00 8b 52
--
Matthew Miller mattdm@mattdm.org <http://www.mattdm.org/>
Boston University Linux ------> <http://linux.bu.edu/>