[OpenAFS] Bug in OpenAFS 1.4.12?

Claudio Prono claudio.prono@atpss.net
Tue, 12 Oct 2010 10:00:12 +0200


Hello all,

I run OpenAFS on an openSUSE 11.3 (32-bit) machine. This morning I found
the system completely locked up: no video signal, no response.

I looked through the machine's logs and found this:


Oct 12 05:07:46 kerberos kernel: [1537890.352252] BUG: soft lockup - CPU#1 stuck for 61s! [afs_cachetrim:7467]
Oct 12 05:07:46 kerberos kernel: [1537890.352252] Modules linked in: bluetooth rfkill af_packet libafs(P) mperf reiserfs loop dm_mod sg pl2303 floppy osst sr_mod pcspkr i2c_piix4 usbserial cdrom container tg3 i2c_core st cpqphp pci_hotplug mptctl hpwdt hpilo sworks_agp button ohci_hcd rtc_cmos ehci_hcd rtc_core rtc_lib usbcore edd fan processor ata_generic mptspi mptscsih mptbase scsi_transport_spi pata_serverworks libata thermal thermal_sys hwmon cciss scsi_mod [last unloaded: preloadtrace]
Oct 12 05:07:46 kerberos kernel: [1537890.352252] Modules linked in: bluetooth rfkill af_packet libafs(P) mperf reiserfs loop dm_mod sg pl2303 floppy osst sr_mod pcspkr i2c_piix4 usbserial cdrom container tg3 i2c_core st cpqphp pci_hotplug mptctl hpwdt hpilo sworks_agp button ohci_hcd rtc_cmos ehci_hcd rtc_core rtc_lib usbcore edd fan processor ata_generic mptspi mptscsih mptbase scsi_transport_spi pata_serverworks libata thermal thermal_sys hwmon cciss scsi_mod [last unloaded: preloadtrace]
Oct 12 05:07:46 kerberos kernel: [1537890.352252]
Oct 12 05:07:46 kerberos kernel: [1537890.352252] Pid: 7467, comm: afs_cachetrim Tainted: P           2.6.34.7-0.2-default #1 /ProLiant DL380 G3
Oct 12 05:07:46 kerberos kernel: [1537890.352252] EIP: 0060:[<f981bb26>] EFLAGS: 00000202 CPU: 1
Oct 12 05:07:46 kerberos kernel: [1537890.352252] EIP is at afs_CacheTruncateDaemon+0x366/0x4c0 [libafs]
Oct 12 05:07:46 kerberos kernel: [1537890.352252] EAX: 00000000 EBX: 00000004 ECX: 00000001 EDX: 00000000
Oct 12 05:07:46 kerberos kernel: [1537890.352252] ESI: 00000000 EDI: 000002a8 EBP: 0000000a ESP: f5c41fb4
Oct 12 05:07:46 kerberos kernel: [1537890.352252]  DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
Oct 12 05:07:46 kerberos kernel: [1537890.352252] Process afs_cachetrim (pid: 7467, ti=f5c40000 task=c23304b0 task.ti=f5c40000)
Oct 12 05:07:46 kerberos kernel: [1537890.352252] Stack:
Oct 12 05:07:46 kerberos kernel: [1537890.352252]  c23304b0 fffffe28 c23304b0 c23304b0 c2249e08 00000000 f9869a35 c23307c0
Oct 12 05:07:46 kerberos kernel: [1537890.352252] <0> f9878382 000000d8 00000000 f9869640 c2249e08 c02037a6 00000000 00000000
Oct 12 05:07:46 kerberos kernel: [1537890.352252] <0> 00000000 0812d630 00000000
Oct 12 05:07:46 kerberos kernel: [1537890.352252] Call Trace:
Oct 12 05:07:46 kerberos kernel: [1537890.352252]  [<f9869a35>] afsd_thread+0x3f5/0x620 [libafs]
Oct 12 05:07:47 kerberos kernel: [1537890.352252]  [<c02037a6>] kernel_thread_helper+0x6/0x10
Oct 12 05:07:47 kerberos kernel: [1537890.352252] Code: a3 34 c2 88 f9 89 15 30 c2 88 f9 7e 18 2d 40 42 0f 00 83 c2 01 a3 34 c2 88 f9 89 15 30 c2 88 f9 90 8d 74 26 00 8b 15 ec 74 89 f9 <81> fa d5 00 00 00 0f 85 06 fd ff ff 8d b6 00 00 00 00 b8 ec 74
Oct 12 05:07:47 kerberos kernel: [1537890.352252] Call Trace:
Oct 12 05:07:47 kerberos kernel: [1537890.352252]  [<f9869a35>] afsd_thread+0x3f5/0x620 [libafs]
Oct 12 05:07:47 kerberos kernel: [1537890.352252]  [<c02037a6>] kernel_thread_helper+0x6/0x10
Oct 12 05:08:52 kerberos kernel: [1537955.848252] BUG: soft lockup - CPU#1 stuck for 61s! [afs_cachetrim:7467]
Oct 12 05:08:52 kerberos kernel: [1537955.848252] Modules linked in: bluetooth rfkill af_packet libafs(P) mperf reiserfs loop dm_mod sg pl2303 floppy osst sr_mod pcspkr i2c_piix4 usbserial cdrom container tg3 i2c_core st cpqphp pci_hotplug mptctl hpwdt hpilo sworks_agp button ohci_hcd rtc_cmos ehci_hcd rtc_core rtc_lib usbcore edd fan processor ata_generic mptspi mptscsih mptbase scsi_transport_spi pata_serverworks libata thermal thermal_sys hwmon cciss scsi_mod [last unloaded: preloadtrace]
Oct 12 05:08:52 kerberos kernel: [1537955.848252] Modules linked in: bluetooth rfkill af_packet libafs(P) mperf reiserfs loop dm_mod sg pl2303 floppy osst sr_mod pcspkr i2c_piix4 usbserial cdrom container tg3 i2c_core st cpqphp pci_hotplug mptctl hpwdt hpilo sworks_agp button ohci_hcd rtc_cmos ehci_hcd rtc_core rtc_lib usbcore edd fan processor ata_generic mptspi mptscsih mptbase scsi_transport_spi pata_serverworks libata thermal thermal_sys hwmon cciss scsi_mod [last unloaded: preloadtrace]

The same report repeats until 07:10, ending with these last messages:

Oct 12 07:10:02 kerberos kernel: [1545226.156253] Call Trace:
Oct 12 07:10:02 kerberos kernel: [1545226.156253]  [<f9869a35>] afsd_thread+0x3f5/0x620 [libafs]
Oct 12 07:10:02 kerberos kernel: [1545226.156253]  [<c02037a6>] kernel_thread_helper+0x6/0x10

After that, nothing more was logged until the system was rebooted manually.

What could the problem be? Is this an AFS bug?


Cordially,

Claudio Prono.



-- 
--------------------------------------------------------------------------------
Claudio Prono                         OPST
System Developer               
                                      Gsm: +39-349-54.33.258
@PSS Srl                              Tel: +39-011-32.72.100
Via San Bernardino, 17                Fax: +39-011-32.46.497
10141 Torino - ITALY                  http://atpss.net/disclaimer
--------------------------------------------------------------------------------
PGP Key - http://keys.atpss.net/c_prono.asc