[OpenAFS] Stability of AFS

Paul Blackburn mpb@est.ibm.com
Thu, 24 Oct 2002 09:35:46 +0100


Hello Marcin,

Just to let you know that we are using OpenAFS 1.2.7 installed
as a set of RPMs on a RedHat 7.3 with kernel 2.4.18-10smp.

The kernel module is: libafs-2.4.18-10-i686.mp.
Our AFS fileserver partitions are ext3.

This has been stable for us.
--
cheers
paul                          http://acm.org/~mpb

makowskm@chemia.uj.edu.pl wrote:

>We are using AFS for few months in our organization. For two weeks we have
>constant problems with stability of file system. Every 2(3) days it
>collapses producing system logs like those:
>
>Oct 23 15:18:31 porsacz kernel: Unable to handle kernel paging request at
>virtual address 0f3c8b21
>Oct 23 15:18:31 porsacz kernel:  printing eip:
>Oct 23 15:18:31 porsacz kernel: f883bde3
>Oct 23 15:18:31 porsacz kernel: *pde = 00000000
>Oct 23 15:18:31 porsacz kernel: Oops: 0002
>Oct 23 15:18:31 porsacz kernel: libafs-2.4.18-10-athlon.mp soundcore
>eepro100 ext3 jbd 3w-xxxx sd_mod scsi_mod
>Oct 23 15:18:31 porsacz kernel: CPU:    1
>Oct 23 15:18:31 porsacz kernel: EIP:    0010:[<f883bde3>]    Tainted: PF
>Oct 23 15:18:31 porsacz kernel: EFLAGS: 00010246
>Oct 23 15:18:31 porsacz kernel:
>Oct 23 15:18:31 porsacz kernel: EIP is at journal_commit_transaction [jbd]
>0x7c3 (2.4.18-10smp)
>Oct 23 15:18:31 porsacz kernel: eax: 0f3c8b11   ebx: f6488c90   ecx:
>00000b5c   edx: f6837840
>Oct 23 15:18:31 porsacz kernel: esi: 00000000   edi: f6946600   ebp:
>e3787f90   esp: f69bde80
>Oct 23 15:18:31 porsacz kernel: ds: 0018   es: 0018   ss: 0018
>Oct 23 15:18:31 porsacz kernel: Process kjournald (pid: 149,
>stackpage=f69bd000)
>Oct 23 15:18:31 porsacz kernel: Stack: 00003016 00000000 00000f9c c5363064
>0000000a cc065ac0 cd977bd0 00000d77
>Oct 23 15:18:31 porsacz kernel:        00000001 ec274700 ec7e15c0 00000000
>d7bbc3c0 cb1c1240 cb1c11c0 cb1c1140
>Oct 23 15:18:31 porsacz kernel:        cb1c10c0 cb5d3f40 cb5d3ec0 cb5d3e40
>cb5d3dc0 cb5d3d40 cb1c1d40 cb1c1cc0
>Oct 23 15:18:31 porsacz kernel: Call Trace: [<f883e7e6>] kjournald [jbd]
>0x136
>Oct 23 15:18:31 porsacz kernel: [<f883e690>] commit_timeout [jbd] 0x0
>Oct 23 15:18:31 porsacz kernel: [<c0107286>] kernel_thread [kernel] 0x26
>Oct 23 15:18:31 porsacz kernel: [<f883e6b0>] kjournald [jbd] 0x0
>Oct 23 15:18:31 porsacz kernel:
>Oct 23 15:18:31 porsacz kernel:
>Oct 23 15:18:31 porsacz kernel: Code: f0 ff 40 10 8b 03 f0 0f ba 68 18 0a
>8b 44 24 1c 50 8d 44 24
>
>	Checking the server status after such events don't show anything wrong,
>but in fact none of the AFS clients can get to file system. All what can
>be done is to obtain a token.The only way to bring back functionality is
>restarting the server machine.
>
>	We are using OpenAFS ver.1.2.6 on RedHat 7.3 with OpenAFS modules
>compiled for our kernel (2.4.18-10smp). The server works as SMP with two
>Athlons1800+.The file system is located on the RAID5 with ext3 type
>partition. The machine has both AFS server and client functionality and
>the client cache is located on a separate partition of ext2 type.
>
>Could anyone help us to explain the instability of AFS in such configuration?
>
>Yours,
>
>Marcin Makowski
>Department of the Theoretical Chemistry
>Jagiellonian University
>makowskm@chemia.uj.edu.pl
>
>
>_______________________________________________
>OpenAFS-info mailing list
>OpenAFS-info@openafs.org
>https://lists.openafs.org/mailman/listinfo/openafs-info
>