[OpenAFS-devel] Re: Patched Solaris 8 crashes openAFS

Kostas Georgiou k.georgiou@imperial.ac.uk
Thu, 13 May 2004 14:19:50 +0100 (BST)


Hi,

I am afraid i have no clue on what's wrong, it can be that one of the latest 
solaris patches broke the kernel compatibility. I am adding openafs-devel
to the discussion it's likely that someone else has the same problem 
(nothing in the archives though).

Kostas

On Thu, 13 May 2004, Rhys Morris wrote:

> 
> Hi Kostas + Hepsysman,
> 
> OK, downloaded 8_recommended yesterday, installed it, rebooted this 
> morning. we're now at kernel 117350-01 (what happened to the 117000 
> series I don't know).
> 
> # uname -a
> SunOS horus 5.8 Generic_117350-01 sun4u sparc SUNW,Ultra-4
> 
> I deleted the AFS cache, then started AFS, and guess what? After a 
> couple of minutes of checking the cache, down it went...
> 
> Log messages follow:
> 
> Any suggestions? I could build AFS from source, but I have bad 
> memories of doing that last time...
> 
> Rhys
> 
> 
> May 13 13:13:13 horus afs: [ID 888289 kern.notice] Starting AFS cache 
> scan...
> May 13 13:24:16 horus unix: [ID 836849 kern.notice]
> May 13 13:24:16 horus ^Mpanic[cpu2]/thread=300050c61a0:
> May 13 13:24:16 horus unix: [ID 340138 kern.notice] BAD TRAP: type=31 
> rp=2a10074
> ab00 addr=0 mmu_fsr=0 occurred in module "genunix" due to a NULL 
> pointer derefer
> ence
> May 13 13:24:16 horus unix: [ID 100000 kern.notice]
> May 13 13:24:16 horus unix: [ID 839527 kern.notice] afsd:
> May 13 13:24:16 horus unix: [ID 520581 kern.notice] trap type = 0x31
> May 13 13:24:16 horus unix: [ID 101969 kern.notice] pid=914, 
> pc=0x100ee120, sp=0
> x2a10074a3a1, tstate=0x980001600, context=0x11a7
> May 13 13:24:16 horus unix: [ID 743441 kern.notice] g1-g7: 78018080, 
> 0, 300015ea
> 240, 0, 2, 0, 300050c61a0
> May 13 13:24:16 horus unix: [ID 100000 kern.notice]
> May 13 13:24:16 horus genunix: [ID 723222 kern.notice] 
> 000002a10074a710 unix:die
> +80 (31, 0, 10415278, 0, 2a10074ab00, c2492000)
> May 13 13:24:16 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 0000000000000001
>  0000000001010100 00000000ff000000 0000000000ff0000
> May 13 13:24:16 horus   %l4-7: 000000000000ff00 00000000fefefeff 
> 000000000000000
> 1 0000000000000000
> May 13 13:24:16 horus genunix: [ID 723222 kern.notice] 
> 000002a10074a7f0 unix:tra
> p+8b8 (0, 1, 5, 0, 2a10074ab00, 0)
> May 13 13:24:16 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 0000000000000001
>  0000000078016800 0000030005109568 0000000000000000
> May 13 13:24:16 horus   %l4-7: 0000000000000031 00000300050b6ac8 
> 000000000001000
> 0 000002a10074bba0
> May 13 13:24:16 horus genunix: [ID 723222 kern.notice] 
> 000002a10074a930 unix:sfm
> mu_tsb_miss+66c (10429b30, 0, 3000016bf88, 0, 3000016bf88, 19)
> May 13 13:24:16 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 0000000000000000
>  0000000000000004 000000000000f69c 000003100007f100
> May 13 13:24:16 horus   %l4-7: 0000000000000000 000000007801c208 
> 000000000000000
> 0 0000000000000003
> May 13 13:24:16 horus genunix: [ID 723222 kern.notice] 
> 000002a10074aa50 unix:pro
> m_rtt+0 (300015ea240, 0, 40a36917, 40a36911, 1fb6, 40a368f0)
> May 13 13:24:16 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 0000000000000001
>  0000000000001400 0000000980001600 000000001001a8a0
> May 13 13:24:16 horus   %l4-7: 0000030002ba76e0 0000000040a368f0 
> 000000000000000
> 0 000002a10074ab00
> May 13 13:24:16 horus genunix: [ID 723222 kern.notice] 
> 000002a10074aba0 afs:afs_
> choose_cell_by_name+8 (30005148f20, 0, 400000, 0, 16, 14)
> May 13 13:24:16 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 0000000000000000
>  0000000078016400 00000300015e2028 000002a1005e55f8
> May 13 13:24:16 horus   %l4-7: 0000030002ba7420 0000000000000400 
> 00000000ffbec7e
> 4 000000000006bf47
> May 13 13:24:16 horus genunix: [ID 723222 kern.notice] 
> 000002a10074ac70 afs:afs_
> TraverseCells_nl+40 (781085c0, 0, 30005148fc0, 30005148f20, 0, 
> 30005148f20)
> May 13 13:24:16 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 00000300016af7d8
>  0000000078018080 0000000000000041 0000000000000000
> May 13 13:24:16 horus   %l4-7: 0000000000000000 0000000000400000 
> 000000000005c94
> 0 0000000000000000
> May 13 13:24:17 horus genunix: [ID 723222 kern.notice] 
> 000002a10074ad60 afs:afs_
> TraverseCells+94 (781085c0, 0, 300050c61a0, 300050c61a0, 10424d10, 
> 3000170ab28)
> May 13 13:24:17 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 0000000000000392
>  0000000078019400 0000000000000041 0000000000000000
> May 13 13:24:17 horus   %l4-7: 0000000000000000 0000000000400000 
> 000000000005a8c
> 0 0000000000000000
> May 13 13:24:17 horus genunix: [ID 723222 kern.notice] 
> 000002a10074ae40 afs:afs_
> FindCellByName+20 (0, 1, 11440000, c00000007d480676, 3000170aab0, 
> 3000016a9e8)
> May 13 13:24:17 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 000000001041d440
>  0000000000000016 0000000000000009 000000001000a408
> May 13 13:24:17 horus   %l4-7: 000003000008e188 00000300015f9578 
> 000000000000000
> 0 000002a10000b910
> May 13 13:24:17 horus genunix: [ID 723222 kern.notice] 
> 000002a10074af10 afs:afs_
> GetCellByName+8 (0, 1, 2a10074b098, 0, 16, 2a10074b1fc)
> May 13 13:24:17 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 0000000000000009
>  0000000001010100 00000000ff000000 0000000000ff0000
> May 13 13:24:17 horus   %l4-7: 000000000000ff00 00000000fefefeff 
> 000000000000000
> 1 0000000000000000
> May 13 13:24:17 horus genunix: [ID 723222 kern.notice] 
> 000002a10074afe0 afs:afs_
> GetPrimaryCell+1c (1, 7819c9a0, 10424d10, 21, 5e4, 0)
> May 13 13:24:17 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 0000000078021800
>  0000000078016800 00000300015e2028 00000300050c5200
> May 13 13:24:17 horus   %l4-7: 00000300050c6ca0 00000300050b6ac8 
> 00000300050c61a
> 0 000002a10074bba0
> May 13 13:24:17 horus genunix: [ID 723222 kern.notice] 
> 000002a10074b0a0 afs:afs_
> CheckRootVolume+c4 (0, 40a368f0, 0, 1, 0, 300050c5200)
> May 13 13:24:17 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 0000000000000000
>  0000000078016400 000000000000f69c 0000000000000000
> May 13 13:24:17 horus   %l4-7: 0000000078018c00 000000007801c208 
> 000000000000000
> 0 000002a10074b1b0
> May 13 13:24:17 horus genunix: [ID 723222 kern.notice] 
> 000002a10074b220 afs:afs_
> Daemon+66c (40a36808, 40a3695f, 40a36917, 40a36911, 1fb6, 40a368f0)
> May 13 13:24:17 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 0000000000000065
>  0000000040a368f0 0000000040a36970 000002a10074b5f8
> May 13 13:24:17 horus   %l4-7: 0000030002ba76e0 0000000040a368f0 
> 000000000000000
> 0 0000000040a36938
> May 13 13:24:17 horus genunix: [ID 723222 kern.notice] 
> 000002a10074b310 afs:afs_
> syscall_call+390 (1, 1, ffbeebe4, ffbedbe4, 2216c, 0)
> May 13 13:24:17 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 0000000000000002
>  000000007801f000 000003000016a9e0 0000000000000001
> May 13 13:24:17 horus   %l4-7: 0000000000062000 0000000000042000 
> 000000000000000
> f 0000031002a81c20
> May 13 13:24:17 horus genunix: [ID 723222 kern.notice] 
> 000002a10074b7b0 afs:Afs_
> syscall+d4 (2a10074b978, 2a10074b978, 23, 1, 0, 18)
> May 13 13:24:17 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 0000030000a413e8
>  0000000000000001 0000000000000000 0000000000000001
> May 13 13:24:17 horus   %l4-7: 00000000ffbeebe4 00000000ffbedbe4 
> 000000000002216
> c 0000000000000000
> May 13 13:24:18 horus genunix: [ID 723222 kern.notice] 
> 000002a10074b8c0 genunix:
> syscall_ap+6c (10435638, 2a10074bba0, 300050b6ac8, ffbeebe4, ffbedbe4, 
> 300000988
> d8)
> May 13 13:24:18 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 0000000078188ce0
>  000003000016a9e0 0000030002e91890 0000000000000000
> May 13 13:24:18 horus   %l4-7: 0000000000062000 0000000000064000 
> 0000030002bb100
> 0 0000000000002000
> May 13 13:24:18 horus genunix: [ID 723222 kern.notice] 
> 000002a10074b980 genunix:
> loadable_syscall+90 (300000988d8, 10435638, 1400, 6, ffbedbe4, 2216c)
> May 13 13:24:18 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 00000000ffbedbe4
>  00000000ffbeebe4 000000000000001c 0000000000000001
> May 13 13:24:18 horus   %l4-7: 0000000000000001 0000000000022000 
> 0000030002bb0fc
> 8 0000000000002000
> May 13 13:24:18 horus genunix: [ID 723222 kern.notice] 
> 000002a10074ba40 genunix:
> indir+a4 (10435638, 1c, 1, 1, ffbeebe4, ffbedbe4)
> May 13 13:24:18 horus genunix: [ID 179002 kern.notice]   %l0-3: 
> 0000000010071af0
>  0000030005109568 0000030005109568 0000000000000000
> May 13 13:24:18 horus   %l4-7: 0000000000010009 00000300050b6ac8 
> 000000000000000
> 0 0000000000000003
> May 13 13:24:18 horus unix: [ID 100000 kern.notice]
> May 13 13:24:18 horus genunix: [ID 672855 kern.notice] syncing file 
> systems...
> May 13 13:24:18 horus genunix: [ID 733762 kern.notice]  7
> May 13 13:24:40 horus last message repeated 20 times
> May 13 13:24:41 horus genunix: [ID 622722 kern.notice]  done (not all 
> i/o comple
> ted)
> May 13 13:24:42 horus genunix: [ID 353387 kern.notice] dumping to 
> /dev/dsk/c0t0d
> 0s3, offset 419561472
> May 13 13:24:57 horus genunix: [ID 409368 kern.notice] ^M100% done: 
> 23114 pages
> dumped, compression ratio 4.99,
> May 13 13:24:57 horus genunix: [ID 851671 kern.notice] dump succeeded
> 
> 
> 
> On Tue, 11 May 2004, Kostas Georgiou wrote:
> 
> >Hi,
> >
> >117000-05 is out so it's probably a good idea to try it, i have it installed
> >but i haven't rebooted since 108528-29 so i can't comment on afs (yet).
> >
> >Kostas
> >
> >On Tue, 11 May 2004, Rhys Morris wrote:
> >
> >> Hi All,
> >>
> >> I recently ran the 8_recommended patch cluster on our E450 and it
> >> upgraded the kernel to Generic_117000-03
> >>
> >> # uname -a
> >> SunOS horus 5.8 Generic_117000-03 sun4u sparc SUNW,Ultra-4
> >>
> >> However, bringing up AFS now crashes the system. This is the
> >>
> >> sun4x_58.tar.gz (14MB)
> >> [MD5: a9f98cc2edc9ea2f2270a2daa044a652]
> >>
> >> I downloaded from openafs.org this morning. Has anyone else seen this
> >> problem, and more importantly, solved it?
> >>
> >> Thanks,
> >>
> >> Rhys