[OpenAFS] Re: Debugging opportunity (time-sensitive)

Jeff Blaine jblaine@kickflop.net
Tue, 07 Jun 2011 21:23:15 -0400


I was unable to get a shell this time, but tonight
we experienced what I believe to be the same exact
thing (total /afs wedge for all processes) on
a different Solaris 10 SPARC host with 272 day
uptime.

[ for the record ]

On 5/18/2011 3:59 PM, Jeff Blaine wrote:
> On 5/18/2011 3:03 PM, Andrew Deason wrote:
>> On Wed, 18 May 2011 13:51:06 -0400
>> Jeff Blaine<jblaine@kickflop.net> wrote:
>>
>>> 0 -> afs_osi_Sleep
>>> 0 | afs_osi_Sleep:entry event 705ac1bc = 1023, 1,
>>> 1, 1, 0, 0, 0, 2062683024, 2062683824, 0, 2062684288
>>
>> This is looking a little weird, but I'm not really used to looking at a
>> lock structure like this. Are you running a 32-bit kernel module?
>
> bash-3.00# file /kernel/fs/sparcv9/afs
> /kernel/fs/sparcv9/afs: ELF 64-bit MSB relocatable SPARCV9 Version 1
> bash-3.00#
>
>> If you run that again, do these values change?
>
> I ran it once just after receiving this email, and yes,
> it did "more stuff" then hung with a similar line.
>
> Now when I run it over and over, the trace shows the same
> ~25 lines as reported above, and hangs there as well.
> The values shown for afs_osi_Sleep:entry do not change.