[OpenAFS] openafs 1.6.0pre4 and OSX 10.6.7 and 64bit kernel (NOT really) FIXED ;(

Andy Cobaugh phalenor@gmail.com
Tue, 29 Mar 2011 15:22:55 -0400 (EDT)


On 2011-03-29 at 20:10, Chris Jones ( christopher.rob.jones@cern.ch ) said:
> Hi,
>
>> Chris-Jones-Macbook-Pro /Library/OpenAFS/Tools/bin > ./cmdebug localhost
>> Lock afs_discon_lock status: (none_waiting, 1 read_locks(pid:1133))
>> ** Cache entry @ 0xd35161a0 for 0.1.16777996.1 [dynroot]
>>    locks: (none_waiting, write_locked(pid:1133 at:599))
>>              18 bytes  DV            1  refcnt     0
>>    callback 00000000	expires 0
>>    0 opens	0 writers
>>    mount point
>>    states (0x5), stat'd, read-only
>
> and a slightly different one later on (whilst waiting to just cd into a directory under /afs/cern.ch)
>
> Chris-Jones-Macbook-Pro /Library/OpenAFS/Tools/bin > ./cmdebug localhost
> Lock afs_discon_lock status: (none_waiting, 1 read_locks(pid:1156))
> ** Cache entry @ 0xd35184b0 for 382.537112396.26.32 [cern.ch]
>    locks: (none_waiting, write_locked(pid:1156 at:66))
>               7 bytes  DV            1  refcnt     0
>    callback 263a6708	expires 1301440202
>    0 opens	0 writers
>    normal file
>    states (0x1), stat'd

fwiw, we started seeing this on Leopard as early as 1.5.77. I just now saw 
this on 1.6.0pre4 on Snow Leopard with the 32-bit kernel. It's also 
happened with 1.6.0pre2 on Leopard.

Sometimes it hangs for only a few minutes. Other times, it will hang for 
hours until someone reboots.

cmdebug always reports 1 or more read locks on afs_discon_lock, with a 
random pid.

--andy