[OpenAFS-port-darwin] OpenAFS 1.6.2 on OS X 10.8: suspected deadlock

Duncan S Kincaid dsk@MIT.EDU
Tue, 9 Apr 2013 10:55:38 +0000


viel Dank stephan und derrick!
if you've an installer you'd like me to test, i'd be delighted.

ciao
dk


On Apr 9, 2013, at 4:02 AM, Stephan Wiesand <Stephan.Wiesand@desy.de>
 wrote:

> 9747 was merged.
>=20
> On Apr 8, 2013, at 22:27 , Derrick Brashear <shadow@gmail.com> wrote:
>=20
>> Indeed. Sorry. 1.6 version is 9747. Stephan, this is a 1.6.2
>> regression and only applies
>> to MacOS, so it should be noncontroversial.
>>=20
>> On Mon, Apr 8, 2013 at 4:21 PM, Duncan S Kincaid <dsk@mit.edu> wrote:
>>> this problem was evident in OpenAFS Client 1.6.1 but resolved in OpenAF=
S Client 1.6.1a.
>>> It seems to have resurfaced under 1.6.2.
>>>=20
>>> Issue with 1.6.1 was identified by Derrick and fixed:
>>> "the good: you're out of Rx packets.
>>> the bad: not sure where they all are yet!"
>>> and
>>> "never mind. found it. gerrit 7788."
>>>=20
>>> with thanks
>>> dk
>>>=20
>>> =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D
>>> Behaviour: All clients with AFS home directories see 20 minute delay be=
fore
>>> files accessible. Then suddenly all files available and all proceeds no=
rmally.
>>>=20
>>> For reference, please find report filed July 14 2012 re 1.6.1 client be=
low:
>>> =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D
>>> 1. spinning beach ball for *exactly* 20 minutes, then session returns t=
o normal
>>> 2. can confirm user gets tokens and TGT at login (using LoginHook and L=
aunchAgent scripts)
>>> 3. ssh connections as root into the beach balling mac reveal following:
>>>       a. no CPU load. All processes 'stuck' or 'sleeping' (apart from '=
top', of course)
>>>       b. virtually no system.log file entires from time of beach ballin=
g. at 20 minute mark lots of 'time out' errors logged.
>>>       c. attempts to traverse afs directory tree results in shell locki=
ng up completely
>>> 4. tcpdumps show very little afs traffic**. mostly pings (rx ack) to se=
rvers/databases at rate of 2-5/sec. at the 20 minute mark, a barrage of tra=
ffic (rx data). (500-4000/sec)****.
>>> 5. local users (those whose home directories reside on computer hard dr=
ive) have never seen this problem
>>> 6. this problem occurs irrespective of particular user with home direct=
ory in AFS, AFS server storing his home directory, AFS cell,  and client co=
mputer OS.
>>> 7. 'Action At A Distance': when the beach ball spins on Macintosh, moun=
ting the same user's AFS home from another computer (Linux box) takes 2 min=
utes as opposed to customary 1-2 seconds. subsequent Linux logins are norma=
l.
>>>=20
>>>=20
>>> |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||=
||||||||||
>>> duncan kincaid
>>> cron | mit school of architecture and planning
>>>=20
>>>=20
>>>=20
>>>=20
>>> _______________________________________________
>>> port-darwin mailing list
>>> port-darwin@openafs.org
>>> https://lists.openafs.org/mailman/listinfo/port-darwin
>=20
> --=20
> Stephan Wiesand
> DESY - DV -
> Platanenallee 6
> 15732 Zeuthen, Germany
>=20


|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||=
||||||
duncan kincaid
cron | mit school of architecture and planning