[OpenAFS] "stillborn client" in src/viced/host.c

Derrick J Brashear shadow@dementia.org
Tue, 17 Oct 2006 17:47:40 -0400 (EDT)


On Tue, 17 Oct 2006, Bill Stivers wrote:

> Hey all:
>
> Thanks much for the tremendous help you've provided to me and my cohorts.
>
> I have another lame question.  I've been speaking to Joe about error 
> information he's seeing on our AFS servers, and he noted one particularly odd 
> one:
>
> Tue Oct 17 13:15:59 2006 FindClient: stillborn client b3f528(b17753cc);
> conn b5b348 (host 128.114.104.230:7001) had client b4c030(b17753cc)
> Mon Oct 16 19:27:28 2006 FindClient: stillborn client b3f7c8(2b1cc290);
> conn b450d0 (host 128.114.30.230:7001) had client 6d52f0(2b1cc290)
>
> I looked at the code, and found the lines that are generating the message in 
> src/viced/host.c, which are as follows:
>
> Can someone who knows the codebase well shed some light as to what's going 
> on?  Is this another one of those: "You have OpenAFS in part of your 
> infrastructure and TransARC in part of it" issues?  is this, perhaps, part of 
> the locking code?

We can remove that. It was a potential race we have cleaned up, something 
in the logs so we'd know it happened if we needed to debug something.

> I'm trying to do due diligence to make sure my clients aren't partially to 
> blame for some of the things that our server administrators are fixing now, 
> and this is part of that effort.

Well, it's more likely to happen with old windows clients, but, it can 
happen with any client, depending on circumstance.