[OpenAFS-devel] diagnosing my problem with ubik elections... bug in ubik

Neulinger, Nathan nneul@umr.edu
Tue, 3 Apr 2001 12:07:36 -0500


Yeah, I've been waiting long enough... learned that much about the protocol
already, head about to explode from it too...

I've let it sit overnight in a couple cases, it's just looping forever. I've
about got it tracked down, has taken me a while to get enough debugging
added to ubik stuff to where I can understand exactly how it works.

-- Nathan

> -----Original Message-----
> From: Ken Hornstein [mailto:kenh@cmf.nrl.navy.mil]
> Sent: Tuesday, April 03, 2001 12:03 PM
> To: Neulinger, Nathan
> Cc: 'openafs-devel@openafs.org'
> Subject: Re: [OpenAFS-devel] diagnosing my problem with ubik
> elections... bug in ubik 
> 
> 
> >Once I changed that, the lowestHost calculation is looking 
> much better.
> >Still not syncing up cause no one is ever sending a yes 
> vote, but I'm still
> >looking at that. 
> 
> Just FYI: as part of the protocol, no one can send a "yes" 
> vote for BIG
> seconds after startup (I think "BIG" is something like 90, but I don't
> remember).  If you're restarting it before that timer elapses, then
> that might be part of the problem.
> 
> I have a document which describes the basic Ubik protocol which IMHO
> is essential for debugging these sorts of things; Derrick, 
> maybe it should
> be added to the base distribution?  (If it isn't already).
> 
> --Ken
>