[OpenAFS] weird problem... can't add a third DB server for pt server
Neulinger, Nathan R.
nneul@umr.edu
Fri, 16 Mar 2001 10:55:08 -0600
I tried that, and it didn't help.
----
All three ptservers shut down
All three sets of prdb.DB0/DBSYS1 are identical, verified with sum
All three CellServDB's are identical.
Did startup on all three ptservers
All three servers see the other ubik servers:
Mar 16 10:46:31 srvtst02 ptserver[3484]: ubik:server 131.151.1.229 is back up: will be contacted through 131.151.1.229
Mar 16 10:46:31 srvtst02 ptserver[3484]: ubik:server 131.151.1.228 is back up: will be contacted through 131.151.1.228
Waited 5 minutes.
Note: if I bring up JUST srvtst02 and srvtst03, with only those two in the
CellServDB, it works just fine. It syncs up in about 20-30 seconds.
All three say pretty much the same thing:
troot-srvtst04(215)> udebug srvtst02 7002 -long
Host's addresses are: 131.151.1.227
Host's 131.151.1.227 time is Fri Mar 16 10:53:46 2001
Local time is Fri Mar 16 10:53:46 2001 (time differential 0 secs)
Last yes vote for 131.151.1.227 was 4 secs ago (not sync site);
Last vote started 4 secs ago (at Fri Mar 16 10:53:42 2001)
Local db version is 984753747.5
I am not sync site
Lowest host 131.151.1.227 was set 1 secs ago
Sync host 0.0.0.0 was set 984761626 secs ago
Sync site's db version is 984753747.5
0 locked pages, 0 of them for write

Server (131.151.1.229): (db 0.0)
    last vote rcvd 4 secs ago (at Fri Mar 16 10:53:42 2001),
    last beacon sent 4 secs ago (at Fri Mar 16 10:53:42 2001), last vote was no
    dbcurrent=0, up=1 beaconSince=1

Server (131.151.1.228): (db 0.0)
    last vote rcvd 4 secs ago (at Fri Mar 16 10:53:42 2001),
    last beacon sent 4 secs ago (at Fri Mar 16 10:53:42 2001), last vote was no
    dbcurrent=0, up=1 beaconSince=1
---
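
When comparing several of these dumps by hand, a small script can pull out the two fields that matter here: whether the host considers itself the sync site, and what each peer's last vote was. The following is only a sketch. It assumes the text layout shown in the output above, and that a sync site reports a line beginning "I am sync site". The names SAMPLE and summarize are made up for illustration and are not part of OpenAFS.

```python
import re

# Excerpt of the udebug -long output shown above, pasted as plain text.
SAMPLE = """\
I am not sync site
Sync host 0.0.0.0 was set 984761626 secs ago
Server (131.151.1.229): (db 0.0)
last vote rcvd 4 secs ago (at Fri Mar 16 10:53:42 2001),
last beacon sent 4 secs ago (at Fri Mar 16 10:53:42 2001), last vote was
no
dbcurrent=0, up=1 beaconSince=1
Server (131.151.1.228): (db 0.0)
last vote rcvd 4 secs ago (at Fri Mar 16 10:53:42 2001),
last beacon sent 4 secs ago (at Fri Mar 16 10:53:42 2001), last vote was
no
dbcurrent=0, up=1 beaconSince=1
"""


def summarize(text):
    """Return (is_sync_site, {peer_ip: last_vote}) for one udebug dump.

    Hypothetical helper: it only pattern-matches the text format seen in
    this thread and does not talk to any server.
    """
    # Assumption: a sync site prints a line starting with "I am sync site".
    is_sync = re.search(r"^I am sync site", text, re.MULTILINE) is not None

    # Splitting on the "Server (ip):" headers with one capture group gives
    # [preamble, ip1, body1, ip2, body2, ...].
    parts = re.split(r"^Server \(([\d.]+)\):", text, flags=re.MULTILINE)

    votes = {}
    for ip, body in zip(parts[1::2], parts[2::2]):
        # \s+ tolerates the mail client wrapping "no" onto its own line.
        match = re.search(r"last vote was\s+(yes|no)", body)
        votes[ip] = match.group(1) if match else "unknown"
    return is_sync, votes


if __name__ == "__main__":
    sync, votes = summarize(SAMPLE)
    print("sync site:", sync)
    for ip, vote in votes.items():
        print(ip, "last vote:", vote)
```

Running it over the dump from each of the three servers makes it easy to see at a glance that nobody claims to be the sync site and every peer's last vote is "no".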
-- Nathan
> -----Original Message-----
> From: Mindy Kubis [mailto:mindyk@us.ibm.com]
> Sent: Friday, March 16, 2001 10:34 AM
> To: Neulinger, Nathan R.
> Cc: 'Erik J Burckart'; 'openafs-info@openafs.org'
> Subject: RE: [OpenAFS] weird problem... can't add a third DB server for pt server
>
>
> Hi Nathan -
>
> What this looks like is that when you added server C to the CellServDB's,
> the ptserver instances on all 3 machines were not restarted.
>
> My suggestion is to restart the ptserver process on all 3 database machines.
>
> Thanks,
>
> Mindy
>
>
>
>
>
> "Neulinger, Nathan R." <nneul@umr.edu>@openafs.org on 03/16/2001 10:58:57 AM
>
> Sent by: openafs-info-admin@openafs.org
>
>
> To: "'Erik J Burckart'" <erik@burckart.org>
> cc: "'openafs-info@openafs.org'" <openafs-info@openafs.org>
> Subject: RE: [OpenAFS] weird problem... can't add a third DB server for pt server
>
>
>
> This is what I get on C after adding C to CellServDB and starting ptserver
> on it with no prdb* files in /usr/afs/db.
>
> ------------------------
> troot-srvtst04(136)> udebug srvtst04 7002 -long
> Host's addresses are: 131.151.1.229
> Host's 131.151.1.229 time is Fri Mar 16 09:55:35 2001
> Local time is Fri Mar 16 09:55:36 2001 (time differential 1 secs)
> Last yes vote not cast yet
> Local db version is 1.1
> I am not sync site
> Lowest host 131.151.1.229 was set 0 secs ago
> Sync host 0.0.0.0 was set 984758135 secs ago
> Sync site's db version is 0.0
> 0 locked pages, 0 of them for write
>
> Server (131.151.1.228): (db 0.0)
>     last vote rcvd 0 secs ago (at Fri Mar 16 09:55:36 2001),
>     last beacon sent 0 secs ago (at Fri Mar 16 09:55:36 2001), last vote was no
>     dbcurrent=0, up=1 beaconSince=1
>
> Server (131.151.1.227): (db 0.0)
>     last vote rcvd 0 secs ago (at Fri Mar 16 09:55:36 2001),
>     last beacon sent 0 secs ago (at Fri Mar 16 09:55:36 2001), last vote was no
>     dbcurrent=0, up=1 beaconSince=1
> ---------------------------
>
> One thing that is very interesting is that on all three servers the
> "Lowest host ... was set" line has the wrong IP. It has the local host's IP,
> which is a little odd.
>
> -- Nathan
>
> > -----Original Message-----
> > From: Erik J Burckart [mailto:erik@burckart.org]
> > Sent: Friday, March 16, 2001 9:51 AM
> > To: Neulinger, Nathan R.
> > Cc: 'openafs-info@openafs.org'
> > Subject: Re: [OpenAFS] weird problem... can't add a third DB server for ptserver
> >
> >
> > Sounds right to me. What do the outputs from `udebug x.x.x.227 7002 -long`
> > and `udebug x.x.x.229 7002 -long` suggest?
> >
> > -E
> >
> >
> > "Neulinger, Nathan R." wrote:
> >
> > > For the life of me, I can't figure out what is going on... I have three
> > > servers with sequential IP addresses (x.x.x.227, 228, 229). I can get the DB
> > > server running fine on the first server by itself. I can add the second
> > > server, let it run for a bit, and it happily syncs up and sets 227 as the
> > > sync site. However, no matter what I try, I can't get the third server to
> > > join in.
> > >
> > > I followed this:
> > > A+B in all /usr/afs/etc/CellServDB
> > > A+B running ptserver happily, sync site is A
> > > Add C to all /usr/afs/etc/CellServDB
> > > Run ptserver C (either with an existing copy of prdb* or nothing in
> > > /usr/afs/db/)
> > > Wait.
> > >
> > > It never sees a syncsite. If I then go and restart ptserver on A, NONE of
> > > the servers will ever elect a syncsite.
> > >
> > > Connectivity between the servers appears fine (they should all be on the
> > > same switch I think). Everything else on the servers is happy, including
> > > running the VLDB on all three servers.
> > >
> > > Is there something obvious I am missing or not doing?
> > >
> > > -- Nathan
> > >
> > > ------------------------------------------------------------
> > > Nathan Neulinger EMail: nneul@umr.edu
> > > University of Missouri - Rolla Phone: (573) 341-4841
> > > Computing Services Fax: (573) 341-4216
> > > _______________________________________________
> > > OpenAFS-info mailing list
> > > OpenAFS-info@openafs.org
> > > https://lists.openafs.org/mailman/listinfo.cgi/openafs-info
> >