[OpenAFS] vos listaddrs/changeaddr problems with old multihomed host

Sven Oehme oehmes@de.ibm.com
Sun, 12 Dec 2004 10:20:53 +0100


This is a multipart message in MIME format.
--=_alternative 003348F2C1256F68_=
Content-Type: text/plain; charset="US-ASCII"

Hi Jeffrey,

we had the same problem in multihomed environments if we forgot to 
configure the NetRestrict NetInfo files before starting the server. i used 
that procedure several times .. on modern hardware it takes less than 5 
sec to finish. i wouldn't name that really outage and no user every 
complained about it.

May be there are smarter ways to do so, but it fixes the Problem for me 
and it's very quick.

I agree that if the NetInfo NetRestict isn't configured correctly and the 
ip still exists on the Server this wouldn't fix the Problem in long terms, 
but after he fixed his configuration, my procedure would clean his volume 
still exists problem  ...

Sven
-------------------------------------------------------------------------------------------------------------------------
Dept. A141,  TG/SSG EMEA AIS Strategy and Architecture
Development Leader Stonehenge 
IBM intranet ---> http://w3.ais.mainz.de.ibm.com/stonehenge/
internet ---> http://www-5.ibm.com/services/de/its/filestore.html
Phone (+49)-6131-84-3151
Fax      (+49)-6131-84-6708
Mobil   (+49)-171-970-6664
E-Mail : oehmes@de.ibm.com

openafs-info-admin@openafs.org wrote on 12/12/2004 01:16:30:

> On Saturday, December 11, 2004 03:00:16 -0500 Kenneth J Baker 
> <bakerkj@umich.edu> wrote:
> 
> > If I try to remove either of them I get the following error:
> 
> You don't want to do that.  What you're asking the vlserver to do is to 
> remove the entire server, which it can't do because it knows about 
volumes 
> on that server -- and which isn't what you want anyway.
> 
> What you need to do is to get the fileserver to register the correct set 
of 
> addresses.  The fileserver registers its addresses on startup, based on 
the 
> interfaces present and the contents of the NetInfo and NetRestrict 
files.
> 
> If you want the 172.17 addresses to go away, you need to either make 
those 
> interfaces go away (configuring them down may not be enough), or add 
those 
> addresses to the NetRestrict files on the fileservers.
> 
> > I investigated and it appears that the file servers themselves are 
using
> > only the correct interface: Sat Dec 11 02:37:00 2004 Getting 
FileServer
> > name...
> > Sat Dec 11 02:37:00 2004 FileServer host name is 'deedee'
> > Sat Dec 11 02:37:00 2004 Getting FileServer address...
> > Sat Dec 11 02:37:00 2004 FileServer deedee has address 66.92.68.189
> > (0xbd445c42 or 0x425c44bd in host byte order) Sat Dec 11 02:37:00 2004
> > File Server started Sat Dec 11 02:37:00 2004 ** **
> > Sat Dec 11 01:51:19 2004 Getting FileServer name...
> > Sat Dec 11 01:51:19 2004 FileServer host name is 'dharma'
> > Sat Dec 11 01:51:19 2004 Getting FileServer address...
> > Sat Dec 11 01:51:19 2004 FileServer dharma has address 66.205.64.240
> > (0xf040cd42 or 0x42cd40f0 in host byte order) Sat Dec 11 01:51:19 2004
> > File Server started Sat Dec 11 01:51:19 2004
> 
> No; that's not what that means.  The "Fileserver... has address" message 

> tells you what the fileserver's primary address is; it does not indicate 

> that no other addresses have been registered with the VLDB.
> 
> 
> Sven Oehme wrote:
> > just do the following per server :
> >
> > vos delentry -server 172.17.193.x
> > vos syncvldb servername
> > vos syncserv servername
> >
> > now the vos changeaddr 172.17.193.2 -remove -local should work ...
> 
> No, don't do that.  Not only will it result in a service outage during 
the 
> time when you remove every volume on that server from the VLDB and the 
time 
> when vos syncvldb puts them back, it also won't make your problem go 
away. 
> As the output from 'vos listaddrs -printuuid' indicates, each server has 

> both addresses.  Removing and re-adding the volumes won't change the set 
of 
> addresses associated with the server, and won't split the one server 
into 
> two.
> 
> 
> -- Jeffrey T. Hutzelman (N3NHS) <jhutz+@cmu.edu>
>    Sr. Research Systems Programmer
>    School of Computer Science - Research Computing Facility
>    Carnegie Mellon University - Pittsburgh, PA
> 
> 
> 
> 
> _______________________________________________
> OpenAFS-info mailing list
> OpenAFS-info@openafs.org
> https://lists.openafs.org/mailman/listinfo/openafs-info

--=_alternative 003348F2C1256F68_=
Content-Type: text/html; charset="US-ASCII"


<br><font size=2 face="sans-serif">Hi Jeffrey,</font>
<br>
<br><font size=2 face="sans-serif">we had the same problem in multihomed
environments if we forgot to configure the NetRestrict NetInfo files before
starting the server. i used that procedure several times .. on modern hardware
it takes less than 5 sec to finish. i wouldn't name that really outage
and no user every complained about it.</font>
<br>
<br><font size=2 face="sans-serif">May be there are smarter ways to do
so, but it fixes the Problem for me and it's very quick.</font>
<br>
<br><font size=2 face="sans-serif">I agree that if the NetInfo NetRestict
isn't configured correctly and the ip still exists on the Server this wouldn't
fix the Problem in long terms, but after he fixed his configuration, my
procedure would clean his volume still exists problem &nbsp;...</font>
<br>
<br><font size=2 face="sans-serif">Sven<br>
-------------------------------------------------------------------------------------------------------------------------<br>
Dept. A141, &nbsp;TG/SSG EMEA AIS Strategy and Architecture<br>
Development Leader Stonehenge <br>
IBM intranet ---&gt; http://w3.ais.mainz.de.ibm.com/stonehenge/<br>
internet ---&gt; http://www-5.ibm.com/services/de/its/filestore.html<br>
Phone (+49)-6131-84-3151<br>
Fax &nbsp; &nbsp; &nbsp;(+49)-6131-84-6708<br>
Mobil &nbsp; (+49)-171-970-6664<br>
E-Mail : oehmes@de.ibm.com</font>
<br>
<br><font size=2><tt>openafs-info-admin@openafs.org wrote on 12/12/2004
01:16:30:<br>
<br>
&gt; On Saturday, December 11, 2004 03:00:16 -0500 Kenneth J Baker <br>
&gt; &lt;bakerkj@umich.edu&gt; wrote:<br>
&gt; <br>
&gt; &gt; If I try to remove either of them I get the following error:<br>
&gt; <br>
&gt; You don't want to do that. &nbsp;What you're asking the vlserver to
do is to <br>
&gt; remove the entire server, which it can't do because it knows about
volumes <br>
&gt; on that server -- and which isn't what you want anyway.<br>
&gt; <br>
&gt; What you need to do is to get the fileserver to register the correct
set of <br>
&gt; addresses. &nbsp;The fileserver registers its addresses on startup,
based on the <br>
&gt; interfaces present and the contents of the NetInfo and NetRestrict
files.<br>
&gt; <br>
&gt; If you want the 172.17 addresses to go away, you need to either make
those <br>
&gt; interfaces go away (configuring them down may not be enough), or add
those <br>
&gt; addresses to the NetRestrict files on the fileservers.<br>
&gt; <br>
&gt; &gt; I investigated and it appears that the file servers themselves
are using<br>
&gt; &gt; only the correct interface: Sat Dec 11 02:37:00 2004 Getting
FileServer<br>
&gt; &gt; name...<br>
&gt; &gt; Sat Dec 11 02:37:00 2004 FileServer host name is 'deedee'<br>
&gt; &gt; Sat Dec 11 02:37:00 2004 Getting FileServer address...<br>
&gt; &gt; Sat Dec 11 02:37:00 2004 FileServer deedee has address 66.92.68.189<br>
&gt; &gt; (0xbd445c42 or 0x425c44bd in host byte order) Sat Dec 11 02:37:00
2004<br>
&gt; &gt; File Server started Sat Dec 11 02:37:00 2004 ** **<br>
&gt; &gt; Sat Dec 11 01:51:19 2004 Getting FileServer name...<br>
&gt; &gt; Sat Dec 11 01:51:19 2004 FileServer host name is 'dharma'<br>
&gt; &gt; Sat Dec 11 01:51:19 2004 Getting FileServer address...<br>
&gt; &gt; Sat Dec 11 01:51:19 2004 FileServer dharma has address 66.205.64.240<br>
&gt; &gt; (0xf040cd42 or 0x42cd40f0 in host byte order) Sat Dec 11 01:51:19
2004<br>
&gt; &gt; File Server started Sat Dec 11 01:51:19 2004<br>
&gt; <br>
&gt; No; that's not what that means. &nbsp;The &quot;Fileserver... has
address&quot; message <br>
&gt; tells you what the fileserver's primary address is; it does not indicate
<br>
&gt; that no other addresses have been registered with the VLDB.<br>
&gt; <br>
&gt; <br>
&gt; Sven Oehme wrote:<br>
&gt; &gt; just do the following per server :<br>
&gt; &gt;<br>
&gt; &gt; vos delentry -server 172.17.193.x<br>
&gt; &gt; vos syncvldb servername<br>
&gt; &gt; vos syncserv servername<br>
&gt; &gt;<br>
&gt; &gt; now the vos changeaddr 172.17.193.2 -remove -local should work
...<br>
&gt; <br>
&gt; No, don't do that. &nbsp;Not only will it result in a service outage
during the <br>
&gt; time when you remove every volume on that server from the VLDB and
the time <br>
&gt; when vos syncvldb puts them back, it also won't make your problem
go away. <br>
&gt; As the output from 'vos listaddrs -printuuid' indicates, each server
has <br>
&gt; both addresses. &nbsp;Removing and re-adding the volumes won't change
the set of <br>
&gt; addresses associated with the server, and won't split the one server
into <br>
&gt; two.<br>
&gt; <br>
&gt; <br>
&gt; -- Jeffrey T. Hutzelman (N3NHS) &lt;jhutz+@cmu.edu&gt;<br>
&gt; &nbsp; &nbsp;Sr. Research Systems Programmer<br>
&gt; &nbsp; &nbsp;School of Computer Science - Research Computing Facility<br>
&gt; &nbsp; &nbsp;Carnegie Mellon University - Pittsburgh, PA<br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; <br>
&gt; _______________________________________________<br>
&gt; OpenAFS-info mailing list<br>
&gt; OpenAFS-info@openafs.org<br>
&gt; https://lists.openafs.org/mailman/listinfo/openafs-info<br>
</tt></font>
--=_alternative 003348F2C1256F68_=--