[OpenAFS] AFS Fileserver Won't Start

Karl M. Davis karl@ridgetop-group.com
Wed, 3 Oct 2007 19:40:21 -0700


This is a multipart message in MIME format.

------=_NextPart_000_0081_01C805F5.3C9C6CF0
Content-Type: text/plain;
	charset="us-ascii"
Content-Transfer-Encoding: 7bit

Hello all,

 

I'm having a pretty crappy day right now, hope someone can help me out.

 

What started this is my attempt to move our OpenAFS server from a VM to a
dedicated physical box.  I'm running Ubuntu/Debian and using their openafs
packages on both machines.

 

Somewhere towards the end of moving the volumes from the old server to the
new server, things got badly goofed.  The fs process will no longer start on
the new server and I find the following entry in the
/var/log/openafs/FileLog file:

<< 

Wed Oct  3 19:26:59 2007 File server starting

Wed Oct  3 19:26:59 2007 afs_krb_get_lrealm failed, using
ridgetop-group.local.

Wed Oct  3 19:26:59 2007 VL_RegisterAddrs rpc failed; The IP address exists
on a different server; repair it

Wed Oct  3 19:26:59 2007 VL_RegisterAddrs rpc failed; See VLLog for details

Wed Oct  3 19:26:59 2007 Fatal error in library initialization, exiting!!

>> 

 

Unfortunately, there's nothing helpful in VLLog.  Interestingly, "vos
listaddrs" returns nothing on the new server, either.

 

Running "vos listvldb" returns the following:

<< 

VLDB entries for all servers

 

lib

    RWrite: 536870933

    number of sites -> 1

       server picacho.ridgetop-group.local partition /vicepa RW Site

 

lib.pdks

    RWrite: 536870936

    number of sites -> 1

       server picacho.ridgetop-group.local partition /vicepa RW Site

 

root.afs

    RWrite: 536870915     ROnly: 536870916

    number of sites -> 3

       server picacho.ridgetop-group.local partition /vicepa RW Site

       server picacho.ridgetop-group.local partition /vicepa RO Site

       server picacho.ridgetop-group.local partition /vicepa RO Site

 

root.cell

    RWrite: 536870918     ROnly: 536870919

    number of sites -> 3

       server picacho.ridgetop-group.local partition /vicepa RW Site

       server picacho.ridgetop-group.local partition /vicepa RO Site

       server picacho.ridgetop-group.local partition /vicepa RO Site

 

Total entries: 4

>> 

 

I'm unsure why there are duplicate RO entries, but the last thing I was
working on was recreating RO volumes for root.cell and root.afs on the new
server.  

 

I'm panicking because all of the volumes are now on the new server and
non-accessible.  Anyone have some clue what I did wrong and how I can fix
things?

 

Thanks!

Karl

 

 


------=_NextPart_000_0081_01C805F5.3C9C6CF0
Content-Type: text/html;
	charset="us-ascii"
Content-Transfer-Encoding: quoted-printable

<html xmlns:v=3D"urn:schemas-microsoft-com:vml" =
xmlns:o=3D"urn:schemas-microsoft-com:office:office" =
xmlns:w=3D"urn:schemas-microsoft-com:office:word" =
xmlns:m=3D"http://schemas.microsoft.com/office/2004/12/omml" =
xmlns=3D"http://www.w3.org/TR/REC-html40">

<head>
<meta http-equiv=3DContent-Type content=3D"text/html; =
charset=3Dus-ascii">
<meta name=3DGenerator content=3D"Microsoft Word 12 (filtered medium)">
<style>
<!--
 /* Font Definitions */
 @font-face
	{font-family:"Cambria Math";
	panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
	{font-family:Calibri;
	panose-1:2 15 5 2 2 2 4 3 2 4;}
 /* Style Definitions */
 p.MsoNormal, li.MsoNormal, div.MsoNormal
	{margin:0in;
	margin-bottom:.0001pt;
	font-size:11.0pt;
	font-family:"Calibri","sans-serif";}
a:link, span.MsoHyperlink
	{mso-style-priority:99;
	color:blue;
	text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
	{mso-style-priority:99;
	color:purple;
	text-decoration:underline;}
span.EmailStyle17
	{mso-style-type:personal-compose;
	font-family:"Calibri","sans-serif";
	color:windowtext;}
.MsoChpDefault
	{mso-style-type:export-only;}
@page Section1
	{size:8.5in 11.0in;
	margin:1.0in 1.0in 1.0in 1.0in;}
div.Section1
	{page:Section1;}
-->
</style>
<!--[if gte mso 9]><xml>
 <o:shapedefaults v:ext=3D"edit" spidmax=3D"1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
 <o:shapelayout v:ext=3D"edit">
  <o:idmap v:ext=3D"edit" data=3D"1" />
 </o:shapelayout></xml><![endif]-->
</head>

<body lang=3DEN-US link=3Dblue vlink=3Dpurple>

<div class=3DSection1>

<p class=3DMsoNormal>Hello all,<o:p></o:p></p>

<p class=3DMsoNormal><o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal>I&#8217;m having a pretty crappy day right now, =
hope someone
can help me out.<o:p></o:p></p>

<p class=3DMsoNormal><o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal>What started this is my attempt to move our OpenAFS =
server
from a VM to a dedicated physical box.&nbsp; I&#8217;m running =
Ubuntu/Debian
and using their openafs packages on both machines.<o:p></o:p></p>

<p class=3DMsoNormal><o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal>Somewhere towards the end of moving the volumes =
from the old
server to the new server, things got badly goofed.&nbsp; The fs process =
will no
longer start on the new server and I find the following entry in the
/var/log/openafs/FileLog file:<o:p></o:p></p>

<p class=3DMsoNormal>&lt;&lt;<o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier New"'>Wed =
Oct&nbsp; 3
19:26:59 2007 File server starting<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier New"'>Wed =
Oct&nbsp; 3
19:26:59 2007 afs_krb_get_lrealm failed, using =
ridgetop-group.local.<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier New"'>Wed =
Oct&nbsp; 3
19:26:59 2007 VL_RegisterAddrs rpc failed; The IP address exists on a =
different
server; repair it<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier New"'>Wed =
Oct&nbsp; 3
19:26:59 2007 VL_RegisterAddrs rpc failed; See VLLog for =
details<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier New"'>Wed =
Oct&nbsp; 3
19:26:59 2007 Fatal error in library initialization, =
exiting!!<o:p></o:p></span></p>

<p class=3DMsoNormal>&gt;&gt;<o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal><o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal>Unfortunately, there&#8217;s nothing helpful in =
VLLog.&nbsp;
Interestingly, &#8220;vos listaddrs&#8221; returns nothing on the new =
server,
either.<o:p></o:p></p>

<p class=3DMsoNormal><o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal>Running &#8220;vos listvldb&#8221; returns the =
following:<o:p></o:p></p>

<p class=3DMsoNormal>&lt;&lt;<o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier New"'>VLDB =
entries for all
servers<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'><o:p>&nbsp;</o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>lib<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;
RWrite: 536870933<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;
number of sites -&gt; 1<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
server picacho.ridgetop-group.local partition /vicepa RW =
Site<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'><o:p>&nbsp;</o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>lib.pdks<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;
RWrite: 536870936<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;
number of sites -&gt; 1<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
server picacho.ridgetop-group.local partition /vicepa RW =
Site<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'><o:p>&nbsp;</o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>root.afs<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;
RWrite: 536870915&nbsp;&nbsp;&nbsp;&nbsp; ROnly: =
536870916<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;
number of sites -&gt; 3<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
server picacho.ridgetop-group.local partition /vicepa RW =
Site<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
server picacho.ridgetop-group.local partition /vicepa RO =
Site<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
server picacho.ridgetop-group.local partition /vicepa RO =
Site<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'><o:p>&nbsp;</o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>root.cell<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;
RWrite: 536870918&nbsp;&nbsp;&nbsp;&nbsp; ROnly: =
536870919<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;
number of sites -&gt; 3<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
server picacho.ridgetop-group.local partition /vicepa RW =
Site<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
server picacho.ridgetop-group.local partition /vicepa RO =
Site<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
server picacho.ridgetop-group.local partition /vicepa RO =
Site<o:p></o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'><o:p>&nbsp;</o:p></span></p>

<p class=3DMsoNormal><span style=3D'font-family:"Courier New"'>Total =
entries: 4<o:p></o:p></span></p>

<p class=3DMsoNormal>&gt;&gt;<o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal><o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal>I&#8217;m unsure why there are duplicate RO =
entries, but the
last thing I was working on was recreating RO volumes for root.cell and
root.afs on the new server.&nbsp; <o:p></o:p></p>

<p class=3DMsoNormal><o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal>I&#8217;m panicking because all of the volumes are =
now on
the new server and non-accessible.&nbsp; Anyone have some clue what I =
did wrong
and how I can fix things?<o:p></o:p></p>

<p class=3DMsoNormal><o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal>Thanks!<o:p></o:p></p>

<p class=3DMsoNormal>Karl<o:p></o:p></p>

<p class=3DMsoNormal><o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal><o:p>&nbsp;</o:p></p>

</div>

</body>

</html>

------=_NextPart_000_0081_01C805F5.3C9C6CF0--