[OpenAFS] AFS Fileserver Won't Start
Karl M. Davis
karl@ridgetop-group.com
Wed, 3 Oct 2007 19:40:21 -0700
This is a multipart message in MIME format.
------=_NextPart_000_0081_01C805F5.3C9C6CF0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: 7bit
Hello all,
I'm having a pretty crappy day right now, hope someone can help me out.
What started this is my attempt to move our OpenAFS server from a VM to a
dedicated physical box. I'm running Ubuntu/Debian and using their openafs
packages on both machines.
Somewhere towards the end of moving the volumes from the old server to the
new server, things got badly goofed. The fs process will no longer start on
the new server and I find the following entry in the
/var/log/openafs/FileLog file:
<<
Wed Oct 3 19:26:59 2007 File server starting
Wed Oct 3 19:26:59 2007 afs_krb_get_lrealm failed, using
ridgetop-group.local.
Wed Oct 3 19:26:59 2007 VL_RegisterAddrs rpc failed; The IP address exists
on a different server; repair it
Wed Oct 3 19:26:59 2007 VL_RegisterAddrs rpc failed; See VLLog for details
Wed Oct 3 19:26:59 2007 Fatal error in library initialization, exiting!!
>>
Unfortunately, there's nothing helpful in VLLog. Interestingly, "vos
listaddrs" returns nothing on the new server, either.
Running "vos listvldb" returns the following:
<<
VLDB entries for all servers
lib
RWrite: 536870933
number of sites -> 1
server picacho.ridgetop-group.local partition /vicepa RW Site
lib.pdks
RWrite: 536870936
number of sites -> 1
server picacho.ridgetop-group.local partition /vicepa RW Site
root.afs
RWrite: 536870915 ROnly: 536870916
number of sites -> 3
server picacho.ridgetop-group.local partition /vicepa RW Site
server picacho.ridgetop-group.local partition /vicepa RO Site
server picacho.ridgetop-group.local partition /vicepa RO Site
root.cell
RWrite: 536870918 ROnly: 536870919
number of sites -> 3
server picacho.ridgetop-group.local partition /vicepa RW Site
server picacho.ridgetop-group.local partition /vicepa RO Site
server picacho.ridgetop-group.local partition /vicepa RO Site
Total entries: 4
>>
I'm unsure why there are duplicate RO entries, but the last thing I was
working on was recreating RO volumes for root.cell and root.afs on the new
server.
I'm panicking because all of the volumes are now on the new server and
non-accessible. Anyone have some clue what I did wrong and how I can fix
things?
Thanks!
Karl
------=_NextPart_000_0081_01C805F5.3C9C6CF0
Content-Type: text/html;
charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
<html xmlns:v=3D"urn:schemas-microsoft-com:vml" =
xmlns:o=3D"urn:schemas-microsoft-com:office:office" =
xmlns:w=3D"urn:schemas-microsoft-com:office:word" =
xmlns:m=3D"http://schemas.microsoft.com/office/2004/12/omml" =
xmlns=3D"http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv=3DContent-Type content=3D"text/html; =
charset=3Dus-ascii">
<meta name=3DGenerator content=3D"Microsoft Word 12 (filtered medium)">
<style>
<!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri","sans-serif";}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
span.EmailStyle17
{mso-style-type:personal-compose;
font-family:"Calibri","sans-serif";
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;}
@page Section1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.Section1
{page:Section1;}
-->
</style>
<!--[if gte mso 9]><xml>
<o:shapedefaults v:ext=3D"edit" spidmax=3D"1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext=3D"edit">
<o:idmap v:ext=3D"edit" data=3D"1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang=3DEN-US link=3Dblue vlink=3Dpurple>
<div class=3DSection1>
<p class=3DMsoNormal>Hello all,<o:p></o:p></p>
<p class=3DMsoNormal><o:p> </o:p></p>
<p class=3DMsoNormal>I’m having a pretty crappy day right now, =
hope someone
can help me out.<o:p></o:p></p>
<p class=3DMsoNormal><o:p> </o:p></p>
<p class=3DMsoNormal>What started this is my attempt to move our OpenAFS =
server
from a VM to a dedicated physical box. I’m running =
Ubuntu/Debian
and using their openafs packages on both machines.<o:p></o:p></p>
<p class=3DMsoNormal><o:p> </o:p></p>
<p class=3DMsoNormal>Somewhere towards the end of moving the volumes =
from the old
server to the new server, things got badly goofed. The fs process =
will no
longer start on the new server and I find the following entry in the
/var/log/openafs/FileLog file:<o:p></o:p></p>
<p class=3DMsoNormal><<<o:p> </o:p></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier New"'>Wed =
Oct 3
19:26:59 2007 File server starting<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier New"'>Wed =
Oct 3
19:26:59 2007 afs_krb_get_lrealm failed, using =
ridgetop-group.local.<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier New"'>Wed =
Oct 3
19:26:59 2007 VL_RegisterAddrs rpc failed; The IP address exists on a =
different
server; repair it<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier New"'>Wed =
Oct 3
19:26:59 2007 VL_RegisterAddrs rpc failed; See VLLog for =
details<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier New"'>Wed =
Oct 3
19:26:59 2007 Fatal error in library initialization, =
exiting!!<o:p></o:p></span></p>
<p class=3DMsoNormal>>><o:p> </o:p></p>
<p class=3DMsoNormal><o:p> </o:p></p>
<p class=3DMsoNormal>Unfortunately, there’s nothing helpful in =
VLLog.
Interestingly, “vos listaddrs” returns nothing on the new =
server,
either.<o:p></o:p></p>
<p class=3DMsoNormal><o:p> </o:p></p>
<p class=3DMsoNormal>Running “vos listvldb” returns the =
following:<o:p></o:p></p>
<p class=3DMsoNormal><<<o:p> </o:p></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier New"'>VLDB =
entries for all
servers<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'><o:p> </o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>lib<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
RWrite: 536870933<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
number of sites -> 1<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
server picacho.ridgetop-group.local partition /vicepa RW =
Site<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'><o:p> </o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>lib.pdks<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
RWrite: 536870936<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
number of sites -> 1<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
server picacho.ridgetop-group.local partition /vicepa RW =
Site<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'><o:p> </o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>root.afs<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
RWrite: 536870915 ROnly: =
536870916<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
number of sites -> 3<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
server picacho.ridgetop-group.local partition /vicepa RW =
Site<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
server picacho.ridgetop-group.local partition /vicepa RO =
Site<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
server picacho.ridgetop-group.local partition /vicepa RO =
Site<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'><o:p> </o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>root.cell<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
RWrite: 536870918 ROnly: =
536870919<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
number of sites -> 3<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
server picacho.ridgetop-group.local partition /vicepa RW =
Site<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
server picacho.ridgetop-group.local partition /vicepa RO =
Site<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'>
server picacho.ridgetop-group.local partition /vicepa RO =
Site<o:p></o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier =
New"'><o:p> </o:p></span></p>
<p class=3DMsoNormal><span style=3D'font-family:"Courier New"'>Total =
entries: 4<o:p></o:p></span></p>
<p class=3DMsoNormal>>><o:p> </o:p></p>
<p class=3DMsoNormal><o:p> </o:p></p>
<p class=3DMsoNormal>I’m unsure why there are duplicate RO =
entries, but the
last thing I was working on was recreating RO volumes for root.cell and
root.afs on the new server. <o:p></o:p></p>
<p class=3DMsoNormal><o:p> </o:p></p>
<p class=3DMsoNormal>I’m panicking because all of the volumes are =
now on
the new server and non-accessible. Anyone have some clue what I =
did wrong
and how I can fix things?<o:p></o:p></p>
<p class=3DMsoNormal><o:p> </o:p></p>
<p class=3DMsoNormal>Thanks!<o:p></o:p></p>
<p class=3DMsoNormal>Karl<o:p></o:p></p>
<p class=3DMsoNormal><o:p> </o:p></p>
<p class=3DMsoNormal><o:p> </o:p></p>
</div>
</body>
</html>
------=_NextPart_000_0081_01C805F5.3C9C6CF0--