[OpenAFS] Problems with power outages

Karl M. Davis karl@ridgetop-group.com
Wed, 15 Aug 2007 13:11:02 -0700


This is a multipart message in MIME format.

------=_NextPart_000_00CA_01C7DF3D.BB212860
Content-Type: text/plain;
	charset="us-ascii"
Content-Transfer-Encoding: 7bit

Hey there all,

 

I just recently set up the Debian openafs 1.4.4 packages on an Ubuntu server
box, running in a virtual machine.  It's monsoon season here in Tucson and
we've had a couple of long power outages and problems with the UPS.  Both
times the server has gone done unexpectedly, AFS didn't come back up
correctly.  The symptoms I note are that "ls /afs" returns empty on the
server and the Windows client can't connect.

 

For whatever reason, the thing that has fixed it both times is running "fs
checkvolumes".  Of course, "fs checkvolumes" segfaults when I run it, but if
I reboot after that, everything comes back up fine, clients can connect, and
further "fs checkvolumes" don't segfault.  Rebooting before running that
specific command (with the segfault) does nothing-"ls /afs" still returns
empty.

 

So. a couple of questions:

How do I ensure AFS can survive a power outage/unexpected poweroff without
getting borked?

If it does get borked, why would a segfaulting "fs checkvolumes" fix things?

 

Thanks much!

Karl M. Davis


------=_NextPart_000_00CA_01C7DF3D.BB212860
Content-Type: text/html;
	charset="us-ascii"
Content-Transfer-Encoding: quoted-printable

<html xmlns:v=3D"urn:schemas-microsoft-com:vml" =
xmlns:o=3D"urn:schemas-microsoft-com:office:office" =
xmlns:w=3D"urn:schemas-microsoft-com:office:word" =
xmlns:m=3D"http://schemas.microsoft.com/office/2004/12/omml" =
xmlns=3D"http://www.w3.org/TR/REC-html40">

<head>
<meta http-equiv=3DContent-Type content=3D"text/html; =
charset=3Dus-ascii">
<meta name=3DGenerator content=3D"Microsoft Word 12 (filtered medium)">
<style>
<!--
 /* Font Definitions */
 @font-face
	{font-family:"Cambria Math";
	panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
	{font-family:Calibri;
	panose-1:2 15 5 2 2 2 4 3 2 4;}
 /* Style Definitions */
 p.MsoNormal, li.MsoNormal, div.MsoNormal
	{margin:0in;
	margin-bottom:.0001pt;
	font-size:11.0pt;
	font-family:"Calibri","sans-serif";}
a:link, span.MsoHyperlink
	{mso-style-priority:99;
	color:blue;
	text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
	{mso-style-priority:99;
	color:purple;
	text-decoration:underline;}
span.EmailStyle17
	{mso-style-type:personal-compose;
	font-family:"Calibri","sans-serif";
	color:windowtext;}
.MsoChpDefault
	{mso-style-type:export-only;}
@page Section1
	{size:8.5in 11.0in;
	margin:1.0in 1.0in 1.0in 1.0in;}
div.Section1
	{page:Section1;}
-->
</style>
<!--[if gte mso 9]><xml>
 <o:shapedefaults v:ext=3D"edit" spidmax=3D"1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
 <o:shapelayout v:ext=3D"edit">
  <o:idmap v:ext=3D"edit" data=3D"1" />
 </o:shapelayout></xml><![endif]-->
</head>

<body lang=3DEN-US link=3Dblue vlink=3Dpurple>

<div class=3DSection1>

<p class=3DMsoNormal>Hey there all,<o:p></o:p></p>

<p class=3DMsoNormal><o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal>I just recently set up the Debian openafs 1.4.4 =
packages on
an Ubuntu server box, running in a virtual machine.&nbsp; It&#8217;s =
monsoon
season here in Tucson and we&#8217;ve had a couple of long power outages =
and
problems with the UPS.&nbsp; Both times the server has gone done =
unexpectedly,
AFS didn&#8217;t come back up correctly.&nbsp; The symptoms I note are =
that &#8220;ls
/afs&#8221; returns empty on the server and the Windows client =
can&#8217;t
connect.<o:p></o:p></p>

<p class=3DMsoNormal><o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal>For whatever reason, the thing that has fixed it =
both times
is running &#8220;fs checkvolumes&#8221;.&nbsp; Of course, &#8220;fs
checkvolumes&#8221; segfaults when I run it, but if I reboot after that, =
everything
comes back up fine, clients can connect, and further &#8220;fs =
checkvolumes&#8221;
don&#8217;t segfault.&nbsp; Rebooting before running that specific =
command
(with the segfault) does nothing&#8212;&#8220;ls /afs&#8221; still =
returns empty.<o:p></o:p></p>

<p class=3DMsoNormal><o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal>So&#8230; a couple of questions:<o:p></o:p></p>

<p class=3DMsoNormal>How do I ensure AFS can survive a power =
outage/unexpected
poweroff without getting borked?<o:p></o:p></p>

<p class=3DMsoNormal>If it does get borked, why would a segfaulting =
&#8220;fs
checkvolumes&#8221; fix things?<o:p></o:p></p>

<p class=3DMsoNormal><o:p>&nbsp;</o:p></p>

<p class=3DMsoNormal>Thanks much!<o:p></o:p></p>

<p class=3DMsoNormal>Karl M. Davis<o:p></o:p></p>

</div>

</body>

</html>

------=_NextPart_000_00CA_01C7DF3D.BB212860--