[OpenAFS] Re: nightly failure since upgrading to 1.6.5

Tracy Di Marco White gendalia@gmail.com
Mon, 10 Feb 2014 15:47:22 -0600


--001a11c2bcf2e05cf604f214487d
Content-Type: text/plain; charset=ISO-8859-1

On Mon, Feb 10, 2014 at 3:22 PM, Andrew Deason <adeason@sinenomine.net>wrote:

> On Mon, 10 Feb 2014 15:09:25 -0600
> Tracy Di Marco White <gendalia@gmail.com> wrote:
>


> I may have misinterpreted something up there. Were you running a prior
> 1.6 release with DAFS before, and this just started happening with
> 1.6.5? Or did you "switch" to DAFS and this started happening? Or did
> you upgrade from 1.4 and switch to DAFS at the same time?


I've had a single fileserver running DAFS with less valuable data for more
than a year, but as the only issue it saw was some interaction issues with
an AFS client of Harald's, I had no fear of finally upgrading the rest. That
server was running 1.6.2. The rest were running 1.4.something. I emptied
three servers, upgraded them to NetBSD 6.1.3 and OpenAFS 1.6.5 from
pkgsrc, adding a patch for davolserver. (I'll update the package to 1.6.6
in my copious free time this week, maybe, unless I'm beaten to that.)
Then I dumped another fileserver at them. As far as I can tell the other
two are working flawlessly. At least by comparison. The oldest is fine.


> > It happens on one server, of four, and it's most of the way through
> > creating backup volumes on this particular server. It is consistently
> > happening on one, and only one, server.
>
> Oh okay, well that makes me feel a little better :)


I will note that the volumes on the server that's falling over at
midnight:02
every night were previously on a different server that was also not staying
up more than a few days at a time. So there may be something odd with
a volume, I just don't know which one yet.

For what it's worth, when I did the restart this morning, the backupsys
continued on its merry way.

-Tracy

--001a11c2bcf2e05cf604f214487d
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">On Mon, Feb 10, 2014 at 3:22 PM, Andrew Deason <span dir=
=3D"ltr">&lt;<a href=3D"mailto:adeason@sinenomine.net" target=3D"_blank">ad=
eason@sinenomine.net</a>&gt;</span> wrote:<br><div class=3D"gmail_extra"><d=
iv class=3D"gmail_quote">
<blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-=
left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;p=
adding-left:1ex">On Mon, 10 Feb 2014 15:09:25 -0600<br>
<div class=3D"">Tracy Di Marco White &lt;<a href=3D"mailto:gendalia@gmail.c=
om">gendalia@gmail.com</a>&gt; wrote:=A0</div></blockquote><div>=A0</div><b=
lockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-le=
ft-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;pad=
ding-left:1ex">
<div class=3D"">I may have misinterpreted something up there. Were you runn=
ing a prior<br></div>
1.6 release with DAFS before, and this just started happening with<br>
1.6.5? Or did you &quot;switch&quot; to DAFS and this started happening? Or=
 did<br>
you upgrade from 1.4 and switch to DAFS at the same time?</blockquote><div>=
<br></div><div>I&#39;ve had a single fileserver running DAFS with less valu=
able data for more</div><div>than a year, but as the only issue it saw was =
some interaction issues with</div>
<div>an AFS client of Harald&#39;s, I had no fear of finally upgrading the =
rest. That</div><div>server was running 1.6.2. The rest were running 1.4.so=
mething. I emptied</div><div>three servers, upgraded them to NetBSD 6.1.3 a=
nd OpenAFS 1.6.5 from</div>
<div>pkgsrc, adding a patch for davolserver. (I&#39;ll update the package t=
o 1.6.6</div><div>in my copious free time this week, maybe, unless I&#39;m =
beaten to that.)</div><div>Then I dumped another fileserver at them. As far=
 as I can tell the other</div>
<div>two are working flawlessly. At least by comparison. The oldest is fine=
.</div><div>=A0<br></div><blockquote class=3D"gmail_quote" style=3D"margin:=
0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);=
border-left-style:solid;padding-left:1ex">
<div class=3D"">
&gt; It happens on one server, of four, and it&#39;s most of the way throug=
h<br>
&gt; creating backup volumes on this particular server. It is consistently<=
br>
&gt; happening on one, and only one, server.<br>
<br>
</div>Oh okay, well that makes me feel a little better :)</blockquote><div>=
=A0</div><div><div>I will note that the volumes on the server that&#39;s fa=
lling over at midnight:02</div><div>every night were previously on a differ=
ent server that was also not staying</div>
<div>up more than a few days at a time. So there may be something odd with<=
/div><div>a volume, I just don&#39;t know which one yet.</div></div><div><b=
r></div><div>For what it&#39;s worth, when I did the restart this morning, =
the backupsys</div>
<div>continued on its merry way.</div><div><br></div><div>-Tracy=A0</div></=
div></div></div>

--001a11c2bcf2e05cf604f214487d--