[OpenAFS] Re: odd problem with RW site after a botched replica

Timothy Balcer timothy@telmate.com
Mon, 29 Oct 2012 12:41:09 -0700


On Mon, Oct 29, 2012 at 12:03 PM, Andrew Deason <adeason@sinenomine.net> wrote:

> On Mon, 29 Oct 2012 11:41:09 -0700
> Timothy Balcer <timothy@telmate.com> wrote:
>
> > I have a volume that had a replica, which has now been removed with
> > vos remsite.
>
> In the future, you should remove RO sites with 'vos remove' if the RO
> site has any data on it. 'vos remsite' just modifies the vldb entry, and
> doesn't remove the RO volume from disk.
>

Ahh.. thanks for that! :)


>
> > I had made a mistake with the server directive originally, and I
> > attempted to correct the error midstream...  ultimately, the RO volume
> > seemed to release.
>
> Can you explain a little more what you mean by this?
>

I did an addsite but specified the same server as the RW volume and,
foolishly, tried to interrupt the process.  I then tried to vos remove the
RO volume, but it wouldn't do it, so I did a forced zap. I then did a vos
addsite with the proper server directive, and it appeared to go ok, and I
was able to release.


> > However, last night the RW volume went offline, as well as the RO
> > volume.
>
> FileLog or VolserLog should say something around the time it went
> offline, which should help say why it went offline.
>

Unfortunately, it looks like I need to change the logging prefs for openafs
on my system, as it has wiped those out already after two restarts.


>
> > 10/29/2012 01:51:10 SYNC_ask: negative response on circuit 'FSSYNC'
> > 10/29/2012 01:51:10 FSYNC_askfs: FSSYNC request denied for reason=101
> > 10/29/2012 01:51:10 AskOnline:  file server denied online request to volume
> > 536870935 partition /vicepb; trying again...
>
> FileLog should have some entries from around the same time that say why
> this error is occurring.
>
> What version of OpenAFS are you running? Is this on linux, or what
> platform is this?
>

1.6.1 on Ubuntu, as shown in the first line of the salvage log.

OpenAFS 1.6.1-2ubuntu2-debian built  2012-09-12

I would also add that vos examine says the volume does not exist, and
shows only the VLDB dump... I am guessing this is because it is offline?
FYI the volume file is present on /vicepb.

root@afs-db:/var/log/openafs# ls /vicepb
AFSIDat  Lock  lost+found  V0536870935.vol
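
For anyone cross-checking the same way, here is a rough Python sketch that pulls the volume ID, partition, and FSSYNC reason code out of the salvage log lines quoted above, and derives the header file name to look for on the partition. The "V" + zero-padded 10-digit ID + ".vol" naming is an assumption inferred from the listing above, and the helper names (summarize, header_name) are made up for illustration. It does not try to interpret what reason=101 means.

```python
import re

# Log lines copied from the salvage log quoted earlier in this message.
LOG = """\
10/29/2012 01:51:10 SYNC_ask: negative response on circuit 'FSSYNC'
10/29/2012 01:51:10 FSYNC_askfs: FSSYNC request denied for reason=101
10/29/2012 01:51:10 AskOnline:  file server denied online request to volume 536870935 partition /vicepb; trying again...
"""

REASON_RE = re.compile(r"FSSYNC request denied for reason=(\d+)")
VOLUME_RE = re.compile(r"online request to volume (\d+) partition (\S+);")


def summarize(log_text):
    """Return (reason codes, [(volume id, partition), ...]) found in the log."""
    reasons = [int(m.group(1)) for m in REASON_RE.finditer(log_text)]
    volumes = [(int(m.group(1)), m.group(2)) for m in VOLUME_RE.finditer(log_text)]
    return reasons, volumes


def header_name(volume_id):
    """Header file name for a volume ID (assumed convention: V<10 digits>.vol)."""
    return "V%010d.vol" % volume_id


reasons, volumes = summarize(LOG)
for vol_id, partition in volumes:
    # Print what to look for on the partition, plus the raw reason codes.
    print(vol_id, partition, header_name(vol_id), reasons)
```

For volume 536870935 this gives the same file name that shows up in the ls of /vicepb, so the header on disk does belong to the volume the fileserver is refusing to bring online.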


> --
> Andrew Deason
> adeason@sinenomine.net
>
> _______________________________________________
> OpenAFS-info mailing list
> OpenAFS-info@openafs.org
> https://lists.openafs.org/mailman/listinfo/openafs-info
>



-- 
Timothy Balcer
