[OpenAFS] Connection timed out and device doesn't exist finally solved

Timothy Balcer timothy@telmate.com
Tue, 24 Dec 2013 00:19:37 -0800


--001a11330e60be633604ee436769
Content-Type: text/plain; charset=ISO-8859-1

Very very odd behavior. To put it in short.. an entire fileserver's RW
volumes became unavailable to our colo sites, but not the local site. Every
effort to determine the cause was met with frustration (all sorts of
cachemanager operations yielded nothing)

That is, until I did an fs whereis on the affected volume, on the
fileserver machine itself...

It told me the RW volume was available on host 192.168.122.1. Formerly a
virtual host bridge interface, but no longer used.

VLDB did not show this.. syncserv and syncvldb's had not fixed the problem.
Restarting the fileserver process did not release it, even though the IP
was no longer active.

So I moved one volume. That worked. But I didn't want to do that for the
entire fileserver.

So I entered -rxbind to the fileserver process and restarted it.

Voila. Problem solved.

-- 
Timothy Balcer / IT Services
Telmate / San Francisco, CA
Direct / (415) 300-4313
Customer Service / (800) 205-5510

--001a11330e60be633604ee436769
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div><div><div>Very very odd behavior. To put it in short.=
. an entire fileserver&#39;s RW volumes became unavailable to our colo site=
s, but not the local site. Every effort to determine the cause was met with=
 frustration (all sorts of cachemanager operations yielded nothing)<br>
<br>That is, until I did an fs whereis on the affected volume, on the files=
erver machine itself...<br><br></div>It told me the RW volume was available=
 on host 192.168.122.1. Formerly a virtual host bridge interface, but no lo=
nger used.<br>
<br></div>VLDB did not show this.. syncserv and syncvldb&#39;s had not fixe=
d the problem. Restarting the fileserver process did not release it, even t=
hough the IP was no longer active.<br><br></div>So I moved one volume. That=
 worked. But I didn&#39;t want to do that for the entire fileserver.<br>
<br>So I entered -rxbind to the fileserver process and restarted it.<br><br=
>Voila. Problem solved.<br clear=3D"all"><div><div><div><div><br>-- <br><sp=
an style=3D"border-collapse:collapse;color:rgb(102,102,102);font-family:ver=
dana,sans-serif;font-size:x-small">Timothy Balcer / IT Services<br>
Telmate / San Francisco, CA<br>Direct / </span><span style=3D"border-collap=
se:collapse;font-family:verdana,sans-serif;font-size:x-small"><font color=
=3D"#1155cc">(415) 300-4313</font><br><font color=3D"#666666">Customer Serv=
ice /=A0</font><a value=3D"+18002055510" style=3D"color:rgb(17,85,204)">(80=
0) 205-5510</a></span>
</div></div></div></div></div>

--001a11330e60be633604ee436769--