[OpenAFS-devel] volserver / replication question with older version of afs

Josh Fiske jfiske@clarkson.edu
Thu, 2 Feb 2006 10:54:21 -0500


This is a multipart message in MIME format.
--=_alternative 005727D485257109_=
Content-Type: text/plain; charset="US-ASCII"

Hi all,

We have been having a strange issue with one of our AFS servers lately, 
and I hope that someone on this list might have some helpful input on the 
situation.

We have a cell with three older AFS servers (1.2.11).  They have been 
running great for quite some time.  However, twice in the past two weeks 
the Volserver has stopped responding on one of the servers.  When this 
happens, if I do a 'bos status' on the server, it tells me that everything 
is running normally.  But, I know from trying to do a 'vos listvol' on the 
server, that things are not normal, because it times out.  Both times this 
has happened, the server that the volserver died on was the sync site for 
the cell.

Also of note, we have quite a few volumes that are replicated.  When the 
volserver died on the sync site, the read-only replicas were no longer 
accessible.  If a read-only replica is unavailable on one server, 
shouldn't the client know to try one of the others?  I thought this was 
the whole point of replication.

This is truly a perplexing issue to me, so I appreciate your input both in 
determing the problem with my replication and with determing why the 
volserver keeps dying.

Thanks,

-- Josh
- - - - -
Joshua Fiske, Network and Security Engineer
Clarkson University, Office of Information Technology
(315) 268-6722 -- Fax: (315) 268-6570
jfiske@clarkson.edu

CONFIDENTIALITY:  This e-mail (including any attachments) may contain 
confidential, proprietary and privileged information, and unauthorized 
disclosure or use is prohibited.  If you received this e-mail in error, 
please notify the sender and delete this e-mail from your system.

--=_alternative 005727D485257109_=
Content-Type: text/html; charset="US-ASCII"


<br><font size=2 face="sans-serif">Hi all,</font>
<br>
<br><font size=2 face="sans-serif">We have been having a strange issue
with one of our AFS servers lately, and I hope that someone on this list
might have some helpful input on the situation.</font>
<br>
<br><font size=2 face="sans-serif">We have a cell with three older AFS
servers (1.2.11). &nbsp;They have been running great for quite some time.
&nbsp;However, twice in the past two weeks the Volserver has stopped responding
on one of the servers. &nbsp;When this happens, if I do a 'bos status'
on the server, it tells me that everything is running normally. &nbsp;But,
I know from trying to do a 'vos listvol' on the server, that things are
not normal, because it times out. &nbsp;Both times this has happened, the
server that the volserver died on was the sync site for the cell.</font>
<br>
<br><font size=2 face="sans-serif">Also of note, we have quite a few volumes
that are replicated. &nbsp;When the volserver died on the sync site, the
read-only replicas were no longer accessible. &nbsp;If a read-only replica
is unavailable on one server, shouldn't the client know to try one of the
others? &nbsp;I thought this was the whole point of replication.</font>
<br>
<br><font size=2 face="sans-serif">This is truly a perplexing issue to
me, so I appreciate your input both in determing the problem with my replication
and with determing why the volserver keeps dying.</font>
<br>
<br><font size=2 face="sans-serif">Thanks,</font>
<br><font size=2 face="sans-serif"><br>
-- Josh<br>
- - - - -<br>
Joshua Fiske, Network and Security Engineer<br>
Clarkson University, Office of Information Technology<br>
(315) 268-6722 -- Fax: (315) 268-6570<br>
jfiske@clarkson.edu<br>
<br>
CONFIDENTIALITY:&nbsp; This e-mail (including any attachments) may contain
confidential, proprietary and privileged information, and unauthorized
disclosure or use is prohibited.&nbsp; If you received this e-mail in error,
please notify the sender and delete this e-mail from your system.<br>
</font>
--=_alternative 005727D485257109_=--