[OpenAFS] Re: odd problem with RW site after a botched replica

Andrew Deason adeason@sinenomine.net
Mon, 29 Oct 2012 14:03:02 -0500


On Mon, 29 Oct 2012 11:41:09 -0700
Timothy Balcer <timothy@telmate.com> wrote:

> I have a volume that had a replica, which has now been removed with
> vos remsite.

In the future, you should remove RO sites with 'vos remove' if the RO
site has any data on it. 'vos remsite' just modifies the vldb entry, and
doesn't remove the RO volume from disk.
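
As a rough illustration of the difference, here is a minimal sketch in
Python that only builds the two command lines and runs nothing. The
server, partition, and volume names are hypothetical placeholders, and
the helper names are made up for this example.

```python
import shlex

def vos_remove_ro(server, partition, volume):
    """Build a 'vos remove' command for the RO clone of `volume`.

    'vos remove' deletes the volume from disk and drops that site
    from the VLDB entry.
    """
    return ["vos", "remove", "-server", server,
            "-partition", partition, "-id", volume + ".readonly"]

def vos_remsite(server, partition, volume):
    """Build a 'vos remsite' command for the RO site of `volume`.

    'vos remsite' only edits the VLDB entry; any RO volume data
    already on the partition is left in place.
    """
    return ["vos", "remsite", "-server", server,
            "-partition", partition, "-id", volume]

if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    print(shlex.join(vos_remove_ro("fs2.example.com", "/vicepb", "proj.data")))
    print(shlex.join(vos_remsite("fs2.example.com", "/vicepb", "proj.data")))
```

Printing the commands rather than executing them lets you review
exactly what would be run before touching the VLDB.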

> I had made a mistake with the server directive originally, and I
> attempted to correct the error midstream...  ultimately, the RO volume
> seemed to release.

Can you explain a little more what you mean by this?

> However, last night the RW volume went offline, as well as the RO
> volume.

FileLog or VolserLog should have entries from around the time the volume
went offline, which should help explain why it went offline.

> 10/29/2012 01:51:10 SYNC_ask: negative response on circuit 'FSSYNC'
> 10/29/2012 01:51:10 FSYNC_askfs: FSSYNC request denied for reason=101
> 10/29/2012 01:51:10 AskOnline:  file server denied online request to volume
> 536870935 partition /vicepb; trying again...

FileLog should have some entries from around the same time that say why
this error is occurring.
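
If the logs are large, something like the following can narrow them
down. This is only a sketch: the two timestamp layouts are assumptions
(the first matches the lines quoted above, the second is a ctime-style
prefix), and the function names are made up for this example, so adjust
them to whatever your logs actually contain.

```python
from datetime import datetime, timedelta

# Assumed timestamp layouts and their widths in characters:
#   "10/29/2012 01:51:10"        (as in the quoted lines above)
#   "Mon Oct 29 01:51:10 2012"   (ctime-style prefix)
FORMATS = (("%m/%d/%Y %H:%M:%S", 19), ("%a %b %d %H:%M:%S %Y", 24))

def parse_stamp(line):
    """Return the leading timestamp of a log line, or None if unparseable."""
    for fmt, width in FORMATS:
        try:
            return datetime.strptime(line[:width], fmt)
        except ValueError:
            continue
    return None

def lines_near(lines, when, window_seconds=300):
    """Yield log lines stamped within window_seconds of `when`."""
    window = timedelta(seconds=window_seconds)
    for line in lines:
        stamp = parse_stamp(line)
        if stamp is not None and abs(stamp - when) <= window:
            yield line.rstrip("\n")

# Example use; the log path is an assumption and depends on your install:
#   with open("/usr/afs/logs/FileLog") as f:
#       for line in lines_near(f, datetime(2012, 10, 29, 1, 51, 10)):
#           print(line)
```

Lines without a recognizable timestamp are skipped, so continuation
lines of a wrapped message will not show up in the output.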

What version of OpenAFS are you running, and on what platform (Linux or
something else)?

-- 
Andrew Deason
adeason@sinenomine.net