[OpenAFS] resurrecting dead server

Jason Edgecombe jason@rampaginggeek.com
Sun, 16 Jan 2011 15:26:30 -0500


I recommend a reinstall, then copying /usr/afs/etc from another server to 
bring over the KeyFile, CellServDB, etc.
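
A rough sketch of that copy step, run on the freshly reinstalled machine. The
host name afs2.example.com is a placeholder for one of the surviving servers,
and the use of scp and root ssh access is an assumption rather than something
stated in this thread. By default the script only prints the commands; set
RUN=1 to actually execute them.

```shell
#!/bin/sh
# Hypothetical host name: replace with a surviving server in your cell.
GOOD_SERVER="${GOOD_SERVER:-afs2.example.com}"

# Print each command unless RUN=1 is set, so the steps can be reviewed first.
run() {
    if [ "${RUN:-0}" = "1" ]; then
        "$@"
    else
        echo "$*"
    fi
}

copy_afs_etc() {
    # Pull the whole /usr/afs/etc directory (KeyFile, CellServDB, ThisCell,
    # UserList, ...) from the known good server, preserving modes and times.
    run scp -rp "root@${GOOD_SERVER}:/usr/afs/etc" /usr/afs/
    # The KeyFile should stay readable by root only, as in the listing.
    run chmod 600 /usr/afs/etc/KeyFile
}

copy_afs_etc
```

If the cell is being rekeyed anyway, the copied KeyFile only needs to be good
enough to get the server talking to the rest of the cell until the new key is
in place.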

Jason

On 01/16/2011 01:48 PM, Ted Creedon wrote:
> How about moving the /usr/afs directories from a known good server to the RW
> server and doing the same?
>
> ted
>
> On Sun, Jan 16, 2011 at 7:10 AM, Jason Edgecombe <jason@rampaginggeek.com> wrote:
>
>> On 01/15/2011 01:05 PM, Ted Creedon wrote:
>>
>>> my R/W server died but /vicepa and /vicepb were saved on their raid
>>> drives.
>>> this server also ran the krb5kdc...
>>>
>>> there are 2 other RO servers still OK but 2 out of 20 volumes are not
>>> up to date. both were running upclient etc
>>>
>>> the data on the RO servers is:
>>> /usr/afs/db:
>>> total 316
>>> drwx------ 2 root root   4096 May  5  2009 ./
>>> drwxr-xr-x 7 root root   4096 May  6  2009 ../
>>> -rw------- 1 root root   1088 May  2  2009 bdb.DB0
>>> -rw------- 1 root root     64 Jan  7 12:18 bdb.DBSYS1
>>> -rw------- 1 root root  68672 May  5  2009 prdb.DB0
>>> -rw------- 1 root root  68672 May  5  2009 prdb.DB0.bak
>>> -rw------- 1 root root     64 Jan  7 12:18 prdb.DBSYS1
>>> -rw------- 1 root root     64 May  5  2009 prdb.DBSYS1.bak
>>> -rw------- 1 root root 144448 Jan  7 12:18 vldb.DB0
>>> -rw------- 1 root root     64 Jan  7 13:50 vldb.DBSYS1
>>>
>>> /usr/afs/etc:
>>> total 80
>>> drwxr-xr-x 2 root root  4096 Jun 16  2010 ./
>>> drwxr-xr-x 7 root root  4096 May  6  2009 ../
>>> -rw-r--r-- 1 root root   162 Jan  7 12:17 CellServDB
>>> -rw------- 1 root root   100 May  6  2009 KeyFile
>>> -rw-r--r-- 1 root root    10 May  2  2009 NetRestrict
>>> -rw-r--r-- 1 root root    11 May  6  2009 ThisCell
>>> -rw-r--r-- 1 root root    39 Nov 15  2008 UserList
>>>
>>> /usr/afs/local:
>>> total 24
>>> drwx------ 2 root root 4096 Jan  7 13:53 ./
>>> drwxr-xr-x 7 root root 4096 May  6  2009 ../
>>> -rw-r--r-- 1 root root  313 Apr 30  2009 BosConfig
>>> -rw-r--r-- 1 root root   10 May  6  2009 NetRestrict
>>> -rw-r--r-- 1 root root    0 Jan  7 13:53 SALVAGE.fs
>>> srwxr-xr-x 1 root root    0 Jan  7 12:22 fssync.sock=
>>> -rw-r--r-- 1 root root    0 May  2  2009 salvage.lock
>>> -rw-r--r-- 1 root root   32 Jan  7 11:55 sysid
>>> -rw-r--r-- 1 root root   32 Jan  7 11:00 sysid.old
>>>
>>>
>>> what's the best way to proceed after I rekey?
>>>
>>> PS I also have vos dump files but the 2 aforementioned volumes are not
>>> up to date.
>>>
>>> thanks
>>>
>>> tedc
>>>
>> I suggest connecting the raid drives to another server, possibly one of
>> your R/O servers. Mount the /vicepX partitions, restart the fileserver, then
>> run "vos syncserv" and "vos syncvldb" to have the recovered volumes be
>> remapped to the different server.
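
The recovery steps suggested above could look roughly like the following once
/vicepa and /vicepb are mounted on the surviving machine. This is only a
sketch: afs2.example.com is a placeholder host name, and -localauth assumes
the commands are run as root on a server that already holds the cell KeyFile.
The OpenAFS administration guide runs "vos syncvldb" before "vos syncserv",
so that order is used here. By default the script only prints the commands;
set RUN=1 to execute them.

```shell
#!/bin/sh
# Hypothetical host name: the server that now holds the recovered partitions.
NEW_SERVER="${NEW_SERVER:-afs2.example.com}"

# Print each command unless RUN=1 is set, so the steps can be reviewed first.
run() {
    if [ "${RUN:-0}" = "1" ]; then
        "$@"
    else
        echo "$*"
    fi
}

recover_volumes() {
    # Restart the fs instance so the fileserver attaches the volumes found
    # on the newly mounted /vicepX partitions.
    run bos restart -server "$NEW_SERVER" -instance fs -localauth
    # Make the VLDB agree with the volume headers actually on this server.
    run vos syncvldb -server "$NEW_SERVER" -localauth -verbose
    # Check the VLDB entries that mention this server against its volumes.
    run vos syncserv -server "$NEW_SERVER" -localauth -verbose
    # List what the server now reports, to compare against the 20 volumes.
    run vos listvol -server "$NEW_SERVER" -localauth
}

recover_volumes
```

Checking the "vos listvol" output against the expected volume list before
releasing anything helps confirm that the two out-of-date volumes came back
from the RAID copies rather than from the older R/O copies.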