[OpenAFS] Workshop troubleshooting session & duplicate VLDB entries
Jeff Quinn
jq@msu.edu
Fri, 30 May 2008 16:05:18 -0400
After moving, it still exhibits the same issue. Moving it from one server
to another, and creating backups all function normally, but the vldb remains
erroneous. Any attempts to synchronize with syncserv or syncvldb are also
ineffective.
Any suggestions as to what I should do now?
-Jeff
Below is the output of the various commands run
> -----Original Message-----
> From: Jeffrey Altman [mailto:jaltman@secure-endpoints.com]
> Sent: Thursday, May 29, 2008 12:39 PM
> To: Jeff Quinn
> Cc: openafs-info@openafs.org
> Subject: Re: [OpenAFS] WOrkshop troubleshooting session & duplicate VLDB
> entries
>
> Jeff Quinn wrote:
>> I appreciate the help given at the workshop, and wanted to be clear on
>> what my next steps were. Please correct me if I am wrong:
>>
>> The problem was traced to an error in the source code of vsprocs.c -
>>
>
http://www.openafs.org/cgi-bin/cvsweb.cgi/~checkout~/openafs/src/volser/vspr
> ocs.c?rev=1.33.2.19.
>>
>> So in order to fix our already messed up vldb, we would just need to
>> compile vsprocs.c in with openafs 1.4.7, and run the vos syncvldb
command.
>>
>> We have done this, and the vldb still contains two entries of each
>> volume on that partition.
>>
>>
>>
>> I tried both syncvldb and syncserv on afsfs5, afsfs10 and a specific
>> volume, both simulated and real. Each time, it makes no changes to the
>> vldb.
> >
>> Am I missing something?
>
> I think you need to move the volume to a new server. That will move the
> real RW to the new server and might leave behind the backup volume that
> has been marked as a RW on the old server. If so, "vos zap" the backup
> volume from the old server and resync.
>
> Jeffrey Altman
>
>
-------------------------Terminal Output begins
below--------------------------
jax > vos listvol afsfs10 vicepa -verbose
Total number of volumes on server afsfs10 partition /vicepa: 2
user.10142 1938678432 RW 2 K On-line
user.10142.backup 1938678434 BK 2 K On-line
Total volumes onLine 2 ; Total volumes offLine 0 ; Total busy 0
jax > vos listvldb -server afsfs10 -partition vicepa -verbose
VLDB entries for server afsfs10 partition /vicepa
user.10142
RWrite: 1938678432 Backup: 1938678434
number of sites -> 2
server afsfs10.cl.msu.edu partition /vicepa RW Site
server afsfs10.cl.msu.edu partition /vicepa RW Site
user.10658
RWrite: 1938678216 Backup: 1938678218
number of sites -> 2
server afsfs5.cl.msu.edu partition /vicepa RW Site
server afsfs10.cl.msu.edu partition /vicepa RW Site
... (1986 similar entries to the one above)
Total entries: 1988
jax > vos exam user.10142
user.10142 1938678432 RW 2 K On-line
afsfs10.cl.msu.edu /vicepa
RWrite 1938678432 ROnly 0 Backup 1938678434
MaxQuota 5000 K
Creation Tue Apr 8 10:54:19 2008
Copy Thu May 29 14:43:29 2008
Backup Thu May 29 22:02:48 2008
Last Update Tue Apr 8 10:54:19 2008
0 accesses in the past day (i.e., vnode references)
RWrite: 1938678432 Backup: 1938678434
number of sites -> 2
server afsfs10.cl.msu.edu partition /vicepa RW Site
server afsfs10.cl.msu.edu partition /vicepa RW Site
jax > vos move user.10142 afsfs10 vicepa afsfs5 vicepa -verbose
Starting transaction on source volume 1938678432 ... done
Allocating new volume id for clone of volume 1938678432 ... done
Cloning source volume 1938678432 ... done
Ending the transaction on the source volume 1938678432 ... done
Starting transaction on the cloned volume 1939602763 ... done
Setting flags on cloned volume 1939602763 ... done
Getting status of cloned volume 1939602763 ... done
Creating the destination volume 1938678432 ... done
Setting volume flags on destination volume 1938678432 ... done
Dumping from clone 1939602763 on source to volume 1938678432 on destination
... done
Ending transaction on cloned volume 1939602763 ... done
Starting transaction on source volume 1938678432 ... done
Doing the incremental dump from source to destination for volume 1938678432
... done
Setting volume flags on old source volume 1938678432 ... done
Setting volume flags on new source volume 1938678432 ... done
Ending transaction on destination volume 1938678432 ... done
Releasing lock on VLDB entry for volume 1938678432 ... done
Setting forwarding pointer for volume 1938678432 ... done
Deleting old volume 1938678432 on source ... done
Ending transaction on old volume 1938678432 on the source ... done
Creating transaction for backup volume 1938678434 on source ... done
Starting transaction on the cloned volume 1939602763 ... done
Deleting the cloned volume 1939602763 ... done
Ending transaction on cloned volume 1939602763 ... done
Volume 1938678432 moved from afsfs10 /vicepa to afsfs5 /vicepa
jax > vos exam user.10142
user.10142 1938678432 RW 2 K On-line
afsfs5.cl.msu.edu /vicepa
RWrite 1938678432 ROnly 0 Backup 0
MaxQuota 5000 K
Creation Tue Apr 8 10:54:19 2008
Copy Fri May 30 15:39:04 2008
Backup Thu May 29 22:02:48 2008
Last Update Tue Apr 8 10:54:19 2008
0 accesses in the past day (i.e., vnode references)
RWrite: 1938678432
number of sites -> 2
server afsfs5.cl.msu.edu partition /vicepa RW Site
server afsfs10.cl.msu.edu partition /vicepa RW Site
jax > vos backup user.10142 -verbose
Creating a new backup clone 1938678434 ... done
Created backup volume for user.10142
jax > vos exam user.10142 -verbose
Fetching VLDB entry for 1938678432 .. done
Getting volume listing from the server afsfs5.cl.msu.edu .. done
user.10142 1938678432 RW 2 K On-line
afsfs5.cl.msu.edu /vicepa
RWrite 1938678432 ROnly 0 Backup 1938678434
MaxQuota 5000 K
Creation Tue Apr 8 10:54:19 2008
Copy Fri May 30 15:39:04 2008
Backup Fri May 30 15:40:17 2008
Last Update Tue Apr 8 10:54:19 2008
0 accesses in the past day (i.e., vnode references)
RWrite: 1938678432 Backup: 1938678434
number of sites -> 2
server afsfs5.cl.msu.edu partition /vicepa RW Site
server afsfs10.cl.msu.edu partition /vicepa RW Siteuser.10142
1938678432 RW 2 K On-line
afsfs5.cl.msu.edu /vicepa
RWrite 1938678432 ROnly 0 Backup 1938678434
MaxQuota 5000 K
Creation Tue Apr 8 10:54:19 2008
Copy Fri May 30 15:39:04 2008
Backup Fri May 30 15:40:17 2008
Last Update Tue Apr 8 10:54:19 2008
0 accesses in the past day (i.e., vnode references)
RWrite: 1938678432 Backup: 1938678434
number of sites -> 2
server afsfs5.cl.msu.edu partition /vicepa RW Site
server afsfs10.cl.msu.edu partition /vicepa RW Site
jax > vos listvol afsfs10 vicepa -verbose
Total number of volumes on server afsfs10 partition /vicepa: 0
Total volumes onLine 0 ; Total volumes offLine 0 ; Total busy 0
jax > vos listvldb -server afsfs10 -partition vicepa -verbose
VLDB entries for server afsfs10 partition /vicepa
user.10142
RWrite: 1938678432 Backup: 1938678434
number of sites -> 2
server afsfs5.cl.msu.edu partition /vicepa RW Site
server afsfs10.cl.msu.edu partition /vicepa RW Site
user.10658
RWrite: 1938678216 Backup: 1938678218
number of sites -> 2
server afsfs5.cl.msu.edu partition /vicepa RW Site
server afsfs10.cl.msu.edu partition /vicepa RW Site
... (1986 similar entries to the ones above)
Total entries: 1988
jax > vos syncvldb -server afsfs10 -partition vicepa -volume user.10142
-verbose
Processing VLDB entry user.10142 ...
_______________________________
-- status before --
user.10142
RWrite: 1938678432 Backup: 1938678434
number of sites -> 2
server afsfs5.cl.msu.edu partition /vicepa RW Site
server afsfs10.cl.msu.edu partition /vicepa RW Site
-- status after --
**no change**
_______________________________
...done entry
VLDB volume user.10142 synchronized with state of server afsfs10 partition
/vicepa
jax > vos syncvldb -volume user.10142 -verbose
Processing VLDB entry user.10142 ...
_______________________________
-- status before --
user.10142
RWrite: 1938678432 Backup: 1938678434
number of sites -> 2
server afsfs5.cl.msu.edu partition /vicepa RW Site
server afsfs10.cl.msu.edu partition /vicepa RW Site
-- status after --
**no change**
_______________________________
...done entry
VLDB volume user.10142 synchronized
jax > vos syncserv -server afsfs10 -partition vicepa -verbose
... (Same as the sample volume) ...
Processing VLDB entry 472 ...
_______________________________
-- status before --
user.10142
RWrite: 1938678432 Backup: 1938678434
number of sites -> 2
server afsfs5.cl.msu.edu partition /vicepa RW Site
server afsfs10.cl.msu.edu partition /vicepa RW Site
-- status after --
**no change**
_______________________________
...done entry 472
... (Same as the sample volume)
Total entries: 1988, Failed to process 0, Changed 0
Server afsfs10 partition /vicepa synchronized with VLDB
jax >