[OpenAFS] RClone - Volume already exists
Nicolas DEFFAYET
nicolas-ml@deffayet.com
Tue, 15 Oct 2013 11:37:50 +0000
Hello,
I have troubles with some volumes during the vos release:
# vos release backups -verbose -localauth
backups
RWrite: 536870926 ROnly: 536870936 Backup: 536870946
RClone: 536870936
number of sites -> 2
server primary partition /vicepa RW Site -- New release
server secondary partition /vicepa RO Site -- Old release
This is a complete release of volume 536870926
Cloning RW volume 536870926 to temporary RO...Failed to clone the RW
volume 536870926
Volume already exists
Error in vos release command.
Volume already exists
#
Ted have reported a problem of RClone
(http://lists.openafs.org/pipermail/openafs-info/2012-May/038225.html),
the feedback from Andrew was:
"That usually appears after an interrupted volume release. It should
stop
being displayed after you release the volume successfully, or delete it,
etc."
I have same problem but how fix it without using vos delsite and vos
addsite and re-download the full volume from primary server to secondary
server ?
The problem have appeared after the reboot of secondary server and only
few volumes have this problem of RClone generating the error "Volume
already exists" during release.
According the documentation from IBM:
"If necessary, the Volume Server creates a temporary copy (a clone) of
the read/write source called the ReleaseClone (see the following
discussion of when the Volume Server does or does not create a new
ReleaseClone.) It assigns the ReleaseClone its own volume ID number,
which the VL Server records in the RClone field of the source volume's
VLDB entry."
"To override the default behavior, forcing the Volume Server to create
and release a new ReleaseClone to the read-only sites, include the -f
flag. This is appropriate if, for example, the data at the read/write
site has changed since the existing ReleaseClone was created during the
previous release operation."
-> -f flags don't fix the issue, same error "Volume already exists" It
seem that the flag doesn't exist with OpenAFS and so no effect when
trying to use it.
How remove this temporary copy ?
I have tried to salvage the volume but it don't fix the issue, same
error "Volume already exists"
# bos salvage -server 127.0.0.1 -partition /vicepa -volume backups
-verbose -localauth
# vos exa backups -verbose -localauth
Fetching VLDB entry for 536870926 .. done
Getting volume listing from the server primary .. done
backups 536870926 RW 62387494 K On-line
primary /vicepa
RWrite 536870926 ROnly 536870936 Backup 536870946
MaxQuota 0 K
Creation Tue Sep 11 16:28:18 2007
Copy Tue Sep 11 16:13:44 2007
Backup Tue Oct 15 06:25:02 2013
Last Access Mon Oct 14 23:37:41 2013
Last Update Mon Oct 14 14:53:53 2013
0 accesses in the past day (i.e., vnode references)
RWrite: 536870926 ROnly: 536870936 Backup: 536870946
RClone: 536870936
number of sites -> 2
server primary partition /vicepa RW Site -- New release
server secondary partition /vicepa RO Site -- Old release
#
# vos listvldb -verbose -localauth
VLDB entries for all servers
archives
RWrite: 536870925 ROnly: 536870935 Backup: 536870945
number of sites -> 2
server primary partition /vicepa RW Site
server secondary partition /vicepa RO Site
backups
RWrite: 536870926 ROnly: 536870936 Backup: 536870946
RClone: 536870936
number of sites -> 2
server primary partition /vicepa RW Site -- New release
server secondary partition /vicepa RO Site -- Old release
#
# vos listvol -server primary -verbose -localauth
Total number of volumes on server primary partition /vicepa: 28
archives 536870925 RW 55518591 K On-line
archives.backup 536870945 BK 55518591 K On-line
backups 536870926 RW 62387494 K On-line
backups.backup 536870946 BK 62387494 K On-line
[...]
#
# vos listvol -server secondary -verbose -localauth
Total number of volumes on server secondary partition /vicepa: 14
archives.readonly 536870935 RO 55518591 K On-line
backups.readonly 536870936 RO 65102879 K On-line
[...]
#
Thanks
Best Regards,
--
Nicolas DEFFAYET