[OpenAFS] RClone - Volume already exists

Nicolas DEFFAYET nicolas-ml@deffayet.com
Tue, 15 Oct 2013 11:37:50 +0000


Hello,

I have troubles with some volumes during the vos release:
# vos release backups -verbose -localauth

backups 
    RWrite: 536870926     ROnly: 536870936     Backup: 536870946
RClone: 536870936 
    number of sites -> 2
       server primary partition /vicepa RW Site  -- New release
       server secondary partition /vicepa RO Site  -- Old release
This is a complete release of volume 536870926
Cloning RW volume 536870926 to temporary RO...Failed to clone the RW
volume 536870926
Volume already exists
Error in vos release command.
Volume already exists
#

Ted have reported a problem of RClone
(http://lists.openafs.org/pipermail/openafs-info/2012-May/038225.html),
the feedback from Andrew was:
"That usually appears after an interrupted volume release. It should
stop
being displayed after you release the volume successfully, or delete it,
etc."

I have same problem but how fix it without using vos delsite and vos
addsite and re-download the full volume from primary server to secondary
server ?

The problem have appeared after the reboot of secondary server and only
few volumes have this problem of RClone generating the error "Volume
already exists" during release.

According the documentation from IBM:
"If necessary, the Volume Server creates a temporary copy (a clone) of
the read/write source called the ReleaseClone (see the following
discussion of when the Volume Server does or does not create a new
ReleaseClone.) It assigns the ReleaseClone its own volume ID number,
which the VL Server records in the RClone field of the source volume's
VLDB entry."
"To override the default behavior, forcing the Volume Server to create
and release a new ReleaseClone to the read-only sites, include the -f
flag. This is appropriate if, for example, the data at the read/write
site has changed since the existing ReleaseClone was created during the
previous release operation."

-> -f flags don't fix the issue, same error "Volume already exists" It
seem that the flag doesn't exist with OpenAFS and so no effect when
trying to use it.

How remove this temporary copy ?

I have tried to salvage the volume but it don't fix the issue, same
error "Volume already exists"
# bos salvage -server 127.0.0.1 -partition /vicepa -volume backups
-verbose -localauth

# vos exa backups -verbose -localauth
Fetching VLDB entry for 536870926 .. done
Getting volume listing from the server primary .. done
backups                           536870926 RW   62387494 K  On-line
    primary /vicepa 
    RWrite  536870926 ROnly  536870936 Backup  536870946 
    MaxQuota          0 K 
    Creation    Tue Sep 11 16:28:18 2007
    Copy        Tue Sep 11 16:13:44 2007
    Backup      Tue Oct 15 06:25:02 2013
    Last Access Mon Oct 14 23:37:41 2013
    Last Update Mon Oct 14 14:53:53 2013
    0 accesses in the past day (i.e., vnode references)

    RWrite: 536870926     ROnly: 536870936     Backup: 536870946
RClone: 536870936 
    number of sites -> 2
       server primary partition /vicepa RW Site  -- New release
       server secondary partition /vicepa RO Site  -- Old release
#

# vos listvldb -verbose -localauth
VLDB entries for all servers 

archives 
    RWrite: 536870925     ROnly: 536870935     Backup: 536870945 
    number of sites -> 2
       server primary partition /vicepa RW Site 
       server secondary partition /vicepa RO Site 

backups 
    RWrite: 536870926     ROnly: 536870936     Backup: 536870946
RClone: 536870936 
    number of sites -> 2
       server primary partition /vicepa RW Site  -- New release
       server secondary partition /vicepa RO Site  -- Old release
#

# vos listvol -server primary -verbose -localauth
Total number of volumes on server primary partition /vicepa: 28 
archives                          536870925 RW   55518591 K On-line
archives.backup                   536870945 BK   55518591 K On-line
backups                           536870926 RW   62387494 K On-line
backups.backup                    536870946 BK   62387494 K On-line
[...]
#

# vos listvol -server secondary -verbose -localauth
Total number of volumes on server secondary partition /vicepa: 14 
archives.readonly                 536870935 RO   55518591 K On-line
backups.readonly                  536870936 RO   65102879 K On-line
[...]
#


Thanks

Best Regards,

-- 
Nicolas DEFFAYET