[OpenAFS] Cannot move volumes between two fileservers

Peter Schmid pschmid@hsz-t.ch
Fri, 15 Oct 2004 18:07:08 +0200 (MEST)


We are using two OpenAFS server: homer is a solaris 9 machine, willie
is a GNU/Linux box. They are connected over a 2 Mb/s leased line. We use
on both of them OpenAFS 1.2.11.

I tried to move a volume from homer to willie, but that did not
work (I can move volumes within the servers):

# vos move -id user.babt -fromserver homer -frompartition /vicepd -toserver willie -topartition /vicepa -verbose
Starting transaction on source volume 536878191 ... done
Cloning source volume 536878191 ... done
Ending the transaction on the source volume 536878191 ... done
Starting transaction on the cloned volume 536878279 ... done
Creating the destination volume 536878191 ... done
Dumping from clone 536878279 on source to volume 536878191 on destination ...Failed to move data for the volume 536878191
   VOLSER: Problems encountered in doing the dump !
vos move: operation interrupted, cleanup in progress...
clear transaction contexts
access VLDB
move incomplete - attempt cleanup of target partition - no guarantee
cleanup complete - user verify desired result


# bos listhosts homer
Cell name is hsz-t.ch
    Host 1 is homer.isz.ch
    Host 2 is willie.isz.ch

# udebug homer 7003
Host's addresses are: 193.5.54.111 
Host's 193.5.54.111 time is Fri Oct 15 17:27:06 2004
Local time is Fri Oct 15 17:27:07 2004 (time differential 1 secs)
Last yes vote for 193.5.54.111 was 2 secs ago (sync site); 
Last vote started 2 secs ago (at Fri Oct 15 17:27:05 2004)
Local db version is 1097852371.3
I am sync site until 58 secs from now (at Fri Oct 15 17:28:05 2004) (2 servers)
Recovery state f
Sync site's db version is 1097852371.3
0 locked pages, 0 of them for write
Last time a new db version was labelled was:
         1655 secs ago (at Fri Oct 15 16:59:32 2004)

Server (193.5.55.180): (db 0.0)
    last vote never rcvd 
    last beacon never sent 
    dbcurrent=0, up=0 beaconSince=0

# udebug willie 7003
Host's addresses are: 193.5.55.180 
Host's 193.5.55.180 time is Fri Oct 15 17:27:32 2004
Local time is Fri Oct 15 17:27:32 2004 (time differential 0 secs)
Last yes vote for 193.5.55.180 was 1 secs ago (not sync site); 
Last vote started 1 secs ago (at Fri Oct 15 17:27:31 2004)
Local db version is 1.1
I am not sync site
Lowest host 193.5.55.180 was set 1 secs ago
Sync host 0.0.0.0 was set 1097854052 secs ago
Sync site's db version is 1.1
0 locked pages, 0 of them for write


It seems there is something wrong with the synchronisation (Ubik). The
"Recovery state" is "f" and not "1f". Why willie never votes?

Regards
Peter Schmid