[OpenAFS] Re: Vos move/release failure

Derrick Brashear shadow@gmail.com
Mon, 5 Apr 2010 07:56:19 -0400


On Mon, Apr 5, 2010 at 3:12 AM, Sidney Cammeresi <sac@cheesecake.org> wrote=
:
> On Sun, 04 Apr 2010 at 13.21.07 -0400, Derrick Brashear wrote:
>> On Sun, Apr 4, 2010 at 7:00 AM, Sidney Cammeresi <sac@cheesecake.org> wr=
ote:
>> > One of my fileservers is returning errors when I try to move or releas=
e
>> > certain volumes to it. =A0For example,
>> >
>> > ----------
>> >
>> > $ vos ex test
>> > test =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0536871=
126 RW =A0 =A0 =A0 =A0 =A02 K =A0On-line
>> > =A0 =A0good /vicepa
>> > =A0 =A0RWrite =A0536871126 ROnly =A0536871133 Backup =A0 =A0 =A0 =A0 =
=A00
>> > =A0 =A0MaxQuota =A0 =A0 =A0 5000 K
>> > =A0 =A0Creation =A0 =A0Sun Apr =A04 12:40:27 2010
>> > =A0 =A0Copy =A0 =A0 =A0 =A0Sun Apr =A04 12:41:53 2010
>> > =A0 =A0Backup =A0 =A0 =A0Never
>> > =A0 =A0Last Update Never
>> >
>> > =A0 =A0RWrite: 536871126
>> > =A0 =A0number of sites -> 1
>> > =A0 =A0 =A0 server good partition /vicepa RW Site
>> > $ vos move test good a bad a
>> >
>> > Failed to move data for the volume 536871126
>> > =A0 : No such file or directory
>> > vos move: operation interrupted, cleanup in progress...
>> > clear transaction contexts
>> > move incomplete - attempt cleanup of target partition - no guarantee
>> > cleanup complete - user verify desired result
>> >
>> > ----------
>> >
>> > All I see in VolserLog on the bad server is
>> >
>> > Sun Apr =A04 12:50:09 2010 VAttachVolume: Failed to open /vicepa/V0536=
871126.vol (errno 2)
>> > Sun Apr =A04 12:50:09 2010 1 Volser: CreateVolume: volume 536871126 (t=
est) created
>> > Sun Apr =A04 12:50:12 2010 1 Volser: Delete: volume 536871126 deleted
>> >
>> >
>> >
>> > I deleted the volume "test," and then I performed this sequence of
>> > commands to get a different failure:
>> >
>> > ----------
>> >
>> > $ vos create bad a test
>> > Volume 536871134 created on partition /vicepa of bad
>> > $ vos addsite bad a test
>> > Added replication site bad /vicepa for volume test
>> > $ vos addsite good a test
>> > Added replication site good /vicepa for volume test
>> > $ vos release test
>> > Released volume test successfully
>> > $ vos move test bad a good a
>> > WARNING : readOnly copies still exist
>> > Volume 536871134 moved from bad /vicepa to good /vicepa
>> > $ vos release test
>> > Release failed: VOLSER: Problems encountered in doing the dump !
>>
>> So you gave us one log. There are 2 servers in play. How about that othe=
r log?
>
> If I do the create, addsite, addsite, release sequence that I previously
> described, I get the following in the VolserLog on the good server:
>
> Mon Apr =A05 09:05:17 2010 1 Volser: CreateVolume: volume 536871141 (test=
) created
> Mon Apr =A05 09:06:19 2010 1 Volser: Clone: Cloning volume 536871141 to n=
ew volume 536871142
>
> and the following in the VolserLog on the bad server:
>
> Mon Apr =A05 09:06:27 2010 VAttachVolume: Failed to open /vicepa/V0536871=
142.vol (errno 2)
> Mon Apr =A05 09:06:27 2010 1 Volser: CreateVolume: volume 536871142 (test=
.readonly) created

Yeah, but you're still not telling us about the source server's log
during the release to the "bad" server. What appears in the good
server's log at 9:06:27?

What OpenAFS versions are in play, again?