[OpenAFS] Re: 1.6.1a migration from 1.4: corrupted and offline
volumes
Jakub Moscicki
jakub.moscicki@cern.ch
Thu, 20 Sep 2012 09:06:00 +0200
Hello,
OK, thanks, indeed it looks like a problem of a particular cell only...
sorry for the confusion.
Derrick: looking at a larger picture, is there a plan to integrate (some
of) these patches into the main repo in the future?
Many thanks,
kuba
Best regards,
Jakub Moscicki
On 09/19/2012 08:31 PM, Derrick Brashear wrote:
> it's some patch CERN applied to 1.4; I had email from long ago with it
> and Andrew reminded me
> that I'd seen it before, so I dug it out.
>
> On Wed, Sep 19, 2012 at 2:29 PM, Troy Benjegerdes <hozer@hozed.org> wrote:
>> Is that in the main openafs git tree RxOSD branch, or some other patch?
>>
>> On Wed, Sep 19, 2012 at 11:48:11AM -0400, Derrick Brashear wrote:
>>> Ah. Yup. Something like this:
>>>
>>> UserId author; /* Userid of the last user storing the file */
>>> UserId owner; /* Userid of the user who created the file */
>>> VnodeId parent; /* Parent directory vnode */
>>> - bit32 vnodeMagic; /* Magic number--mainly for file server
>>> + bit16 vnodeMagic; /* Magic number--mainly for file server
>>> * paranoia checks */
>>> -# define SMALLVNODEMAGIC 0xda8c041F
>>> +# define SMALLVNODEMAGIC 0xda8c
>>> # define LARGEVNODEMAGIC 0xad8765fe
>>> /* Vnode magic can be removed, someday, if we run need the room. Simply
>>> * have to be sure that the thing we replace can be VNODEMAGIC, rather
>>> * than 0 (in an old file system). Or go through and zero the fields,
>>> * when we notice a version change (the index version number) */
>>> + unsigned int isMigrated:1;
>>> + unsigned int rsvd7:1;
>>> + unsigned int serverUseDay:14;
>>> ViceLock lock; /* Advisory lock */
>>> Date serverModifyTime; /* Used only by the server; for incremental
>>> * backup purposes */
>>>
>>> So what's required is either to patch your 1.6 to be special, or
>>> migrate the volumes using vos move instead
>>> of relying on the data format on disk being the same (which it isn't)
>>>
>>>
>>> On Wed, Sep 19, 2012 at 11:16 AM, Andrew Deason <adeason@sinenomine.net> wrote:
>>>> On Wed, 19 Sep 2012 15:37:39 +0200
>>>> Jakub Moscicki <jakub.moscicki@cern.ch> wrote:
>>>>
>>>>> I just tried to deploy 1.6.1a on linux, migrating from 1.4 server. I
>>>>> compiled from tar.gz sources and copied executables to /usr/afs/bin
>>>>>
>>>>> This operation has put all my volumes offline with the following
>>>>> FileLog entries:
>>>>>
>>>>> Wed Sep 19 13:54:48 2012 GetBitmap: addled vnode index in volume
>>>>> q.afs.st.afs211.fb.1; volume needs salvage
>>>> Doesn't CERN use some modified backend code that changes the vnode magic
>>>> for files? I thought I heard this was done for rxosd (or it's
>>>> predecessor) or something. A modified on-disk format is not going to
>>>> work with vanilla OpenAFS, and as far as I know that's exactly the error
>>>> you would get doing it.
>>>>
>>>> If I'm way off, then nevermind, but... that's not a "normal" error to
>>>> get. Unfortunately we don't print out what the 'bad' magic was, but if
>>>> you copy the files in one of the 'special' directories in /vicepfb (just
>>>> find one of the directories named 'special') and make them available,
>>>> one of us could see what it is.
>>>>
>>>> --
>>>> Andrew Deason
>>>> adeason@sinenomine.net
>>>>
>>>> _______________________________________________
>>>> OpenAFS-info mailing list
>>>> OpenAFS-info@openafs.org
>>>> https://lists.openafs.org/mailman/listinfo/openafs-info
>>>>
>>>
>>>
>>> --
>>> Derrick
>>> _______________________________________________
>>> OpenAFS-info mailing list
>>> OpenAFS-info@openafs.org
>>> https://lists.openafs.org/mailman/listinfo/openafs-info
>
>