[OpenAFS] help, salvaged volume won't come back online, is it
corrupt? [trimmed log]
John Koyle
jkoyle@koyle.org
Tue, 12 Sep 2006 22:36:08 -0600
Derrick J Brashear wrote:
> the 1.4.2-fc3 salvager is more likely to be helpful.
That's what I started with today and it didn't help. I only moved back
to 1.4.1 this evening after having the problems I posted earlier today.
I can now create and move volumes fine, but I am left with this volume
that's in a bad state.
John
>
> On Tue, 12 Sep 2006, John Koyle wrote:
>
>> Adam Megacz wrote:
>>> Well, this is it, the day I'd always feared...
>>>
>>> Server is running openafs-1.4.1. I did a "vos remove" on
>>> root.cell.backup and suddenly couldn't access root.cell. Shut down
>>> the fileserver, started it back up, and not it can't attach:
>>>
>>> $ vos examine 536870912
>>> **** Could not attach volume 536870912 ****
>>>
>>> RWrite: 536870912 Backup: 536870914 number of sites
>>> -> 1
>>> server research.CS.Berkeley.EDU partition /vicepa RW Site
>>> Everything for this cell is in one 3.3gb volume (root.cell); I hadn't
>>> yet gotten around to splitting it out into separate volumes.
>>>
>>> Help! Is there any way to get 'bos salvage' to make the volume
>>> attachable? Or in any other way recover the files or some subset
>>> thereof?
>>>
>>> The good news is that 'du -chs' of /vicepa gives exactly the right
>>> size (to within a few meg), so maybe I have reason to hope that my
>>> data is still there somewhere...
>>>
>>> SalvageLog (below) looks like it's stopping at /atmelx and getting
>>> upset. I'm quite happy to nuke that directory since it hasn't changed
>>> since the last dump. But other very, very important stuff has changed.
>>>
>>> Help!
>>>
>>> - a
>>>
>>>
>>> ______________________________________________________________________________
>>>
>>> SalvageLog:
>>>
>>> @(#) OpenAFS 1.4.1 built 2006-06-18 09/12/2006 20:27:15 STARTING
>>> AFS SALVAGER 2.4 (/usr/lib/openafs/salvager /vicepa 536870912)
>>> 09/12/2006 20:27:15 Found 0 link count file
>>> /vicepa/AFSIDat/+/++++U/+/+/1++++I=.
>>> 09/12/2006 20:27:15 Found 0 link count file
>>> /vicepa/AFSIDat/+/++++U/+/+/2++++gV3.
>>> 09/12/2006 20:27:15 Found 0 link count file
>>> /vicepa/AFSIDat/+/++++U/+/+/3++++kC4.
>>> 09/12/2006 20:27:15 Found 0 link count file
>>> /vicepa/AFSIDat/+/++++U/+/+/5++++oI.
>>> 09/12/2006 20:27:15 Found 0 link count file
>>> /vicepa/AFSIDat/+/++++U/+/+/7++++sI.
>>> 09/12/2006 20:27:15 Found 0 link count file
>>> /vicepa/AFSIDat/+/++++U/+/+/9++++wI.
>>> 09/12/2006 20:27:15 Found 0 link count file
>>> /vicepa/AFSIDat/+/++++U/+/+/B+++++J.
>>> 09/12/2006 20:27:15 Found 0 link count file
>>> /vicepa/AFSIDat/+/++++U/+/+/D++++2J.
>>> 09/12/2006 20:27:15 Found 0 link count file
>>> /vicepa/AFSIDat/+/++++U/+/+/F++++6J.
>>> 09/12/2006 20:27:15 Found 0 link count file
>>> /vicepa/AFSIDat/+/++++U/+/+/0++++AJ.
>>> 09/12/2006 20:27:15 Found 0 link count file
>>> /vicepa/AFSIDat/+/++++U/+/+/4++++EJ.
>>> ...
>>> 09/12/2006 20:27:15 Found 0 link count file
>>> /vicepa/AFSIDat/+/++++U/+/+/n5++6280.
>>> 09/12/2006 20:27:15 Found 0 link count file
>>> /vicepa/AFSIDat/+/++++U/+/+/3+++6kC4.
>>> 09/12/2006 20:27:15 Found 0 link count file
>>> /vicepa/AFSIDat/+/++++U/+/+/=+++A2.
>>> 09/12/2006 20:27:16 1 nVolumesInInodeFile 28 09/12/2006 20:27:16
>>> SALVAGING VOLUME 536870912.
>>> 09/12/2006 20:27:16 root.cell (536870912) updated 09/12/2006 20:18
>>> 09/12/2006 20:27:16 totalInodes 9646
>>> 09/12/2006 20:27:16 iinc failed. inode 2632814952956 errno 9
>>> 09/12/2006 20:27:16 iinc failed. inode 9552007266813 errno 9
>>> ...
>>> 09/12/2006 20:27:17 iinc failed. inode 102744207679954 errno 9
>>> 09/12/2006 20:27:17 iinc failed. inode 102748502647252 errno 9
>>> 09/12/2006 20:27:17 iinc failed. inode 102752797614550 errno 9
>>> 09/12/2006 20:27:17 iinc failed. inode 102757092581848 errno 9
>>> 09/12/2006 20:27:17 iinc failed. inode 102761387549146 errno 9
>>> 09/12/2006 20:27:17 dir vnode 499: ??/atmelx (vnode 367): unique
>>> changed from 6884 to 0 -- deleted
>>>
>> I had a problem moving some volumes (posted earlier today). This is
>> the exact type of errors I'm getting in the salvage log for a users
>> home directory volume. I've spent the last 6 hours trying to recover
>> it to no avail. I would suggest a couple of things. First do a vos
>> dump so you can at least get back to the same state you're in.
>> Second, don't continue to run salvager - it only seemed to get my
>> volume in a worse state to the point where it's now saying:
>>
>> vol-is-afs:/usr/afs/logs# tail -f SalvageLog
>> @(#) OpenAFS 1.4.1 built 2006-09-12
>> 09/13/2006 02:53:46 STARTING AFS SALVAGER 2.4 (/usr/afs/bin/salvager
>> /vicepa 536870984 -orphans attach)
>> 09/13/2006 02:53:47 1 nVolumesInInodeFile 28
>> 09/13/2006 02:53:47 SALVAGING VOLUME 536870984.
>> 09/13/2006 02:53:47 home.user (536870984) updated 09/13/2006 01:08
>> 09/13/2006 02:53:47 totalInodes 57696
>> 09/13/2006 02:53:48 Cannot attach orphaned files and directories:
>> Root directory not found
>> 09/13/2006 02:53:48 Found 57692 orphaned files and directories
>> (approx. 1770169 KB)
>> 09/13/2006 02:53:48 Salvaged home.user (536870984): 57692 files,
>> 1770169 blocks
>>
>> Doing some searching on the Root directory not found error yields
>> non-promising results. Any help recovering volumes from either state
>> would be appreciated by myself as well.
>>
>> John
>>
>> _______________________________________________
>> OpenAFS-info mailing list
>> OpenAFS-info@openafs.org
>> https://lists.openafs.org/mailman/listinfo/openafs-info
>>
> _______________________________________________
> OpenAFS-info mailing list
> OpenAFS-info@openafs.org
> https://lists.openafs.org/mailman/listinfo/openafs-info