[OpenAFS-devel] vos move takes ages (almost)
Harald Barth
haba@pdc.kth.se
Tue, 21 Feb 2006 10:10:39 +0100 (MET)
> You didn't say which AFS that is.
Sorry, vos move from 1.3.85 on FC3 xfs to 1.4.1rc7 on Centos xfs.
> The modern, multi-threaded RX has a "pacing" bug where every now and
> then one of the parties waits for roughly 0.3 seconds for a missing
> acknowledgement. The bug must be there in the lwp-version as well but
> somehow it doesn't slow things down enough. I mention this as last
> year I ran into this RX-tracing a memory-to-memory RX application and
> bypassed the problem by completely and horribly changing the ACK
> algorithm, just to show that >110MB/s is possible using RX.
Ok, "horribly" is the word then. Pointer?
> On the volume with the 70k files you suffer mainly from another
> effect: the first clone increments all the link counts and fsync()s
> the link count file for every one of them. Depending on your RAID you
> won't do many more than 100-200 per second of those. After the move,
> it decrements all the link counts twice (for the clone and the
> original) fsyncing all the time, again you can calculate how long this
> will take.
Maybe my RAID (when it was new, it was a RAED) is really bad when
fsyncing under load. It was not that bad when otherwise idle
(bonnie++), but of course this is not an artificial load.
> In the inode fileserver there is not much you can do about that, for
> the namei fileserver I've got a patch which a few people are already
> running that batches the fsyncs - speedup factor >200 on a volume with
> 1 million 30-bytes files
mmm, users with metadata > data
> that used to take almost 24 hours to move.
> Although we're using it on several servers now I still consider it
> experimental - let me know if you're interested.
Sure, I'm running 1.4.1rc*, so can it get any worse? ;-)
Harald.