[OpenAFS-devel] Retry transaction creates on transient problems

Tom Keiser tkeiser@sinenomine.net
Mon, 27 Apr 2009 06:56:30 -0400


On Mon, Apr 27, 2009 at 5:13 AM, Harald Barth <haba@kth.se> wrote:
>
>> The patch solves a concrete problem, namely daily backups failing on
>> a several-thousand-volume server hence we need it or something
>> equivalent.
>
> We have ~5 vos dump (for backup) per week failing and will solve these
> in the same way if noone steps up with a better patch. So yes, I'd
> like to see that patch in the distro. There are other higher
> prioritized probles than the expected thread tie up. We can easily

This isn't a theoretical issue.  If this goes in the tree, it's going
to be a real problem at several large sites.

> have a higher thread count which is rediciously low per default
> anyway, or can't we?
>

Raising thread count is not a solution.  More to the point, this patch
introduces a trivially-exploitable (by rxnull conns, no less) thread
DoS against the volume server, and that's just not acceptable.

The solution to this problem needs to happen client-side, period.

-Tom