[OpenAFS] Re: Afs User volume servers in VM's

Wed, 26 Oct 2011 12:26:24 -0400

On 10/26/2011 11:26 AM, Stephan Wiesand wrote:
> Good point. This problem will vanish with IPv6, though.
>
> Containers could be another solution.
>
> Running multiple fileservers on different ports on the same system
> would be even more efficient. Is this possible or could be implemented
> (in theory)?

YFS has already contributed much of the work necessary to run multiple
file servers on the same system on different ports.  What is missing in
OpenAFS at the moment are the database changes to track file servers
with port numbers in addition to addresses; the RPCs to tell the clients
about those port numbers; etc.  The requirements have been discussed on
afs3-standardization and at the last couple of AFS hackathons.  However,
there is no agreed-upon set of RPCs for the implementation.
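
To make the database side concrete, here is a minimal sketch of what
tracking file servers by address and port (rather than address alone)
could look like.  It is an illustration only, not the OpenAFS or YFS
schema: the names Endpoint, ServerRegistry, register and owner_of are
hypothetical, and the only fact taken from AFS itself is that 7000 is
the registered file server port, used here as the default.

```python
from dataclasses import dataclass, field

DEFAULT_FILESERVER_PORT = 7000  # registered afs3-fileserver port


@dataclass(frozen=True)
class Endpoint:
    """One (address, port) pair on which a file server listens."""
    address: str
    port: int = DEFAULT_FILESERVER_PORT


@dataclass
class ServerRegistry:
    """Hypothetical location-database table: server UUID -> endpoints."""
    entries: dict = field(default_factory=dict)

    def register(self, uuid, address, port=DEFAULT_FILESERVER_PORT):
        """Record an endpoint for a server, rejecting conflicting owners."""
        endpoint = Endpoint(address, port)
        owner = self.owner_of(endpoint)
        if owner is not None and owner != uuid:
            raise ValueError(f"{address}:{port} already registered to {owner}")
        self.entries.setdefault(uuid, set()).add(endpoint)
        return endpoint

    def owner_of(self, endpoint):
        """Return the UUID of the server that registered this endpoint."""
        for uuid, endpoints in self.entries.items():
            if endpoint in endpoints:
                return uuid
        return None
```

With the port as part of the key, two file servers can share one
address and still be told apart; a registry keyed on the address alone
could not distinguish them.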

>> This also doesn't help you much if the server is getting bogged down due
>> to the I/O in servicing the relevant requests, unless you separate the
>> user volumes physically.
>
> A single 1.4 fileserver is not able to make a decent contemporary
> fileserver sweat. I don't have much data on 1.6 servers.

The 1.6 file servers are not significantly different.  The bottlenecks
throughout the file server stack prevent the file server from making
full use of either the available CPU processing or the I/O bandwidth.

A substantial re-architecture of the file server and the vol package is
needed in order to obtain the desired throughput.

>> But for the simple (and presumably common) case
>> of running out of fileserver threads or the fileserver not being
>> mp-scalable, sure.
>
> We're suffering from the same problem as SLAC, and are working around
> it by keeping home directories small enough to make them unusable for
> use from the compute farm and providing extra volumes on fileservers
> dedicated to workgroups. User education tends to be more effective if
> careless use penalizes familiar co-workers only ;-)
>
> What would be a great feature to have is a way to keep the server from
> using more than, say, half of the available threads for a single
> volume. Would this be feasible to implement at all?

Unfortunately, by the time the file server has queued the request onto
a worker thread, and the RPC has been evaluated and categorized so that
volume- or partition-specific limits can be applied, the file server
might as well just answer the request.  80% of the work has already been
done.

The current file server architecture is a one-to-one worker thread per
request model.  Even if the file server were going to delay processing
of a particular request, the desirable approach would be to re-queue the
request with additional tag information, so that a worker thread
completing a request on the limited resource can pick it up while other
workers remain free to service unrelated requests.  Issuing RPCs to the
client and forcing the client to retry (as is done with VBUSY responses)
will increase the overall load on the file server when there are no
replicas on alternate file servers for the client to fail over to.
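
The re-queue idea can be modelled roughly as follows.  This is a toy,
single-threaded sketch and not file server code: TaggedDispatcher and
its methods are hypothetical names, the volume ID stands in for the tag,
and it simply assumes the tag is already known when the request is
queued, which, as noted above, is the expensive part in practice.

```python
from collections import defaultdict, deque


class TaggedDispatcher:
    """Toy model of re-queueing requests by volume tag (not OpenAFS code)."""

    def __init__(self, per_volume_limit):
        self.per_volume_limit = per_volume_limit
        self.ready = deque()               # requests any idle worker may take
        self.parked = defaultdict(deque)   # volume id -> deferred requests
        self.active = defaultdict(int)     # volume id -> requests in progress

    def submit(self, volume, request):
        """Queue a request; its volume id is the tag used for limiting."""
        self.ready.append((volume, request))

    def next_for_idle_worker(self):
        """Give an idle worker the first request whose volume is under limit."""
        while self.ready:
            volume, request = self.ready.popleft()
            if self.active[volume] < self.per_volume_limit:
                self.active[volume] += 1
                return volume, request
            # Volume is saturated: park the request under its tag instead of
            # tying up this worker or bouncing the request back to the client.
            self.parked[volume].append(request)
        return None

    def complete(self, volume):
        """Called by a worker that has finished a request on `volume`.

        Returns the next parked request for the same volume, if any, so the
        same worker carries on with it; otherwise the worker becomes idle.
        """
        if self.parked[volume]:
            return volume, self.parked[volume].popleft()
        self.active[volume] -= 1
        return None


d = TaggedDispatcher(per_volume_limit=1)
for vol, req in [("user.a", "r1"), ("user.a", "r2"), ("user.b", "r3")]:
    d.submit(vol, req)
first = d.next_for_idle_worker()   # takes r1 on user.a
second = d.next_for_idle_worker()  # r2 is parked; this worker gets r3
third = d.complete("user.a")       # the worker finishing r1 picks up r2
```

The point of the model is that the second worker is never blocked behind
the busy volume, and the parked request costs the client nothing extra,
unlike a VBUSY-style retry.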

Jeffrey Altman
