[OpenAFS] AFS Cache on Parallel File system

Jeffrey Altman jaltman@your-file-system.com
Thu, 07 Jul 2011 08:53:03 -0400


This is an OpenPGP/MIME signed message (RFC 2440 and 3156)
--------------enigE5587B8D14B6DF2638BEAB5A
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Spenser:

The AFS cache cannot be used the way you are thinking.  What you are
looking for is a ceph/gluster to afs gateway which does not exist.

The AFS cache is part of the security boundary that is enforced by the
Cache Manager in the kernel.  As such, it is stored either in kernel
memory or on local disk accessible only to the one kernel.  It is not
designed for shared access.  Pointing multiple AFS cache managers at the
same cache will most likely result in data corruption.

There are two extensions to AFS that are being developed that will help
cluster access to data stores from far away locations

 1. read/write replication which permits a single copy of the data
generated at the slow site to be replicated to file servers near the
cluster.

 2. peer-to-peer cache sharing which permits an AFS cache manager to
securely access data from another cache manager on the same subnet and
avoid retransmitting it across a slow link.

The first option is preferred when it is possible to deploy file servers
in the cluster data center because it doesn't involve adding workload to
client nodes and provides for the possibility of parallel reads.

Jeffrey Altman


On 7/7/2011 4:01 AM, Spenser Gilliland wrote:
> Hello,
>=20
> Can the AFS cache be placed on a parallel file system (IE: ceph or glus=
ter)?
>=20
> If the cache can be placed on a parallel file system,
> When data is read into or written to the cache will all of the other
> nodes in the cluster have access to this cached data for both reading
> and writing?  And will every write block until it is written to the
> AFS cell (IE: is it write back or write-through)?
>=20
> FYI: I'm going to give this a go here in a couple weeks and wanted to
> know if anyone has tried it.
>=20
> The idea is to have an AFS Cell at home (very slow especially upload)
> and a cluster at School which accesses this AFS Cell but only
> downloads a file once for all of the servers in the cluster thereby
> saving time and bandwidth.  Additionally, because the file is now on
> the parallel file system all nodes can access the data concurrently.
> When the program is finished the results will be available in the same
> directory as the program.
>=20
> I'm thinking this could be immensely valuable for grid computing; if it=
 works.
>=20
> Let me know if there is anything I should be looking out for along the =
way.
>=20
> Thanks,
> Spenser
>=20
>=20


--------------enigE5587B8D14B6DF2638BEAB5A
Content-Type: application/pgp-signature; name="signature.asc"
Content-Description: OpenPGP digital signature
Content-Disposition: attachment; filename="signature.asc"

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (MingW32)

iQEcBAEBAgAGBQJOFaw1AAoJENxm1CNJffh4GNkIAJca2ZQwapW96ggi1hprw6vV
NV2sR53JCkbF6Nu4io7xu6Jen3/t17PwfPG5qN15LveRlmCF5nhfi+5X4ejjOHe7
miRFvO/ejLaGIaUwfvKBMDhzlkvGwLdstO2kZQ6D0oqpqyH3LPLndcFLv67N5BGt
cesVJz1nyMA1VNTFGfqmm4PZYYSHpWlKImvfB3+4zWwcKmPH3RoD01JULuEwTI9w
eoJ0qA4F8fKSJgIxJoec0jSXU1GR7cDSh5N2FuU179csDA9fBYU8xf5uYUbDmlfE
nkpISJFJLDMp150XnZV2pD39/PYctlz7qhGlfUNSol/5rLu4BJvzcO44d4r9t0w=
=uZQb
-----END PGP SIGNATURE-----

--------------enigE5587B8D14B6DF2638BEAB5A--