[OpenAFS-devel] RX Performance
Rainer Toebbicke
rtb@pclella.cern.ch
Thu, 19 Jun 2003 12:08:45 +0200
We recently saw disappointing AFS performance on fast Linux machines on
gigabit Ethernet compared to e.g. NFS, despite tricks like bypassing the AFS
cache etc. so I undertook to measure a simple RX application which just
creates a single call and issues a jumbo rx_Write on it, i.e. memory-to-memory.
The result was disappointing: best I could get was around 45 MB/s transfer
rate on two 1 GHz dual CPU machines. As a comparison, a very simple minded UDP
'flooder' saturates at around 97 MB/s between those machines. The CPU load
during the RX test is slightly over 50% well distributed over both CPU's (why
on both?). Repeating the test on two 2GHz XEON machines did not show any
improvement. Conclusion: the protocol introduces waits.
Are the default values in RX for window size, ack rate etc. still adequate for
today's networks? I tried to play around a bit: increased # of pcks/jumbogram,
receive/transmit window size etc. (every time bluntly changing rx_globals.h
and rebuilding libafsrpc.a). Nothing really happened. The only improvement we
got was cranking up the MTU to around 9K (on a 10GB card that supported it).
Any ideas?
--
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
Rainer Toebbicke http://cern.ch/~rtb rtb@mail.cern.ch O__
European Laboratory for Particle Physics(CERN) - Geneva, Switzerland > |
Phone: +41 22 767 8985 Fax: +41 22 767 7155 ( )\( )