[OpenAFS] Re: guide to debugging nat issues?

Adam Megacz megacz@cs.berkeley.edu
Thu, 14 Jun 2007 14:41:59 -0700


Jeffrey Altman <jaltman@secure-endpoints.com> writes:
> There is a behavior modification we made to the Windows client that has
> not made it into a release UNIX client where a request that ends in a
> timeout is always retried once using a new RX connection.  We only mark
> the server down if the request on the new RX connection fails.

Hrm, a Windows VM (Parallels) on the same machine, running 1.5.20,
exhibits similar behavior.  Here's what FileLog says.  The long pause
occurs right around the WhoAreYou (I can't remember whether it was
just before or just after).  The client gets "file not found" on about
half the files in the directory.

  - a

Thu Jun 14 14:39:29 2007 [12] FetchData_RXStyle: Pos 0, Len 999999999
Thu Jun 14 14:39:29 2007 [12] FetchData_RXStyle: file size 2048
Thu Jun 14 14:39:29 2007 [12] SRXAFS_FetchData returns 0
Thu Jun 14 14:39:29 2007 [14] SAFS_FetchStatus,  Fid = 536871096.4900.4545, Host 128.32.37.206:7001, Id 1
Thu Jun 14 14:39:29 2007 [14] SAFS_FetchStatus returns 0
Thu Jun 14 14:39:29 2007 [13] SRXAFS_FetchData, Fid = 536871096.4900.4545
Thu Jun 14 14:39:29 2007 [13] SRXAFS_FetchData, Fid = 536871096.4900.4545, Host 128.32.37.206:7001, Id 1
Thu Jun 14 14:39:29 2007 [13] FetchData_RXStyle: Pos 0, Len 1619
Thu Jun 14 14:39:29 2007 [13] FetchData_RXStyle: file size 1619
Thu Jun 14 14:39:29 2007 [13] SRXAFS_FetchData returns 0
Thu Jun 14 14:39:57 2007 [3] FindClient: authenticating connection: authClass=2
Thu Jun 14 14:39:57 2007 [3] FindClient: rxkad conn: name=megacz,inst=,cell=MEGACZ.COM,exp=1181892778,kvno=256
Thu Jun 14 14:39:57 2007 [3] MapName: cell is foreign.  cell=MEGACZ.COM, localcell=research.cs.berkeley.edu, localrealm=research.cs.berkeley.edu
Thu Jun 14 14:39:57 2007 [3] WhoAreYou success on 204.153.12.52:19460
Thu Jun 14 14:40:04 2007 [3] CB: ProbeUuid for 204.153.12.52:18893 failed -01
Thu Jun 14 14:40:11 2007 [3] CB: new addr 204.153.12.52:19460 for old host 204.153.12.52:18893
Thu Jun 14 14:40:11 2007 [3] DV: no call backs
Thu Jun 14 14:40:11 2007 [3] SAFS_SetLock type = LockRead Fid = 536871096.10244.9385, Host 204.153.12.52:19460, Id 130865
Thu Jun 14 14:40:11 2007 [3] CheckRights: len=0, for host=204.153.12.52:19460
Thu Jun 14 14:40:11 2007 [3] SAFS_SetLock returns 0
Thu Jun 14 14:40:11 2007 [4] SAFS_ReleaseLock Fid = 536871096.10244.9385, Host 204.153.12.52:19460, Id 130865
Thu Jun 14 14:40:11 2007 [4] CheckRights: len=0, for host=204.153.12.52:19460
Thu Jun 14 14:40:11 2007 [4] BCB: BreakCallBack(all but 204.153.12.52:19460, (536871096,10244,9385))
Thu Jun 14 14:40:11 2007 [4] SAFS_ReleaseLock returns 0
Thu Jun 14 14:40:11 2007 [6] SAFS_SetLock type = LockRead Fid = 536871096.10244.9385, Host 204.153.12.52:19460, Id 130865
Thu Jun 14 14:40:11 2007 [6] CheckRights: len=0, for host=204.153.12.52:19460
Thu Jun 14 14:40:11 2007 [6] SAFS_SetLock returns 0
Thu Jun 14 14:40:11 2007 [5] SAFS_ReleaseLock Fid = 536871096.10244.9385, Host 204.153.12.52:19460, Id 130865
Thu Jun 14 14:40:11 2007 [5] CheckRights: len=0, for host=204.153.12.52:19460
Thu Jun 14 14:40:11 2007 [5] BCB: BreakCallBack(all but 204.153.12.52:19460, (536871096,10244,9385))
Thu Jun 14 14:40:11 2007 [5] SAFS_ReleaseLock returns 0
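
To pin down where the pause falls, one option is to scan the log for
gaps between consecutive timestamps.  Below is a minimal sketch in
Python, assuming every FileLog line starts with a 24-character
timestamp in the format shown above and that the default C/English
locale is in use.  The name find_gaps and the 5-second threshold are
illustrative choices, not part of OpenAFS.

```python
from datetime import datetime

# Assumed timestamp layout, e.g. "Thu Jun 14 14:39:29 2007".
TS_FORMAT = "%a %b %d %H:%M:%S %Y"
TS_LEN = 24  # number of characters in the leading timestamp


def find_gaps(lines, threshold_seconds=5):
    """Return (gap_seconds, previous_line, next_line) for every pair of
    consecutive timestamped lines at least threshold_seconds apart."""
    gaps = []
    prev_ts = None
    prev_line = None
    for raw in lines:
        line = raw.rstrip("\n")
        try:
            ts = datetime.strptime(line[:TS_LEN], TS_FORMAT)
        except ValueError:
            continue  # skip lines that do not begin with a timestamp
        if prev_ts is not None:
            delta = (ts - prev_ts).total_seconds()
            if delta >= threshold_seconds:
                gaps.append((delta, prev_line, line))
        prev_ts, prev_line = ts, line
    return gaps


if __name__ == "__main__":
    import sys

    # Usage (hypothetical path): python find_gaps.py /path/to/FileLog
    with open(sys.argv[1]) as log:
        for seconds, before, after in find_gaps(log):
            print(f"{seconds:.0f}s gap")
            print(f"  before: {before}")
            print(f"  after:  {after}")
```

Run against the excerpt above, this should flag a 28-second gap between
the last "SRXAFS_FetchData returns 0" and the first FindClient line,
followed by two 7-second gaps on either side of the failed ProbeUuid.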

-- 
PGP/GPG: 5C9F F366 C9CF 2145 E770  B1B8 EFB1 462D A146 C380