[OpenAFS-devel] OpenAFS - "Lost contact with file server"

Dylan Vaughn dylan@dcvaughn.com
Thu, 01 Jul 2004 13:40:57 -0700


Hello - 

I'm trying to set up OpenAFS on my laptop running Debian Sarge with the
following packages:

libpam-openafs-session 1.0-5
openafs-client 1.2.11-1
openafs-dbserver 1.2.11-1
openafs-filesever 1.2.11-1
openafs-krb5   1.3-10
openafs-modules-2.4.26-200406181 (compiled from the
openafs-modules-source package with kernel 2.4.26)

Everything seemed to be working - I set up a new cell and had it
integrated with Kerberos, and was using it for my home directory
(/home/dylan was a link to /afs/dcvaughn.com/user/dylan)

However, I started to get a lot of "Lost contact with file server"
errors.  When I logged in using gdm (set up with pam to auth with
Kerberos then get an afs Kerberos ticket), I would be able to browse
files in my home directory, but any application that needed to access
files in my home directory would crash, for example evolution.  I never
got any errors (that I could find) except for the "Lost contact with
file server".  The .xsessionerrors file reported that the apps were
having "Connection timed out" errors trying to access files in my home
directory (which I could see fine from the terminal).  If I did: fs
checkservers then the logs would say that the server was back up, but I
was still having the problems with evolution, mozilla, etc.  Sometimes
it would work fine, but more and more it seemed to be losing the
connection.  In trying to move some data out of afs, I got the error
pasted below in my syslog (this was the only time I got the kernel
debugging info - normally it would only say "Lost contact", then "file
server back up" a little later...).  I'm below my quota on my user
volume and the cache seems to have plenty of room still left. 

Does anyone have any suggestions as to what to try or how to debug this
problem?  

Thanks, 

Dylan Vaughn

Jul  1 13:08:59 sunbeam kernel: afs: Lost contact with file server
192.168.0.25 in cell dcvaughn.com (all multi-homed ip addresses down for
the server)
Jul  1 13:08:59 sunbeam kernel: afs: Lost contact with file server
192.168.0.25 in cell dcvaughn.com (all multi-homed ip addresses down for
the server)
Jul  1 13:11:52 sunbeam kernel: afs: file server 192.168.0.25 in cell
dcvaughn.com is back up (multi-homed address; other same-host interfaces
may still be down)
Jul  1 13:11:52 sunbeam kernel: afs: file server 192.168.0.25 in cell
dcvaughn.com is back up (multi-homed address; other same-host interfaces
may still be down)
Jul  1 13:15:23 sunbeam kernel: rxi_AllocPacket error<1>Unable to handle
kernel paging request at virtual address ffffffff
Jul  1 13:15:23 sunbeam kernel:  printing eip:
Jul  1 13:15:23 sunbeam kernel: e09ce1c0
Jul  1 13:15:23 sunbeam kernel: *pde = 00002063
Jul  1 13:15:23 sunbeam kernel: *pte = 00000000
Jul  1 13:15:23 sunbeam kernel: Oops: 0002
Jul  1 13:15:23 sunbeam kernel: CPU:    0
Jul  1 13:15:23 sunbeam kernel: EIP:    0010:[<e09ce1c0>]    Tainted: PF
Jul  1 13:15:23 sunbeam kernel: EFLAGS: 00010282
Jul  1 13:15:23 sunbeam kernel: eax: 00000015   ebx: e09fdf9c   ecx:
00000000   edx: 00000000
Jul  1 13:15:23 sunbeam kernel: esi: d2eeba80   edi: 00000027   ebp:
0000071a   esp: d9595e44
Jul  1 13:15:23 sunbeam kernel: ds: 0018   es: 0018   ss: 0018
Jul  1 13:15:23 sunbeam kernel: Process afs_rxlistener (pid: 989,
stackpage=d9595000)
Jul  1 13:15:23 sunbeam kernel: Stack: e09ece90 d9594000 d9595f80
e09cdf7c 581b0002 1900a8c0 00000246 e09cf3ef
Jul  1 13:15:23 sunbeam kernel:        e09ece90 d9594000 d9595f80
e09cdf7c 00000000 00000000 0000071a e09cf46d
Jul  1 13:15:23 sunbeam kernel:        00000002 00000000 e0a2f674
0000000c e0a2f0f4 0000058c 00000588 e09c9284
Jul  1 13:15:23 sunbeam kernel: Call Trace:    [<e09ece90>] [<e09cdf7c>]
[<e09cf3ef>] [<e09ece90>] [<e09cdf7c>]
Jul  1 13:15:23 sunbeam kernel:   [<e09cf46d>] [<e09c9284>]
[__wake_up+63/124] [<e09c7200>] [<e09c69b0>] [<e09ed2e6>]
Jul  1 13:15:23 sunbeam kernel:   [<e09ed2e6>] [<e09ceaab>] [<e09ed2e6>]
[<e09ed2cb>] [<e09dc42b>] [<e09ed2d8>]
Jul  1 13:15:23 sunbeam kernel:   [arch_kernel_thread+35/45]
[<e09dc0e0>]
Jul  1 13:15:23 sunbeam kernel:
Jul  1 13:15:23 sunbeam kernel: Code: c6 05 ff ff ff ff 2a 83 c4 1c c3
90 8d 74 26 00 b8 7e ce 9e
Jul  1 13:17:01 sunbeam kernel:  afs: Lost contact with file server
192.168.0.25 in cell dcvaughn.com (all multi-homed ip addresses down for
the server)
Jul  1 13:17:01 sunbeam kernel: afs: Lost contact with file server
192.168.0.25 in cell dcvaughn.com (all multi-homed ip addresses down for
the server)