[OpenAFS-devel] File server won't talk to client

Jim Rees rees@umich.edu
Mon, 17 Nov 2003 13:37:27 -0500


Here is some more info.

tcpdump trce:
/afs/umich.edu/user/r/e/rees/pub/fs-wont-talk.tr

server FileLog (grep for client IP):
Fri Nov  7 17:48:52 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:22774
Sat Nov  8 00:13:13 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:25810
Sat Nov  8 09:56:36 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:30150
Sat Nov  8 15:10:47 2003 ProbeUuid failed for host 66.93.1.248:25334
Sat Nov  8 16:04:34 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:4556
Sat Nov  8 17:41:10 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:18912
Sat Nov  8 18:20:02 2003 ProbeUuid failed for host 66.93.1.248:47837
Sat Nov  8 20:25:14 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:25334
Sat Nov  8 23:33:55 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:38879
Sun Nov  9 11:12:52 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:29132
Sun Nov  9 12:27:36 2003 ProbeUuid failed for host 66.93.1.248:18423
Sun Nov  9 13:12:50 2003 ProbeUuid failed for host 66.93.1.248:18423
Sun Nov  9 22:16:24 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:5625
Mon Nov 10 03:07:32 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:61408
Mon Nov 10 09:41:13 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:41209
Mon Nov 10 21:44:15 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:16873
Mon Nov 10 22:51:40 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:5577
Tue Nov 11 07:46:17 2003 ProbeUuid failed for host 66.93.1.248:21978
Tue Nov 11 08:59:27 2003 ProbeUuid failed for host 66.93.1.248:21978
Tue Nov 11 23:02:51 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:32724
Wed Nov 12 03:42:40 2003 ProbeUuid failed for host 66.93.1.248:54987
Wed Nov 12 10:35:49 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:12542
Wed Nov 12 11:35:59 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:19952
Wed Nov 12 23:08:51 2003 ProbeUuid failed for host 66.93.1.248:31742
Thu Nov 13 10:58:05 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:17905
Thu Nov 13 22:03:16 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:16339
Thu Nov 13 23:18:12 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:54477
Fri Nov 14 10:14:43 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:22491
Fri Nov 14 13:00:40 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:2247
Fri Nov 14 14:02:44 2003 ProbeUuid failed for host 66.93.1.248:29183
Fri Nov 14 14:33:10 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:63698
Fri Nov 14 15:16:03 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:65021
Sat Nov 15 15:21:39 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:4087
Sat Nov 15 17:04:23 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:7622
Sat Nov 15 21:27:32 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:25849
Sat Nov 15 22:04:33 2003 ProbeUuid failed for host 66.93.1.248:25304
Sun Nov 16 10:13:19 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:52707
Sun Nov 16 12:01:13 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:22249
Sun Nov 16 17:00:56 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:58865
Sun Nov 16 22:33:00 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:31452
Sun Nov 16 23:49:06 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:37326
Mon Nov 17 00:35:45 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:53238
Mon Nov 17 09:39:48 2003 CB: RCallBackConnectBack (host.c) failed for host 66.93.1.248:27075
--

To: Dan Hyde <drh@umich.edu>
Cc: rees@citi.umich.edu, ifs.support@umich.edu, kwc@citi.umich.edu,
	"Kevin Coffman" <kwcoffman@earthlink.net>
From: Robert King <rak@umich.edu>
Subject: Re: FW: afs21 
In-reply-to: Your message of Sun, 16 Nov 2003 22:28:28 -0500.
             <23657.1069039708@block.ifs.umich.edu> 
Date: Mon, 17 Nov 2003 12:46:04 -0500

The server is running OpenAFS 1.2.10 on Solaris 2.8.

There are 28 errors of the form "ProbeUuid failed for host 
66.93.1.248:<port number>" in FileLog going back to Sept. 18, 
three days after the server was brought back up. The last such 
entry was from Saturday. There are 20063 "ProbeUuid failed" errors in 
the log for 2348 IP addresses. I don't know whether this has anything 
to do with the loss of contact since it has apparently been going
on for some time without any (reported) problems. I found a couple of
articles from the openafs list from Feb. 2003 (subject "fileserver 
threads stuck on solaris servers" suggesting a problem with
OpenAFS 1.2.9 and below on Solaris machines. Another message from 
November 2002 (subject "openAFS over WAN") reported someone using
a NAT having this kind of trouble, but no solution was mentioned.
I'll keep looking.

Bob