[OpenAFS] OopenAFS 1.2.13: ever increasing number of fileserver connections - h_Hold leak
Renata Maria Dart
Renata Maria Dart <renata@slac.stanford.edu>
Thu, 17 Feb 2005 08:51:26 -0800 (PST)
Hi, I applied Rainer's fix to a lightly loaded solaris 9 fileserver and I am
still seeing entries in the FileLog that look like:
Thu Feb 17 08:35:02 2005 FindClient: client ea7e58(7982e684) already had conn
eaa790 (host 808a850c), stolen by client ea7e58(7982e684)
Thu Feb 17 08:40:02 2005 FindClient: client e7a550(7982e6b8) already had conn
eaa930 (host 808a850c), stolen by client e7a550(7982e6b8)
Thu Feb 17 08:45:02 2005 FindClient: client ea69c8(7982e6ec) already had conn
ea81a0 (host 808a850c), stolen by client ea69c8(7982e6ec)
Is this an indication that the problem still exists and if I load
the fileserver up I will again see the ever increasing host/client connections
problem?
Thanks,
Renata
>X-Filter: CERNMX03 SMTPGW CERN Spam Sink v1.0
>From: Rainer Toebbicke <rtb@pclella.cern.ch>
>To: openafs-info@openafs.org
>Cc: openafs-devel@openafs.org
>Subject: Re: [OpenAFS] OopenAFS 1.2.13: ever increasing number of fileserver
connections - h_Hold leak
>X-OriginalArrivalTime: 07 Jan 2005 11:17:16.0546 (UTC)
FILETIME=[71E1D620:01C4F4AA]
>X-Virus-Status: Clean
>X-Greylist: Delayed for 00:12:36 by milter-greylist-1.6rc1
(meredith.dementia.org [128.2.120.216]); Fri, 07 Jan 2005 06:30:13 -0500 (EST)
>List-Post: <mailto:openafs-info@openafs.org>
>List-Subscribe: <https://lists.openafs.org/mailman/listinfo/openafs-info>,
<mailto:openafs-info-request@openafs.org?subject=subscribe>
>List-Unsubscribe: <https://lists.openafs.org/mailman/listinfo/openafs-info>,
<mailto:openafs-info-request@openafs.org?subject=unsubscribe>
>List-Archive: <https://lists.openafs.org/pipermail/openafs-info/>
>Date: Fri, 07 Jan 2005 12:17:15 +0100
>
>Rainer Toebbicke wrote:
>> Rainer Toebbicke wrote:
>>
>>> Something's wrong with OpenAFS 1.2.13 as on several busy servers we
>>> see an ever-increasing number of host/client connections.
>>>
>
>The attached patch (against 1.2.13, but modulo line numbers also for
>1.3.73 and later) fixes a h_Hold table leak in GetSomeSpace_r:
>
>lih_r now leaves the current lih_host held, and h_Releases those for
>which it changed mind during h_Enumerate. It also closes the window that
>ClearHostCallbacks_r opens.
>
>The problem was: due to the leak on the h_Hold table, rx connections and
>host and client structures stopped being garbage collected once the
>fileserver went through GetSomeSpace_r. Only relevant for "busy"
>servers, many would never even invoke this routine. The problem did not
>appear before 1.2.11 as the delta that caused this was not yet in.
>
>Tested on two servers up to now.
>
>--
>=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
>Rainer Toebbicke
>European Laboratory for Particle Physics(CERN) - Geneva, Switzerland
>Phone: +41 22 767 8985 Fax: +41 22 767 7155
Renata Dart | renata@SLAC.Stanford.edu
Stanford Linear Accelerator Center |
2575 Sand Hill Road, MS 97 | (650) 926-2848 (office)
Stanford, California 94025 | (650) 926-3329 (fax)