[OpenAFS] Rotating log files and server probs

Mitchell D. Baker Mitchell.D.Baker@rose-hulman.edu
30 Sep 2002 09:39:40 -0500


Yes, this showed up again Sunday morning and the log does show this
message.  Looks like we got the message several times before things
locked up, the system went for a little while longer and then things
came to a screeching halt.

I have downloaded 1.2.7 and am looking at it now.  From the changelog it
looks like there is ref to this problem.  Is this correct?  Should
upgrading to 1.2.7 help the problem? or is there another patch needed to
get around this that has not gone into the main distro yet.

Thanks

See-ya
Mitch


On Mon, 2002-09-30 at 00:37, Thomas Mueller wrote:
> On Fri, 27 Sep 2002, Derrick J Brashear wrote:
> 
> > On 27 Sep 2002, Mitchell D. Baker wrote:
> > 
> > 
> > > Seems that a arbitrary times during the day.. the fileserver process on
> > > on one of the servers (same server each time) jumps to 95%+ CPU and the
> > > load on the system starts to rise.. If we catch it in time, we MAY be
> > > able to issue a bos restart and calm things down.. and the system will
> > > run for a while longer.. if we don't get to it in time then all of the
> > > AFS cell locks up and we have to just reboot one or more servers... 
> > > 
> > > Anyone got a clue as to why this would be happening? Where to look for
> > > the prob?  
> > 
> > Perhaps one of the people whose servers were running out of callbacks
> > could tell us if the "looping trying to free a callback slot" behaves like
> > this. There's a fix in 1.2.7 but the right answer is to fix
> > viced/callback.c to not use u_short for storage of the callback index. No
> > one has gotten to it yet.
> 
> You can easily figure out if you have this callback problem.
> If your fileserver starts to consume all CPU cycles you may raise the
> debug level by 
> 
> kill -TSTP <pid_of_fileserver>
> 
> (perhaps you should repeat this two or three times)
> 
> After that you will see thousands of messages like
> 
> ... Delete longest inactive host ...
> 
> in your /usr/afs/logs/FileLog.
> If you don't see such messages you must have a different problem.
> 
> Thomas.
> -- 
> -------------------------------------------------
> Thomas Müller, TU Chemnitz, URZ, D-09107 Chemnitz
> Tel: +49 (0)371 5311755   Fax: +49 (0)371 5311629
> -------------------------------------------------
> 
> _______________________________________________
> OpenAFS-info mailing list
> OpenAFS-info@openafs.org
> https://lists.openafs.org/mailman/listinfo/openafs-info
-- 
/####################################################################/
/# Mitchell "Buzz" Baker               "To Infinity And Beyond..."  #/
/# Sr. Systems/Security Admin  Rose-Hulman Institute of Technology  #/     
/# Mitchell.D.Baker@rose-hulman.edu            www.rose-hulman.edu  #/
/#        For PGP Public key, check out www.keyserver.net           #/
/####################################################################/