[OpenAFS] Problems in the last 2 days

Derrick J Brashear shadow@dementia.org
Wed, 29 Jan 2003 12:12:47 -0500 (EST)


On Wed, 29 Jan 2003, Klaas Hagemann wrote:

> First I used the pthread fileserver. There the fileserver processes 
> simply stopped working from time to time, so that the volumes were no 
> longer reachable.
> The volserver kept working, so that "vos examine >volume<" returned 
> successfully. The system could not be shut down any more and the 
> processes had to be killed by hand.

The main fileserver pthread died, leaving the others orphaned.

> Then I switched over to the PWD fileserver. There the fileserver process 

LWP

> itself works fine, but after 5-6 hours the kernel was not able to 
> allocate any more memory. So the whole system crashed and had to be 
> rebooted with the reset button.

It's going to be the same problem: the fileserver is malloc'ing itself to
death. Set a resource limit before starting the fileserver, and get a core
from the LWP fileserver.
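
A minimal sketch of that advice, assuming a small wrapper that sets the
limits and then execs the server. The names clamp_limit, apply_limits and
exec_with_limits are hypothetical, and RLIMIT_AS is an assumption; on some
platforms RLIMIT_DATA may be the more appropriate limit.

```python
import os
import resource


def clamp_limit(wanted, hard):
    """Return the largest value <= wanted that the hard limit permits."""
    if hard == resource.RLIM_INFINITY:
        return wanted
    return min(wanted, hard)


def apply_limits(max_bytes):
    """Cap address space and enable core dumps for this process and its children."""
    # Cap the address space so a runaway malloc fails inside the process
    # instead of exhausting the machine's memory. RLIMIT_AS is an assumption.
    _, as_hard = resource.getrlimit(resource.RLIMIT_AS)
    cap = clamp_limit(max_bytes, as_hard)
    resource.setrlimit(resource.RLIMIT_AS, (cap, cap))

    # Raise the soft core-size limit to the hard limit so a core can be written.
    _, core_hard = resource.getrlimit(resource.RLIMIT_CORE)
    resource.setrlimit(resource.RLIMIT_CORE, (core_hard, core_hard))


def exec_with_limits(argv, max_bytes):
    """Apply the limits, then replace this process with the command in argv."""
    apply_limits(max_bytes)
    os.execvp(argv[0], argv)
```

Something like exec_with_limits(["/path/to/server"], 512 * 1024 * 1024) would
then start the command under a 512 MB cap; both the path and the size are
placeholders. Limits are inherited by child processes, so if the fileserver is
launched by another process (such as bosserver), the limits probably need to be
in effect for that parent.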

> Another error was that the volserver stopped working but the fileserver 
> was still running. So "vos examine >volume<" returned a failure but 
> the volume was still reachable.

Did you get a core?