[OpenAFS-devel] fileserver crash on Solaris 2.6 with 1.2.7

Martin MOKREJŠ mmokrejs@natur.cuni.cz
Tue, 10 Dec 2002 18:27:59 +0100 (CET)


On 10 Dec 2002, Joakim Fallsjo wrote:

> Martin MOKREJŠ <mmokrejs@natur.cuni.cz> writes:
>
> [...]
>
> > $ bos status -server var400 -long
> > Instance fs, (type is fs) currently running normally.
> >     Auxiliary status is: file server running.
> >     Process last started at Sun Dec  8 18:33:51 2002 (22 proc starts)
> >     Last exit at Sun Dec  8 18:33:51 2002
> >     Last error exit at Sun Dec  8 18:33:51 2002, by vol, by exiting with code 1
> >     Command 1 is '/usr/afs/bin/fileserver'
> >     Command 2 is '/usr/afs/bin/volserver'
> >     Command 3 is '/usr/afs/bin/salvager'
> >
> > Instance ptserver, (type is simple) temporarily disabled, stopped for too many errors, currently shutdown.
> >     Process last started at Sun Dec  8 18:59:42 2002 (151 proc starts)
> >     Last exit at Sun Dec  8 18:59:43 2002
> >     Last error exit at Sun Dec  8 18:59:43 2002, by exiting with code 2
> >     Command 1 is '/usr/afs/bin/ptserver'
> >
> > Instance vlserver, (type is simple) temporarily disabled, stopped for too many errors, currently shutdown.
> >     Process last started at Sun Dec  8 18:59:33 2002 (244 proc starts)
> >     Last exit at Sun Dec  8 18:59:33 2002
> >     Last error exit at Sun Dec  8 18:59:33 2002, by exiting with code 2
> >     Command 1 is '/usr/afs/bin/ptserver'
> >
>                     ^^^^^^^^^^^^^^^^^^^^^
>
> Is this really correct? Shouldn't there be a vlserver running here instead of ptserver?

Thanks. I've deleted the instance and restarted with -all, but ptserver and
vlserver still don't run:

BosLog
Tue Dec 10 18:23:01 2002: fs:vol exited on signal 15
Tue Dec 10 18:23:01 2002: fs:file exited with code 0
Tue Dec 10 18:23:02 2002: ptserver exited with code 2
Tue Dec 10 18:23:02 2002: vlserver exited with code 2
Tue Dec 10 18:23:02 2002: ptserver exited with code 2
Tue Dec 10 18:23:03 2002: ptserver exited with code 2
Tue Dec 10 18:23:03 2002: vlserver exited with code 2
Tue Dec 10 18:23:03 2002: ptserver exited with code 2
Tue Dec 10 18:23:03 2002: vlserver exited with code 2
Tue Dec 10 18:23:04 2002: ptserver exited with code 2
Tue Dec 10 18:23:04 2002: vlserver exited with code 2
Tue Dec 10 18:23:04 2002: ptserver exited with code 2
Tue Dec 10 18:23:04 2002: vlserver exited with code 2
Tue Dec 10 18:23:05 2002: ptserver exited with code 2
Tue Dec 10 18:23:05 2002: vlserver exited with code 2
Tue Dec 10 18:23:05 2002: ptserver exited with code 2
Tue Dec 10 18:23:05 2002: vlserver exited with code 2
Tue Dec 10 18:23:06 2002: vlserver exited with code 2
Tue Dec 10 18:23:06 2002: ptserver exited with code 2
Tue Dec 10 18:23:06 2002: vlserver exited with code 2
Tue Dec 10 18:23:07 2002: ptserver exited with code 2
Tue Dec 10 18:23:07 2002: vlserver exited with code 2
Tue Dec 10 18:23:07 2002: ptserver exited with code 2
Tue Dec 10 18:23:07 2002: vlserver exited with code 2
Tue Dec 10 18:23:08 2002: ptserver exited with code 2
Tue Dec 10 18:23:08 2002: BNODE 'ptserver' repeatedly failed to start, perhaps missing executable.
Tue Dec 10 18:23:08 2002: vlserver exited with code 2
Tue Dec 10 18:23:08 2002: BNODE 'vlserver' repeatedly failed to start, perhaps missing executable.
Tue Dec 10 18:23:08 2002: ptserver exited with code 2
Tue Dec 10 18:23:08 2002: BNODE 'ptserver' repeatedly failed to start, perhaps missing executable.
Tue Dec 10 18:23:08 2002: vlserver exited with code 2
Tue Dec 10 18:23:08 2002: BNODE 'vlserver' repeatedly failed to start, perhaps missing executable.
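
The restart loop above is easier to read as a tally. A minimal sketch in Python, assuming only the two line shapes visible in this BosLog ("<timestamp>: <name> exited with code N / on signal N" and "BNODE '<name>' repeatedly failed to start"); the function name summarize and the SAMPLE excerpt are illustrative and not part of OpenAFS:

```python
import re
from collections import Counter

# A short excerpt copied from the BosLog above; in practice, read the whole file.
SAMPLE = """\
Tue Dec 10 18:23:01 2002: fs:vol exited on signal 15
Tue Dec 10 18:23:01 2002: fs:file exited with code 0
Tue Dec 10 18:23:02 2002: ptserver exited with code 2
Tue Dec 10 18:23:02 2002: vlserver exited with code 2
Tue Dec 10 18:23:02 2002: ptserver exited with code 2
Tue Dec 10 18:23:08 2002: BNODE 'ptserver' repeatedly failed to start, perhaps missing executable.
"""

# "<timestamp>: <name> exited with code N" or "<timestamp>: <name> exited on signal N"
EXIT_RE = re.compile(r"^.+?: (\S+) exited (with code|on signal) (\d+)$")
# "<timestamp>: BNODE '<name>' repeatedly failed to start, ..."
GAVE_UP_RE = re.compile(r"^.+?: BNODE '([^']+)' repeatedly failed to start")


def summarize(log_text):
    """Count exits per (name, kind, value) and collect names bosserver gave up on."""
    exits = Counter()
    gave_up = set()
    for line in log_text.splitlines():
        m = EXIT_RE.match(line)
        if m:
            name, kind, value = m.groups()
            exits[(name, kind, int(value))] += 1
            continue
        m = GAVE_UP_RE.match(line)
        if m:
            gave_up.add(m.group(1))
    return exits, gave_up


if __name__ == "__main__":
    exits, gave_up = summarize(SAMPLE)
    for (name, kind, value), count in sorted(exits.items()):
        print(f"{name}: exited {kind} {value} x{count}")
    print("gave up on:", ", ".join(sorted(gave_up)))
```

Run against the full log, this should show ptserver and vlserver exiting with code 2 over and over within a few seconds, which points at a startup failure rather than a missing binary, whatever the "perhaps missing executable" hint suggests.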


FileLog
Tue Dec 10 18:23:01 2002 File server starting
Tue Dec 10 18:23:01 2002 afs_krb_get_lrealm failed, using natur.cuni.cz.
Tue Dec 10 18:24:12 2002 VL_RegisterAddrs rpc failed; will retry periodically (code=5376, err=2)
Tue Dec 10 18:24:12 2002 Partition /vicepa: attached 1 volumes; 0 volumes not attached
Tue Dec 10 18:24:12 2002 Getting FileServer name...
Tue Dec 10 18:24:12 2002 FileServer host name is 'var400'
Tue Dec 10 18:24:12 2002 Getting FileServer address...
Tue Dec 10 18:24:12 2002 FileServer var400 has address 195.113.59.121 (0xc3713b79 or 0xc3713b79 in host byte order)
Tue Dec 10 18:24:12 2002 File Server started Tue Dec 10 18:24:12 2002

Fetching log file 'VLLog'...
Tue Dec 10 18:23:08 2002 Using 195.113.59.121 as my primary address
Tue Dec 10 18:23:08 2002 Inconsistent Cell Info on server: Tue Dec 10 18:23:08 2002 195.113.59.251 Tue Dec 10 18:23:08 2002
vlserver: Ubik init failed with code 5385

Fetching log file 'PtLog'...
ptserver: problems with host name Ubik init failed
primary address
Tue Dec 10 18:23:08 2002 Inconsistent Cell Info on server: Tue Dec 10 18:23:08 2002 195.113.59.251 Tue Dec 10 18:23:08 2002
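
The two numeric codes in the logs above (5376 in the VL_RegisterAddrs line, 5385 from vlserver) look like they come from the same error table. A minimal sketch in Python, assuming the usual com_err layout where the low 8 bits of a code are the message index within a table; the symbolic names are quoted from memory of ubik's error table and should be treated as an assumption (translate_et, if installed, can confirm them). The helper name split_code is made up for this example:

```python
# Assumption: com_err-style codes keep the message index in the low 8 bits.
INDEX_BITS = 8
INDEX_MASK = (1 << INDEX_BITS) - 1

# Assumption (from memory, please verify): names at these offsets in the
# ubik error table, whose base would be 5376 (0x1500).
UBIK_BASE = 5376
UBIK_NAMES = {
    0: "UNOQUORUM",  # no quorum elected
    9: "UBADHOST",   # problems with host name
}


def split_code(code):
    """Split an error code into (table base, index within that table)."""
    return code & ~INDEX_MASK, code & INDEX_MASK


if __name__ == "__main__":
    for code in (5376, 5385):
        base, index = split_code(code)
        name = UBIK_NAMES.get(index, "?") if base == UBIK_BASE else "?"
        print(f"{code}: base={base} index={index} name={name}")
```

If the table is right, 5385 is the same "problems with host name" that ptserver prints in text form, so both database servers would be failing Ubik init for the same reason, and the "Inconsistent Cell Info" line mentioning 195.113.59.251 is the place to look.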


At least it doesn't coredump anymore.

-- 
Martin Mokrejs <mmokrejs@natur.cuni.cz>, <m.mokrejs@gsf.de>
PGP5.0i key is at http://www.natur.cuni.cz/~mmokrejs
MIPS / Institute for Bioinformatics <http://mips.gsf.de>
GSF - National Research Center for Environment and Health
Ingolstaedter Landstrasse 1, D-85764 Neuherberg, Germany
tel.: +49-89-3187 3683 , fax: +49-89-3187 3585