[OpenAFS] fileserver - salvage loop

Matthew Cocker matt@cs.auckland.ac.nz
Tue, 21 Oct 2003 11:08:33 +1300


Hi

found this post that maybe fix

https://lists.openafs.org/pipermail/openafs-info/2003-September/010688.htm

will upgrade now. But why would 3 fileservers go down at once?

Cheers

Matt


Matthew Cocker wrote:

> Hi
> 
> at exactly the same time last night we had 3 Openafs servers go into a 
> continuous fileserver-salvage-fs-salv loop.
> 
> Mon Oct 20 17:05:07 2003: fs:file exited on signal 11
> Mon Oct 20 17:05:07 2003: fs:vol exited on signal 15
> Mon Oct 20 17:09:54 2003: fs:salv exited with code 0
> Mon Oct 20 17:11:24 2003: fs:file exited with code 1
> Mon Oct 20 17:11:24 2003: fs:vol exited on signal 15
> Mon Oct 20 17:14:54 2003: fs:salv exited with code 0
> Mon Oct 20 17:16:24 2003: fs:file exited with code 1
> Mon Oct 20 17:16:24 2003: fs:vol exited on signal 15
> Mon Oct 20 17:18:15 2003: fs:salv exited with code 0
> Mon Oct 20 17:19:45 2003: fs:file exited with code 1
> Mon Oct 20 17:19:45 2003: fs:vol exited on signal 15
> Mon Oct 20 17:20:59 2003: fs:salv exited with code 0
> Mon Oct 20 17:22:29 2003: fs:file exited with code 1
> Mon Oct 20 17:22:29 2003: fs:vol exited on signal 15
> Mon Oct 20 17:23:37 2003: fs:salv exited with code 0
> Mon Oct 20 17:25:07 2003: fs:file exited with code 1
> 
> 
> The box is looping still if anyone has a command the want me to run I 
> can leave the box like this for a little while. The fileserver process 
> 719 is the one that has not died.
> 
> openafs 1.2.9 redhat 7.3 kernel 2.4.20-18.7
> 
> [root@afs-11-fos-ec logs]# ps -welf
>   F S UID        PID  PPID  C PRI  NI ADDR    SZ WCHAN  STIME TTY     
> TIME CMD
> 100 S root         1     0  0  68   0    -   343 do_sel Sep04 ? 00:00:08 
> init [3]
> 040 S root         2     1  0  69   0    -     0 contex Sep04 ? 00:00:04 
> [keventd]
> 040 S root         3     1  0  79  19    -     0 ksofti Sep04 ? 00:00:09 
> [ksoftirqd_CPU0]
> 040 S root         4     1  0  69   0    -     0 wakeup Sep04 ? 00:04:55 
> [kswapd]
> 040 S root         5     1  0  78   0    -     0 kscand Sep04 ? 06:28:34 
> [kscand]
> 040 S root         6     1  0  69   0    -     0 bdflus Sep04 ? 00:00:00 
> [bdflush]
> 040 S root         7     1  0  69   0    -     0 kupdat Sep04 ? 00:00:22 
> [kupdated]
> 040 S root         8     1  0  59 -20    -     0 md_thr Sep04 ? 00:00:00 
> [mdrecoveryd]
> 040 S root        14     1  0  69   0    -     0 end    Sep04 ? 00:00:00 
> [aacraid]
> 040 S root        15     1  0  69   0    -     0 down_i Sep04 ? 00:00:00 
> [scsi_eh_0]
> 040 S root        18     1  0  69   0    -     0 end    Sep04 ? 00:00:04 
> [kjournald]
> 040 S root       146     1  0  69   0    -     0 end    Sep04 ? 00:00:00 
> [kjournald]
> 040 S root       147     1  0  69   0    -     0 end    Sep04 ? 00:01:57 
> [kjournald]
> 040 S root       148     1  0  71   0    -     0 end    Sep04 ? 00:01:44 
> [kjournald]
> 040 S root       538     1  0  69   0    -   357 do_sel Sep04 ? 00:00:00 
> syslogd -m 0
> 140 S root       543     1  0  69   0    -   342 do_sys Sep04 ? 00:00:00 
> klogd -x
> 040 S root       698     1  0  73   0    -  1021 do_sel Sep04 ? 00:08:33 
> /sbin/bosserver
> 040 S root       715     1  0  68   0    -   384 nanosl Sep04 ? 00:00:00 
> crond
> 040 S root       719     1  0  69   0    - 12151 rt_sig Sep04 ? 00:00:00 
> /libexec/openafs/fileserver
> 100 S root       758     1  0  69   0    -   336 read_c Sep04 tty1 
> 00:00:00 /sbin/mingetty tty1
> 100 S root       759     1  0  69   0    -   337 read_c Sep04 tty2 
> 00:00:00 /sbin/mingetty tty2
> 100 S root       760     1  0  69   0    -   336 read_c Sep04 tty3 
> 00:00:00 /sbin/mingetty tty3
> 100 S root       762     1  0  69   0    -   335 read_c Sep04 tty5 
> 00:00:00 /sbin/mingetty tty5
> 100 S root       763     1  0  69   0    -   335 read_c Sep04 tty6 
> 00:00:00 /sbin/mingetty tty6
> 140 S root      6548     1  0  69   0    -   482 do_sel Sep23 ? 00:00:00 
> /usr/sbin/zebra -d
> 140 S root      6610     1  0  69   0    -   530 do_sel Sep23 ? 00:00:23 
> /usr/sbin/ripd -d
> 100 S root      7145     1  0  69   0    -   335 read_c Sep23 tty4 
> 00:00:00 /sbin/mingetty tty4
> 140 S root      8966     1  0  69   0    -   867 do_sel Sep24 ? 00:00:07 
> /usr/sbin/sshd
> 040 S root      5109  8966  0  69   0    -  1531 do_sel 09:19 ? 00:00:00 
> sshd: root@pts/0
> 100 S root      5114  5109  0  76   0    -   618 wait4  09:20 pts/0 
> 00:00:00 -bash
> 100 S root     20718   698  0  72  -5    -  2611 nanosl 09:27 ? 00:00:00 
> /libexec/openafs/fileserver
> 100 S root     20719   698  0  74   0    -  1289 tcp_da 09:27 ? 00:00:00 
> /libexec/openafs/volserver
> 040 S root     20720 20718  0  73   0    -  2611 do_pol 09:27 ? 00:00:00 
> /libexec/openafs/fileserver
> 040 S root     20721 20720  0  73   0    -  2611 rt_sig 09:27 ? 00:00:00 
> /libexec/openafs/fileserver
> 000 R root     20723  5114  0  78   0    -   774 -      09:27 pts/0 
> 00:00:00 ps -welf
> 
> _______________________________________________
> OpenAFS-info mailing list
> OpenAFS-info@openafs.org
> https://lists.openafs.org/mailman/listinfo/openafs-info