[OpenAFS] fileserver hangs on shutdown

Adam Megacz megacz@cs.berkeley.edu
Fri, 27 Jan 2006 20:43:04 -0800


Any idea what's going on here? The relevant part is at the end of the log below.

The fileserver became unresponsive, so I issued a "bos restart -all".
All the other processes restarted, but the fileserver got stuck while
shutting down.

  - a

Tue Jan 24 02:43:04 2006 ProbeUuid failed for host 128.32.37.64:7001
Tue Jan 24 03:23:19 2006 ProbeUuid failed for host 128.32.37.64:7001
Tue Jan 24 14:04:15 2006 CB: RCallBackConnectBack (host.c) failed for host 66.159.230.136:7001
Wed Jan 25 01:55:14 2006 CB: RCallBackConnectBack (host.c) failed for host 128.32.37.64:7001
Wed Jan 25 14:16:11 2006 CB: RCallBackConnectBack (host.c) failed for host 128.32.37.64:7001
Wed Jan 25 15:17:07 2006 ProbeUuid failed for host 136.152.170.11:7001
Wed Jan 25 22:53:11 2006 ProbeUuid failed for host 128.32.37.64:7001
Wed Jan 25 23:13:50 2006 CB: WhoAreYou failed for 66.159.230.136:7001, error -01
Thu Jan 26 01:34:14 2006 ProbeUuid failed for host 66.159.230.136:7001
Thu Jan 26 12:50:18 2006 ProbeUuid failed for host 66.159.230.136:7001
Thu Jan 26 14:12:02 2006 CB: WhoAreYou failed for 66.159.230.136:7001, error -01
Thu Jan 26 14:22:56 2006 CB: WhoAreYou failed for 66.159.230.136:7001, error -01
Thu Jan 26 16:13:52 2006 CB: WhoAreYou failed for 66.159.230.136:7001, error -01
Thu Jan 26 18:24:48 2006 CB: WhoAreYou failed for 66.159.230.136:7001, error -01
Thu Jan 26 20:36:15 2006 CB: RCallBackConnectBack (host.c) failed for host 66.159.230.136:7001
Fri Jan 27 05:17:12 2006 CB: RCallBackConnectBack (host.c) failed for host 128.32.37.64:7001
Fri Jan 27 19:23:16 2006 ProbeUuid failed for host 66.159.230.136:7001
Fri Jan 27 19:56:21 2006 CB: WhoAreYou failed for 66.159.230.136:7001, error -01
Fri Jan 27 20:02:37 2006 CB: WhoAreYou failed for 66.159.230.136:7001, error -01
Fri Jan 27 20:03:33 2006 CB: WhoAreYou failed for 66.159.230.136:7001, error -01
Fri Jan 27 20:03:33 2006 FindClient: client 81e88b0(f4f0498) already had conn 81f4e50 (host 40252080), stolen by client 81e88b0(f4f0498)
Fri Jan 27 20:31:52 2006 Shutting down file server at Fri Jan 27 20:31:52 2006
Fri Jan 27 20:31:52 2006 Vice was last started at Sun Jan 22 23:53:49 2006

Fri Jan 27 20:31:52 2006 Large vnode cache, 400 entries, 2 allocs, 32587 gets (552 reads), 143 writes
Fri Jan 27 20:31:52 2006 Small vnode cache,400 entries, 79 allocs, 61917 gets (8090 reads), 970 writes
Fri Jan 27 20:31:52 2006 Volume header cache, 400 entries, 32622 gets, 0 replacements
Fri Jan 27 20:31:52 2006 Partition /vicepa: 796541188 available 1K blocks (minfree=0), Fri Jan 27 20:31:52 2006 163311396 free blocks
Fri Jan 27 20:31:52 2006 Partition /vicepc: 796541188 available 1K blocks (minfree=0), Fri Jan 27 20:31:52 2006 163311396 free blocks
Fri Jan 27 20:31:52 2006 With 90 directory buffers; 1507 reads resulted in 17 read I/Os
Fri Jan 27 20:31:52 2006 Total Client entries = 14, blocks = 1; Host entries = 4, blocks = 1
Fri Jan 27 20:31:52 2006 There are 14 connections, process size 133168
Fri Jan 27 20:31:52 2006 There are 4 workstations, 1 are active (req in < 15 mins), 0 marked "down"
Fri Jan 27 20:31:52 2006 VShutdown:  shutting down on-line volumes...
Fri Jan 27 20:32:22 2006 CB: WhoAreYou failed for 66.159.230.136:7001, error -01
Fri Jan 27 20:33:32 2006 pr_GetCPS failed(-01) for user 5, host 66.159.230.136:7001
Fri Jan 27 20:33:32 2006 CallPreamble: Couldn't get CPS. Reconnect to ptserver
Fri Jan 27 20:34:42 2006 FindClient: client 81e9218(96316934) already had conn 81e8548 (host 40252080), stolen by client 81e9218(96316934)
Fri Jan 27 20:34:42 2006 CallPreamble: Couldn't get CPS. Fail
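
For anyone wanting to summarize a log like the one above, here is a
minimal sketch in Python that tallies the "failed" lines per client
address. It is not part of OpenAFS; the function name
count_failures_by_host is made up for illustration, and the only
assumption about the log format is what is visible above: a failure
line contains the word "failed" and an IPv4 address followed by a port.

```python
import re
from collections import Counter

# Matches an IPv4 address followed by a port, e.g. "66.159.230.136:7001".
# Timestamps such as "20:31:52" do not match because they contain no dots.
HOST_RE = re.compile(r"(\d{1,3}(?:\.\d{1,3}){3}):(\d+)")


def count_failures_by_host(lines):
    """Count log lines that contain 'failed', grouped by client IP.

    Hypothetical helper: it relies only on the line shapes seen in the
    log above and ignores lines without an address.
    """
    counts = Counter()
    for line in lines:
        if "failed" not in line:
            continue
        match = HOST_RE.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts
```

To use it, pass the lines of the FileLog (its location depends on the
installation) and print counts.most_common(). On the excerpt above,
most of the failures involve 66.159.230.136.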


-- 
PGP/GPG: 5C9F F366 C9CF 2145 E770  B1B8 EFB1 462D A146 C380