[OpenAFS] missing sysid file, vos listaddrs empty

Dave Botsch botsch@cnf.cornell.edu
Wed, 12 Sep 2007 23:16:22 -0400


Hi, all.

I recently upgraded my single OpenAFS server to 1.4.4. It sits on a private IP
and uses NetInfo to also answer on a public (NATted) IP. After the upgrade, I
tried to change the public IP.

A couple of problems:
1. The sysid file is not getting recreated.
2. The old NATted IP would not go away, even after multiple restarts, changing
the contents of NetInfo, and putting the old IP in NetRestrict.
3. Tried a vos changeaddr -remove public_nat_ip; the address still shows up in
vos listaddrs.
4. Tried vos changeaddr -old 192.168.1.20 -new 192.168.1.20; now vos listaddrs
returns no addresses at all.

If I restart the server, in FileLog I see:

Wed Sep 12 22:55:27 2007 File server starting
Wed Sep 12 22:55:27 2007 afs_krb_get_lrealm failed, using fruit.
Wed Sep 12 22:55:27 2007 /usr/afs/local/sysid: doesn't exist
Wed Sep 12 22:55:27 2007 Creating new SysID file
Wed Sep 12 22:55:27 2007 VL_RegisterAddrs rpc failed; will retry periodically (code=5376, err=0)
Wed Sep 12 22:55:28 2007 Set thread id 14 for FSYNC_sync
Wed Sep 12 22:55:28 2007 Partition /vicepa: attaching volumes
Wed Sep 12 22:55:28 2007 Partition /vicepa: attached 9 volumes; 0 volumes not attached
Wed Sep 12 22:55:28 2007 Set thread id 15 for 'FiveMinuteCheckLWP'
Wed Sep 12 22:55:28 2007 Set thread id 16 for 'HostCheckLWP'
Wed Sep 12 22:55:28 2007 Set thread id 17 for 'FsyncCheckLWP'
Wed Sep 12 22:55:28 2007 Getting FileServer name...
Wed Sep 12 22:55:28 2007 FileServer host name is 'Moo'
Wed Sep 12 22:55:28 2007 Getting FileServer address...
Wed Sep 12 22:55:28 2007 FileServer Moo has address 192.168.1.20 (0x1401a8c0 or 0xc0a80114 in host byte order)
Wed Sep 12 22:55:28 2007 File Server started Wed Sep 12 22:55:28 2007

Despite what it says in the log, no sysid file is created (there is, however, a
new fssync.sock file in /usr/afs/local).

NetInfo has:
192.168.1.20
f 24.92.249.15

NetRestrict has:
24.58.0.101
(which is the ip that would not go away)

Also odd: in the udebug output, the IP address in the "Last yes vote" line is
backwards:

udebug moo 7003 -long
Host's addresses are: 192.168.1.20 24.92.249.15 
Host's 192.168.1.20 time is Wed Sep 12 23:13:40 2007
Local time is Wed Sep 12 23:13:38 2007 (time differential -2 secs)
Last yes vote for 20.1.168.192 was 0 secs ago (sync site); 
Last vote started 0 secs ago (at Wed Sep 12 23:13:38 2007)
Local db version is 1189652834.5
I am sync site forever (1 server)
Recovery state 1f
Sync site's db version is 1189652834.5
0 locked pages, 0 of them for write
Last time a new db version was labelled was:
         386 secs ago (at Wed Sep 12 23:07:12 2007)
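
The backwards address in the "Last yes vote" line looks like a byte-order
artifact: 20.1.168.192 is 192.168.1.20 with its four octets reversed, which
matches the two hex values FileLog prints for the same address. A minimal
Python sketch of that relationship (an illustration only, not OpenAFS code;
the variable names are made up for the example):

```python
import socket
import struct

addr = "192.168.1.20"

# inet_aton returns the 4 octets in network (big-endian) order.
packed = socket.inet_aton(addr)

# The same 4 bytes read as a 32-bit integer in each byte order.
as_big_endian = struct.unpack("!I", packed)[0]
as_little_endian = struct.unpack("<I", packed)[0]

# Reversing the octets gives the address as udebug displayed it.
reversed_addr = socket.inet_ntoa(packed[::-1])

print(hex(as_big_endian))     # 0xc0a80114
print(hex(as_little_endian))  # 0x1401a8c0
print(reversed_addr)          # 20.1.168.192
```

So the reversed display by itself may be only cosmetic (an address printed
without a byte swap) rather than a sign that the wrong address is stored.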


Any help is greatly appreciated!

This server is also not running a client at the moment, since the kernel
module has unresolved dependencies. That is the next issue to tackle once the
server is fixed; I would not expect it to be causing the server problems.

Thanks!


-- 
********************************
David William Botsch
Programmer/Analyst
CNF Computing
botsch@cnf.cornell.edu
********************************