[OpenAFS] Second fileserver setup

Cajus Pollmeier c.pollmeier@gmx.net
Fri, 10 Oct 2003 17:57:05 +0200


On Friday, 10 October 2003 17:37, you wrote:
> Cajus Pollmeier wrote:
> >>Does this file exist: /var/lib/openafs/sysid
> >>
> > Not on afs-2. afs-1 has this file.
>
> Then copy it to afs-2.
>
> > FileLog:
> > Fri Oct 10 17:01:32 2003 File server starting
> > Fri Oct 10 17:01:32 2003 afs_krb_get_lrealm failed, using testnet.local.
> > Fri Oct 10 17:01:32 2003 /var/lib/openafs/sysid: doesn't exist
>
> Hmmm. This should have created the file if it did not exist. I would copy
> it from afs-1, then do a bos startup again.

I copied the sysid file. Now - doing the -noauth stuff - BosLog is
clean:

afs-2:~# cat /var/log/openafs/BosLog
Fri Oct 10 17:48:01 2003: Server directory access is okay
Fri Oct 10 17:48:11 2003: fs:salv exited with code 0

But after starting the fileserver stuff, I still get these in FileLog:
Fri Oct 10 17:48:11 2003 File server starting
Fri Oct 10 17:48:11 2003 afs_krb_get_lrealm failed, using testnet.local.
Fri Oct 10 17:48:12 2003 VL_RegisterAddrs rpc failed; will retry periodically (code=19270408, err=4)
Fri Oct 10 17:48:13 2003 Couldn't get CPS for AnyUser, will try again in 30 seconds; code=19270408.
Fri Oct 10 17:48:44 2003 Couldn't get CPS for AnyUser, will try again in 30 seconds; code=19270408.
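
The number after code= in the FileLog lines above can be taken apart mechanically. A minimal sketch, assuming the code follows the com_err convention (the low 8 bits are the offset within an error table, the bits above pack the table name six bits per character); the helper names split_com_err and codes_in_log are made up for this example and are not part of OpenAFS:

```python
import re

# Character set com_err uses to pack an error-table name into the
# upper bits of an error code (index 0 is reserved, so 'A' is value 1).
CHARSET = (
    "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
    "abcdefghijklmnopqrstuvwxyz"
    "0123456789_"
)


def split_com_err(code):
    """Return (table_name, offset) for a com_err-style error code."""
    offset = code & 0xFF                 # low 8 bits: index inside the table
    packed = (code >> 8) & 0o77777777    # remaining bits: packed table name
    name = ""
    for i in range(4, -1, -1):           # up to five 6-bit characters
        ch = (packed >> (6 * i)) & 0x3F
        if ch:
            name += CHARSET[ch - 1]
    return name, offset


def codes_in_log(text):
    """Collect the distinct numeric values that follow 'code=' in a log."""
    return sorted({int(m) for m in re.findall(r"code=(\d+)", text)})


if __name__ == "__main__":
    # Two lines copied from the FileLog excerpt above.
    sample = (
        "Fri Oct 10 17:48:12 2003 VL_RegisterAddrs rpc failed; "
        "will retry periodically (code=19270408, err=4)\n"
        "Fri Oct 10 17:48:13 2003 Couldn't get CPS for AnyUser, "
        "will try again in 30 seconds; code=19270408.\n"
    )
    for code in codes_in_log(sample):
        table, offset = split_com_err(code)
        print(code, table, offset)
```

For 19270408 this yields table RXK, offset 8. Which message that offset maps to is best looked up in the corresponding error table rather than guessed.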

Doing some strace for fileserver:
fs-2:~# strace -fff -s1024 -p 694
gettimeofday({1065801104, 801720}, NULL) = 0
gettimeofday({1065801104, 801965}, NULL) = 0
time(NULL)                              = 1065801104
gettimeofday({1065801104, 802325}, NULL) = 0
kill(699, SIGRTMIN)                     = 0
kill(699, SIGRTMIN)                     = 0
gettimeofday({1065801104, 802964}, NULL) = 0
sendmsg(7, {msg_name(16)={sin_family=AF_INET, sin_port=htons(7002), sin_addr=inet_addr("10.3.64.2")}}, msg_iov(2)=[{"\244Eq\22^\v\211\254\0\0\0\1\0\0\0\1\0\0\0\1\1\5\0\2I\21\0I", 28}, {"\351\346^\245_ \235\344>\255.\v\377|TL", 16}], msg_controllen=0, msg_flags=0}, 0) = 44
time(NULL)                              = 1065801104
kill(699, SIGRTMIN)                     = 0
kill(699, SIGRTMIN)                     = 0
time(NULL)                              = 1065801104
rt_sigprocmask(SIG_SETMASK, NULL, [RTMIN], 8) = 0
rt_sigsuspend([] <unfinished ...>
--- SIGRTMIN (Real-time signal 0) ---
<... rt_sigsuspend resumed> )           = -1 EINTR (Interrupted system call)
sigreturn()                             = ? (mask now [RTMIN])
time(NULL)                              = 1065801106
write(4, "Fri Oct 10 17:51:46 2003 Couldn\'t get CPS for AnyUser, will try again in 30 seconds; code=19270408.\n", 100) = 100
rt_sigprocmask(SIG_BLOCK, [CHLD], [RTMIN], 8) = 0
rt_sigaction(SIGCHLD, NULL, {SIG_DFL}, 8) = 0
rt_sigprocmask(SIG_SETMASK, [RTMIN], NULL, 8) = 0
nanosleep({30, 0},
...

So in fact it tries to do a sendmsg to afs-1 on UDP port 7002 (the
sin_port in the trace above; that is the ptserver port, while bosserver
uses 7007). netstat -pltun on afs-1 shows that the bosserver process is
listening.
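
To avoid misreading the destination, the address and port can be pulled straight out of the sendmsg line in the trace. A rough sketch: the port-to-service table lists the conventional AFS UDP port assignments, the function name destinations is hypothetical, and the sample line is abbreviated from the trace above:

```python
import re

# Conventional AFS UDP port assignments.
AFS_UDP_PORTS = {
    7000: "fileserver",
    7001: "cache manager callback",
    7002: "ptserver",
    7003: "vlserver",
    7004: "kaserver",
    7005: "volserver",
    7007: "bosserver",
    7008: "upserver",
    7009: "rmtsys",
}

# Matches the sockaddr_in part of an strace line as printed above.
SOCKADDR = re.compile(
    r'sin_port=htons\((\d+)\),\s*sin_addr=inet_addr\("([^"]+)"\)'
)


def destinations(trace):
    """Return (address, port, service) for every sockaddr in the trace."""
    found = []
    for port, addr in SOCKADDR.findall(trace):
        port = int(port)
        found.append((addr, port, AFS_UDP_PORTS.get(port, "unknown")))
    return found


if __name__ == "__main__":
    # Abbreviated copy of the sendmsg line from the fileserver trace.
    line = (
        'sendmsg(7, {msg_name(16)={sin_family=AF_INET, '
        'sin_port=htons(7002), sin_addr=inet_addr("10.3.64.2")}}, ...'
    )
    for addr, port, service in destinations(line):
        print(addr, port, service)
```

Run against the sendmsg line above, this reports 10.3.64.2, port 7002, ptserver.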

And for volserver:
socket(PF_INET, SOCK_STREAM, IPPROTO_IP) = 3
connect(3, {sin_family=AF_INET, sin_port=htons(2040), sin_addr=inet_addr("127.0.0.1")}}, 16) = -1 ECONNREFUSED (Connection refused)
dup(2)                                  = 4
fcntl64(4, F_GETFL)                     = 0x8002 (flags O_RDWR|O_LARGEFILE)
fstat64(4, {st_mode=S_IFCHR|0600, st_rdev=makedev(136, 0), ...}) = 0
old_mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x40014000
_llseek(4, 0, 0xbffffa14, SEEK_CUR)     = -1 ESPIPE (Illegal seek)
write(4, "FSYNC_clientInit temporary failure (will retry): Connection refused\n", 68) = 68
close(4)                                = 0
munmap(0x40014000, 4096)                = 0
close(3)                                = 0
rt_sigprocmask(SIG_BLOCK, [CHLD], [], 8) = 0
rt_sigaction(SIGCHLD, NULL, {SIG_DFL}, 8) = 0
rt_sigprocmask(SIG_SETMASK, [], NULL, 8) = 0
nanosleep({48, 0},
...

So the volserver process still tries to connect to localhost?
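
Whether anything is listening on the local port from the volserver trace can be checked without strace. A small sketch, assuming the address and port (127.0.0.1, TCP 2040) exactly as they appear in the connect() call above; that the listener would be the local fileserver is only a guess based on the FSYNC_clientInit message, and tcp_port_open is a name invented for this example:

```python
import socket


def tcp_port_open(host, port, timeout=2.0):
    """Return True if a TCP connection to (host, port) succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers "connection refused", timeouts and unreachable hosts.
        return False


if __name__ == "__main__":
    # Address and port taken from the connect() call in the trace.
    if tcp_port_open("127.0.0.1", 2040):
        print("something is listening on 127.0.0.1:2040")
    else:
        print("nothing is listening on 127.0.0.1:2040")
```

If this reports that nothing is listening, the ECONNREFUSED in the trace is expected, and the question becomes why the listening side has not opened the port yet.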

Greetings,
Cajus