[OpenAFS] problem running openafs client on linux-2.4.5

Derek Atkins warlord@MIT.EDU
19 Jun 2001 10:11:50 -0400


Actually, Hartmut, the particular strace he gave was very valuable,
and the log messages he sent (not an oops) were also extremely
valuable.  I now know almost exactly _where_ the error is coming from.
I just need to figure out _why_.

-derek

Hartmut Reuter <reuter@rzg.mpg.de> writes:

> You cannot get much information from the afsd processes, because they
> are only dummy daemons which make the afs system call once in their
> life and never come back. They just create a kernel thread which sits
> in the kernel and waits for work.
> 
> You should look in /var/log/messages to see whether any oopses have
> occurred. If there is nothing wrong with your configuration
> (directory /afs exists; CellServDB, ThisCell and cacheinfo are correct),
> it is most likely a kernel crash (oops). If you run
> "rxdebug <your machine's name> 7001 -all" from another machine, it
> should show you the connections to the database servers and to the
> server which hosts root.afs. If this is not the case, you may have a
> routing problem.
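
A minimal sketch of the local checks described above (the /afs
directory, the three configuration files, and a scan of
/var/log/messages for an oops). The /usr/vice/etc location for
CellServDB, ThisCell and cacheinfo is an assumption based on the afsd
and NetRestrict paths shown in this thread, and the function names are
made up for illustration. It only tests that the files exist, not that
their contents are correct.

```python
import os

# Assumed location of the client configuration; the thread only shows
# /usr/vice/etc for afsd and NetRestrict, so treat this path as a guess.
CONF_DIR = "/usr/vice/etc"
CONF_FILES = ("CellServDB", "ThisCell", "cacheinfo")


def missing_config(conf_dir=CONF_DIR, mount_point="/afs"):
    """Return the expected paths that do not exist (hypothetical helper)."""
    missing = []
    if not os.path.isdir(mount_point):
        missing.append(mount_point)
    for name in CONF_FILES:
        path = os.path.join(conf_dir, name)
        if not os.path.isfile(path):
            missing.append(path)
    return missing


def oops_lines(log_path="/var/log/messages"):
    """Return log lines that mention an oops, ignoring case."""
    with open(log_path, errors="replace") as log:
        return [line.rstrip("\n") for line in log if "oops" in line.lower()]


if __name__ == "__main__":
    for path in missing_config():
        print("missing:", path)
    try:
        for line in oops_lines():
            print(line)
    except OSError as exc:
        # Reading the system log usually needs root.
        print("could not read log:", exc)
```

An empty result from both functions would match the situation reported
in this thread: configuration in place and no oops in the log.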
> 
> Hartmut Reuter
> 
> Claude Lecommandeur wrote:
> > 
> >    Hello,
> > 
> >    I am new to this list and am trying to install OpenAFS
> > at my site. The server is running fine, but I have problems with a client.
> > It is using Linux kernel 2.4.5, and I successfully built libafs-2.4.5.o
> > from openafs-kernel-source-1.0.4-24.2. But (there is a but) when I try
> > to launch afsd (via /etc/init.d/afs start), it freezes. If I strace it,
> > I get:
> > 
> > ps:
> > 
> > root     17335  0.2  0.2  1540  600 pts/8    S    14:45   0:00 strace -f /usr/vice/etc/afsd -stat 300 -dcache 100 -daemons 2 -volumes 50
> > root     17336  6.1  0.2  1516  652 pts/8    R    14:45   0:00 /usr/vice/etc/afsd -stat 300 -dcache 100 -daemons 2 -volumes 50
> > root     17337  0.0  0.0     0    0 pts/8    Z<   14:45   0:00 [afsd <defunct>]
> > root     17338  6.3  0.2  1516  652 pts/8    R    14:45   0:00 /usr/vice/etc/afsd -stat 300 -dcache 100 -daemons 2 -volumes 50
> > root     17339  8.7  0.2  1516  652 pts/8    R<   14:45   0:00 /usr/vice/etc/afsd -stat 300 -dcache 100 -daemons 2 -volumes 50
> > root     17340  6.1  0.2  1516  652 pts/8    R    14:45   0:00 /usr/vice/etc/afsd -stat 300 -dcache 100 -daemons 2 -volumes 50
> > root     17341  6.1  0.2  1516  652 pts/8    R    14:45   0:00 /usr/vice/etc/afsd -stat 300 -dcache 100 -daemons 2 -volumes 50
> > root     17342  6.1  0.2  1516  652 pts/8    R    14:45   0:00 /usr/vice/etc/afsd -stat 300 -dcache 100 -daemons 2 -volumes 50
> > root     17343  5.8  0.2  1516  652 pts/8    R    14:45   0:00 /usr/vice/etc/afsd -stat 300 -dcache 100 -daemons 2 -volumes 50
> > 
> > tail -22 /tmp/afsd.strace:
> > 
> > close(3)                                = 0
> > open("/usr/vice/etc/NetRestrict", O_RDONLY) = -1 ENOENT (No such file or directory)
> > afs_syscall(0x1c, 0x11, 0x1, 0xbfffea60, 0xbfffda60) = 0
> > fork()                                  = 17337
> > [pid 17337] setpriority(PRIO_PROCESS, 0, -10) = 0
> > [pid 17337] afs_syscall(0x1c, 0x30, 0x190, 0, 0) = -1 E2BIG (Argument list too long)
> > [pid 17337] _exit(1)                    = ?
> > --- SIGCHLD (Child exited) ---
> > fork()                                  = 17338
> > [pid 17336] fork()                      = 17339
> > [pid 17339] setpriority(PRIO_PROCESS, 0, -10) = 0
> > [pid 17339] afs_syscall(0x1c, 0x13, 0, 0xfffffff6, 0 <unfinished ...>
> > [pid 17338] afs_syscall(0x1c, 0, 0x190, 0x60, 0x804a9bc <unfinished ...>
> > [pid 17336] fork()                      = 17340
> > [pid 17336] fork()                      = 17341
> > [pid 17340] afs_syscall(0x1c, 0x1, 0x40016280, 0x60, 0x804aa64 <unfinished ...>
> > [pid 17341] afs_syscall(0x1c, 0x4, 0x40016280, 0x60, 0x804aaa4 <unfinished ...>
> > [pid 17336] fork()                      = 17342
> > [pid 17342] afs_syscall(0x1c, 0x2, 0x40016280, 0x60, 0x804ab0d <unfinished ...>
> > [pid 17336] fork()                      = 17343
> > [pid 17336] afs_syscall(0x1c, 0x1d, 0xbfffc5e0, 0x806dccc, 0x1 <unfinished ...>
> > [pid 17343] afs_syscall(0x1c, 0x2, 0x40016280, 0x60, 0x804ab0d <unfinished ...>
> > 
> >    The E2BIG return is suspicious. I have to rmmod the openafs module
> > by hand to unfreeze afsd.
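
For longer traces, a small filter can pull out the interesting lines.
This is only a sketch: it assumes the "strace -f" output format shown
above, the function name is hypothetical, and it says nothing about
what the afs_syscall arguments mean.

```python
import re

# A failed call, e.g.
#   [pid 17337] afs_syscall(0x1c, 0x30, 0x190, 0, 0) = -1 E2BIG (...)
FAILED = re.compile(
    r"^(?:\[pid\s+(?P<pid>\d+)\]\s+)?"
    r"(?P<call>\w+)\((?P<args>.*)\)\s+=\s+-1\s+(?P<errno>E[A-Z0-9]+)"
)
# A call that had not returned when strace printed the line.
UNFINISHED = re.compile(
    r"^(?:\[pid\s+(?P<pid>\d+)\]\s+)?(?P<call>\w+)\(.*<unfinished \.\.\.>\s*$"
)


def summarize(lines, call="afs_syscall"):
    """Collect failed and still-blocked invocations of one syscall."""
    failed, unfinished = [], []
    for raw in lines:
        # Drop mail quoting ("> > ") so quoted traces can be pasted in.
        line = raw.lstrip("> ").strip()
        match = FAILED.match(line)
        if match and match.group("call") == call:
            failed.append(
                (match.group("pid"), match.group("args"), match.group("errno"))
            )
            continue
        match = UNFINISHED.match(line)
        if match and match.group("call") == call:
            unfinished.append(match.group("pid"))
    return {"failed": failed, "unfinished": unfinished}
```

Called as summarize(open("/tmp/afsd.strace")), it returns the failed
afs_syscall invocations with their pid, raw arguments and errno name,
plus the pids whose afs_syscall never returned. A pid of None means the
line carried no [pid N] prefix.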
> > 
> >    Does anyone have any idea?
> > 
> >    Thanks in advance.
> > 
> >          Claude.
> > --
> > Claude Lecommandeur           Claude.Lecommandeur@Epfl.Ch
> > EPFL - SIC                    +41 21 693 22 97
> > 1015 Lausanne (Switzerland)   http://slwww.epfl.ch/SIC/SL/info/Claude.html
> > 
> > _______________________________________________
> > OpenAFS-info mailing list
> > OpenAFS-info@openafs.org
> > https://lists.openafs.org/mailman/listinfo/openafs-info
> 
> -- 
> -----------------------------------------------------------------
> Hartmut Reuter                           e-mail reuter@rzg.mpg.de
> 					   phone +49-89-3299-1328
> RZG (Rechenzentrum Garching)               fax   +49-89-3299-1301 
> Computing Center of the Max-Planck-Gesellschaft (MPG) and the
> Institut fuer Plasmaphysik (IPP)
> -----------------------------------------------------------------

-- 
       Derek Atkins, SB '93 MIT EE, SM '95 MIT Media Laboratory
       Member, MIT Student Information Processing Board  (SIPB)
       URL: http://web.mit.edu/warlord/    PP-ASEL-IA     N1NWH
       warlord@MIT.EDU                        PGP key available