[OpenAFS] Re: Pauses in vos dump commands

Frank Burkhardt fbo2@gmx.net
Tue, 18 Oct 2005 09:31:02 +0200


Hi,

On Fri, Oct 14, 2005 at 08:47:37AM -0500, E. Chris Garrison wrote:

[snip]

> Another person on the list named Steve suggested I try to substitute
> "cat > /dev/null" for the archive command and run strace on the vos
> dump and archive.
> 
> Well, when I use the "cat > /dev/null", it pauses in the exact same
> way.  Good test though, at least it helps make the case that it's the
> vos dump not the archive command that's the problem.  
> 
> The strace hangs at this point:
> 
>   rt_sigprocmask(SIG_BLOCK, [INT CHLD], [CHLD], 8) = 0
>   clone(child_stack=0,
>   flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD,
>   child_tidptr=0xb7ff2708) = 28554
>   rt_sigprocmask(SIG_SETMASK, [CHLD], NULL, 8) = 0
>   close(3)                                = 0
>   rt_sigprocmask(SIG_BLOCK, [CHLD], [CHLD], 8) = 0
>   rt_sigprocmask(SIG_SETMASK, [CHLD], NULL, 8) = 0
>   rt_sigprocmask(SIG_BLOCK, [CHLD], [CHLD], 8) = 0
>   rt_sigaction(SIGINT, {0x807603b, [], SA_RESTORER, 0x587a48},
>   {SIG_DFL}, 8) = 0
>   waitpid(-1,
> 
> Not that I know what all the code means, but it's odd that it pauses
> in the middle of printing out the command like that.

It's not. It means, that waitpid() is called, which blocks the program until
a child process terminates. If you want to see, what's going on within the
child processes, use 'strace -f' instead of 'strace' to (f)ollow child
processes spawned by the straced process.

A guess of mine: Maybe one of your DB-Servers or one of your DNS-Servers is down?
This is a problem, I have in one of my cells although I have to wait ~ 30s instead
of 10min.

Regards,

Frank