[OpenAFS-devel] 1.8.x AIX support
Ben Huntsman
ben@huntsmans.net
Tue, 16 May 2023 20:28:01 +0000
--_000_MWHPR0701MB3674BBF00BF8617E0514D7D9A7799MWHPR0701MB3674_
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
Hi there!
Yes, that was it! I pulled that in and it solved the issue. With 15106=
and 14705 applied, pus the other three you already have proposed in gerrit=
, the 1.8.x branch works on AIX.
Thank you!
-Ben
________________________________
From: Cheyenne Wills <cwills@sinenomine.net>
Sent: Tuesday, May 16, 2023 11:10 AM
To: Ben Huntsman <ben@huntsmans.net>
Cc: openafs-devel@openafs.org <openafs-devel@openafs.org>
Subject: Re: [OpenAFS-devel] 1.8.x AIX support
On Tue, 16 May 2023 17:21:58 +0000
Ben Huntsman <ben@huntsmans.net> wrote:
> Hi there!
> Here is the backtrace with a debug build:
>
> bash-4.2# dbx /opt/openafs/libexec/openafs/vlserver core
> Type 'help' for help.
> [using memory image in core]
> reading symbolic information ...
>
> IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2)
> 0xd054cb34 (pthread_kill+0xb4) 80410014 lwz r2,0x14(r1)
> (dbx) where all
> Thread $t1
> _sigsetmask(??, ??, ??) at 0xd054b288
> _p_sigaction(??, ??, ??) at 0xd054be68
> raise.sigaction(??, ??, ??) at 0xd0120d50
> signal(??, ??) at 0xd021f6f4
> SetupLogSoftSignals(), line 469 in "serverLog.c"
> main(argc =3D 0, argv =3D (nil)), line 399 in "vlserver.c"
> Thread $t2
> pthread_kill(??, ??) at 0xd054cb34
> _p_raise(??) at 0xd054bf84
> raise.raise(??) at 0xd0121020
> abort() at 0xd017ca64
> opr_AssertionFailed(file =3D (nil), line =3D 0), line 29 in "assert.c"
> signalHandler(arg =3D (nil)), line 73 in "softsig.c"
>
>
> Hopefully that sheds more light on the situation?
>
> Thank you!
>
> -Ben
>
> ________________________________
> From: Cheyenne Wills <cwills@sinenomine.net>
> Sent: Tuesday, May 16, 2023 6:04 AM
> To: Ben Huntsman <ben@huntsmans.net>
> Cc: openafs-devel@openafs.org <openafs-devel@openafs.org>
> Subject: Re: [OpenAFS-devel] 1.8.x AIX support
>
> On Tue, 16 May 2023 04:46:40 +0000
> Ben Huntsman <ben@huntsmans.net> wrote:
>
> > Hi there-
> > I see that the three AIX issues are being pulled up to 1.8.x. I
> > just tried them out and found that we also need this one:
> >
> > 15106
> >
> > We get a kernel panic on AIX as soon as afsd is started without
> > 15106 applied.
> >
> > I pulled in 15106 and it compiles, but then I have another
> > problem that is that many of the servers coredump immediately:
> >
> > # /opt/openafs/libexec/openafs/vlserver
> > IOT/Abort trap(coredump)
> > # dbx /opt/openafs/libexec/openafs/vlserver core
> > Type 'help' for help.
> > [using memory image in core]
> > reading symbolic information ...warning: no source compiled with -g
> >
> >
> > IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2)
> > 0xd054cb34 (pthread_kill+0xb4) 80410014 lwz r2,0x14(r1)
> > (dbx) where all
> > Thread $t1
> > _sigsetmask(??, ??, ??) at 0xd054b288
> > _p_sigaction(??, ??, ??) at 0xd054be68
> > raise.sigaction(??, ??, ??) at 0xd0120d50
> > signal(??, ??) at 0xd021f6f4
> > .() at 0x100708e4
> > .() at 0x100019fc
> > Thread $t2
> > pthread_kill(??, ??) at 0xd054cb34
> > _p_raise(??) at 0xd054bf84
> > raise.raise(??) at 0xd0121020
> > abort() at 0xd017ca64
> > .() at 0x10005f8c
> > .() at 0x10071670
> >
> >
> > Are we missing another patch as well? Anyone have an idea what it
> > might be?
> >
> > Thanks much!
> >
> > -Ben
> >
> >
>
> Can you try doing a build with --enable-debug as a configure option.
> That should provide information for the backtrace.
>
> --
> Cheyenne Wills
> cwills@sinenomine.net
Looks like gerrit 14705 (from master) is needed as well
--
Cheyenne Wills
cwills@sinenomine.net
--_000_MWHPR0701MB3674BBF00BF8617E0514D7D9A7799MWHPR0701MB3674_
Content-Type: text/html; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
<html>
<head>
<meta http-equiv=3D"Content-Type" content=3D"text/html; charset=3Dus-ascii"=
>
<style type=3D"text/css" style=3D"display:none;"> P {margin-top:0;margin-bo=
ttom:0;} </style>
</head>
<body dir=3D"ltr">
<div style=3D"font-family: Calibri, Arial, Helvetica, sans-serif; font-size=
: 12pt; color: rgb(0, 0, 0);" class=3D"elementToProof">
Hi there!</div>
<div style=3D"font-family: Calibri, Arial, Helvetica, sans-serif; font-size=
: 12pt; color: rgb(0, 0, 0);" class=3D"elementToProof">
Yes, that was it! I pulled that in and it solved the iss=
ue. With 15106 and 14705 applied, pus the other three you already hav=
e proposed in gerrit, the 1.8.x branch works on AIX.</div>
<div style=3D"font-family: Calibri, Arial, Helvetica, sans-serif; font-size=
: 12pt; color: rgb(0, 0, 0);" class=3D"elementToProof">
<br>
</div>
<div style=3D"font-family: Calibri, Arial, Helvetica, sans-serif; font-size=
: 12pt; color: rgb(0, 0, 0);" class=3D"elementToProof">
Thank you!</div>
<div style=3D"font-family: Calibri, Arial, Helvetica, sans-serif; font-size=
: 12pt; color: rgb(0, 0, 0);" class=3D"elementToProof">
<br>
</div>
<div style=3D"font-family: Calibri, Arial, Helvetica, sans-serif; font-size=
: 12pt; color: rgb(0, 0, 0);" class=3D"elementToProof">
-Ben</div>
<div id=3D"appendonsend"></div>
<hr style=3D"display:inline-block;width:98%" tabindex=3D"-1">
<div id=3D"divRplyFwdMsg" dir=3D"ltr"><font face=3D"Calibri, sans-serif" st=
yle=3D"font-size:11pt" color=3D"#000000"><b>From:</b> Cheyenne Wills <cw=
ills@sinenomine.net><br>
<b>Sent:</b> Tuesday, May 16, 2023 11:10 AM<br>
<b>To:</b> Ben Huntsman <ben@huntsmans.net><br>
<b>Cc:</b> openafs-devel@openafs.org <openafs-devel@openafs.org><br>
<b>Subject:</b> Re: [OpenAFS-devel] 1.8.x AIX support</font>
<div> </div>
</div>
<div class=3D"BodyFragment"><font size=3D"2"><span style=3D"font-size:11pt;=
">
<div class=3D"PlainText">On Tue, 16 May 2023 17:21:58 +0000<br>
Ben Huntsman <ben@huntsmans.net> wrote:<br>
<br>
> Hi there!<br>
> Here is the backtrace with a debug build:<br>
> <br>
> bash-4.2# dbx /opt/openafs/libexec/openafs/vlserver core<br>
> Type 'help' for help.<br>
> [using memory image in core]<br>
> reading symbolic information ...<br>
> <br>
> IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2)<br>
> 0xd054cb34 (pthread_kill+0xb4) 80410014 &=
nbsp; lwz r2,0x14(r1)<br>
> (dbx) where all<br>
> Thread $t1<br>
> _sigsetmask(??, ??, ??) at 0xd054b288<br>
> _p_sigaction(??, ??, ??) at 0xd054be68<br>
> raise.sigaction(??, ??, ??) at 0xd0120d50<br>
> signal(??, ??) at 0xd021f6f4<br>
> SetupLogSoftSignals(), line 469 in "serverLog.c"<br>
> main(argc =3D 0, argv =3D (nil)), line 399 in "vlserver.c"<b=
r>
> Thread $t2<br>
> pthread_kill(??, ??) at 0xd054cb34<br>
> _p_raise(??) at 0xd054bf84<br>
> raise.raise(??) at 0xd0121020<br>
> abort() at 0xd017ca64<br>
> opr_AssertionFailed(file =3D (nil), line =3D 0), line 29 in "asse=
rt.c"<br>
> signalHandler(arg =3D (nil)), line 73 in "softsig.c"<br>
> <br>
> <br>
> Hopefully that sheds more light on the situation?<br>
> <br>
> Thank you!<br>
> <br>
> -Ben<br>
> <br>
> ________________________________<br>
> From: Cheyenne Wills <cwills@sinenomine.net><br>
> Sent: Tuesday, May 16, 2023 6:04 AM<br>
> To: Ben Huntsman <ben@huntsmans.net><br>
> Cc: openafs-devel@openafs.org <openafs-devel@openafs.org><br>
> Subject: Re: [OpenAFS-devel] 1.8.x AIX support<br>
> <br>
> On Tue, 16 May 2023 04:46:40 +0000<br>
> Ben Huntsman <ben@huntsmans.net> wrote:<br>
> <br>
> > Hi there-<br>
> > I see that the three AIX issues are being pulle=
d up to 1.8.x. I<br>
> > just tried them out and found that we also need this one:<br>
> ><br>
> > 15106<br>
> ><br>
> > We get a kernel panic on AIX as soon as afsd is=
started without<br>
> > 15106 applied.<br>
> ><br>
> > I pulled in 15106 and it compiles, but then I h=
ave another<br>
> > problem that is that many of the servers coredump immediately:<br=
>
> ><br>
> > # /opt/openafs/libexec/openafs/vlserver<br>
> > IOT/Abort trap(coredump)<br>
> > # dbx /opt/openafs/libexec/openafs/vlserver core<br>
> > Type 'help' for help.<br>
> > [using memory image in core]<br>
> > reading symbolic information ...warning: no source compiled with =
-g<br>
> ><br>
> ><br>
> > IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2)<br>
> > 0xd054cb34 (pthread_kill+0xb4) 80410014 &n=
bsp; lwz r2,0x14(r1)<br>
> > (dbx) where all<br>
> > Thread $t1<br>
> > _sigsetmask(??, ??, ??) at 0xd054b288<br>
> > _p_sigaction(??, ??, ??) at 0xd054be68<br>
> > raise.sigaction(??, ??, ??) at 0xd0120d50<br>
> > signal(??, ??) at 0xd021f6f4<br>
> > .() at 0x100708e4<br>
> > .() at 0x100019fc<br>
> > Thread $t2<br>
> > pthread_kill(??, ??) at 0xd054cb34<br>
> > _p_raise(??) at 0xd054bf84<br>
> > raise.raise(??) at 0xd0121020<br>
> > abort() at 0xd017ca64<br>
> > .() at 0x10005f8c<br>
> > .() at 0x10071670<br>
> ><br>
> ><br>
> > Are we missing another patch as well? Anyone have an idea w=
hat it<br>
> > might be?<br>
> ><br>
> > Thanks much!<br>
> ><br>
> > -Ben<br>
> ><br>
> ><br>
> <br>
> Can you try doing a build with --enable-debug as a configure option.<b=
r>
> That should provide information for the backtrace.<br>
> <br>
> --<br>
> Cheyenne Wills<br>
> cwills@sinenomine.net<br>
<br>
<br>
Looks like gerrit 14705 (from master) is needed as well<br>
-- <br>
Cheyenne Wills<br>
cwills@sinenomine.net<br>
</div>
</span></font></div>
</body>
</html>
--_000_MWHPR0701MB3674BBF00BF8617E0514D7D9A7799MWHPR0701MB3674_--