From ben@huntsmans.net Thu May 4 18:14:34 2023 From: ben@huntsmans.net (Ben Huntsman) Date: Thu, 4 May 2023 17:14:34 +0000 Subject: [OpenAFS-devel] One more AIX fix Message-ID: --_000_DM5PR0701MB367135E16F9FF5C1FF90C74DA76D9DM5PR0701MB3671_ Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Hi guys- Did the next OpenAFS Stable get released yet? I just pushed another cha= nge for an issue that was reported to me for the AIX port, and we should ha= ve this incorporated before we fully bless the AIX port. The change is 15418. Thank you so much! -Ben --_000_DM5PR0701MB367135E16F9FF5C1FF90C74DA76D9DM5PR0701MB3671_ Content-Type: text/html; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable
Hi guys-
   Did the next OpenAFS Stable get released yet?  I just pus= hed another change for an issue that was reported to me for the AIX port, a= nd we should have this incorporated before we fully bless the AIX port.

   The change is 15418.

Thank you so much!

-Ben

--_000_DM5PR0701MB367135E16F9FF5C1FF90C74DA76D9DM5PR0701MB3671_-- From mmeffie@sinenomine.net Thu May 4 19:55:06 2023 From: mmeffie@sinenomine.net (Michael Meffie) Date: Thu, 4 May 2023 14:55:06 -0400 Subject: [OpenAFS-devel] OpenAFS Release Team weekly meeting Message-ID: <20230504145506.20a6b703@lana> OpenAFS Release Team weekly meeting Date: May 04, 2023 Participants: - Stephan Wiesand, OpenAFS Release Manager - Ben Kaduk - Cheyenne Wills - Michael Meffie The OpenAFS Release Team meetings are held each Thursday at 12:00pm Eastern, 9:00am Pacific, on IRC channel #openafs-releaseteam of Libra.Chat. Stable (1.8.x) ============== Stephan has merged the pending changes to the openafs-stable-1_8_x branch and asked if there are any other urgent changes for 1.8.10pre1. Cheyenne believes 14323 (Retire AFS_MOUNT_AFS) is needed for AIX 7.1. (This change was already merged on to the master branch.) Development (1.9.x/master) ========================== Cheyenne pushed a change to fix a build error on CentOS Stream 9 when building with --disable-checking (which is still the default). 15417 Linux: libafs build error with disable-checking Recent Changes ============== Merged onto 'openafs-stable-1_8_x' branch since 2023-04-27: 15411 Linux: Clear msghdr structure before use 15410 Linux: Do not initialize msghdr's msg_iter/msg_iov 15403 AIX: Fix undefined symbols 15401 LINUX: Introduce afs_d_alias_foreach 15400 afs: Fix a few ARCH/osi_vcache.c style errors 15389 Linux 6.3: Use mnt_idmap for inode op functions 15388 Linux 6.3: Include linux/filelock.h if available Merged onto 'master' branch since 2023-04-27: 15409 Linux: Clear msghdr structure before use 15408 Linux: Do not initialize msghdr's msg_iter/msg_iov Updated for 'master' branch since 2023-04-27: 15404 afs: Check UHasTokens in afs_GCUserData 15406 RedHat: Add RPM_RELNOTES, RPM_CHANGELOG, and RPM_CELLSERVDB 15405 RedHat: Support building RPMs with alternate CellServDB 15407 RedHat: Add RPM_BUILD_MODULES and RPM_BUILD_USERSPACE 15417 Linux: libafs build error with disable-checking 15414 audit: Update comments 15413 audit: Remove redundant AFS_AIX32_ENV ifdef 15412 audit: Remove vestigial osi_echo_trail logic 15416 vol: Remove dead extern FSYNC_Init 15415 audit: Make audit operations threadsafe 12586 bozo: Do not create client directory and symlinks 15351 Add command fallback to server config From mmeffie@sinenomine.net Fri May 12 16:54:17 2023 From: mmeffie@sinenomine.net (Michael Meffie) Date: Fri, 12 May 2023 11:54:17 -0400 Subject: [OpenAFS-devel] OpenAFS Release Team weekly meeting Message-ID: <20230512115417.22f978ef@lana> OpenAFS Release Team weekly meeting Date: May 11, 2023 Participants: - Stephan Wiesand, OpenAFS Release Manager - Ben Kaduk - Cheyenne Wills - Michael Meffie The OpenAFS Release Team meetings are held each Thursday at 12:00pm Eastern, 9:00am Pacific, on IRC channel #openafs-releaseteam of Libra.Chat. Stable (1.8.x) ============== The OpenAFS mirror on github.com is not being updated. Cheyenne pointed out a recent announcement by github related to 2FA that may be relevant. Ben Kaduk will investigate. Mike submitted 15422 for 1.8.x, since Ben Huntsman reported it fixes a build error on AIX 7.1. 15422 Retire AFS_MOUNT_AFS Ben Huntsman has created a buildbot worker for AIX 7.1. It is current configured as a nightly builder since the build times are very slow. Development (1.9.x/master) ========================== Ben Huntsman submitted two changes to fix crashes on AIX. 15418 AIX: AFS_SMALLOCSIZ too small on newer OS releases 15419 AIX: Free pinned_heap during shutdown to prevent kernel panic Andrew submitted a client fix related to cached stat info of symlinks. 15423 LINUX: Set .getattr for symlinks Jeffrey Altman submitted a fix related to rx jumbograms: 15421 rx: alter rx_jumboHeader byte-order conversion Mike updated the changes needed to retire the bosserver creation of client CellServDB and ThisCell symlinks. 12586 bozo: Do not create client directory and symlinks 15351 Add command fallback to server config Cheyenne said he completed code changes for the buildbot to improve reliability when one or more non-critical workers goes offline or hangs. He also has a change to avoid rebuilding when just the commit message is changed. Mike will be pushing the changes to the buildbot master after more local testing. Recent Changes ============== Updated for 'openafs-stable-1_8_x' branch since 2023-05-05: 15422 Retire AFS_MOUNT_AFS Merged onto 'master' branch since 2023-05-05: 15418 AIX: AFS_SMALLOCSIZ too small on newer OS releases 15413 audit: Remove redundant AFS_AIX32_ENV ifdef 15412 audit: Remove vestigial osi_echo_trail logic Updated for 'master' branch since 2023-05-05: 15419 AIX: Free pinned_heap during shutdown to prevent kernel panic 15391 LINUX: Make 'fs flush*' invalidate dentry 15089 bozo: Parse command lines with cmd_Tokenize() and cmd_Split() 15335 cmd: Do not leak tokens in cmd_ParseLine() 15087 tests: Add cmd_ParseLine() checks 15088 cmd: Introduce cmd_Tokenize() and cmd_Split() 15086 cmd: Do not leak param in cmd_Parse() 15423 LINUX: Set .getattr for symlinks 15404 afs: Check UHasTokens in afs_GCUserData 15420 util: Remove duplicate path definitions 12586 bozo: Do not create client directory and symlinks 15351 Add command fallback to server config 15415 audit: Make audit operations threadsafe 15414 audit: Update comments 15421 rx: alter rx_jumboHeader byte-order conversion From ben@huntsmans.net Tue May 16 05:46:40 2023 From: ben@huntsmans.net (Ben Huntsman) Date: Tue, 16 May 2023 04:46:40 +0000 Subject: [OpenAFS-devel] 1.8.x AIX support Message-ID: --_000_MWHPR0701MB3674672FE73F702B95F7E6BAA7789MWHPR0701MB3674_ Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Hi there- I see that the three AIX issues are being pulled up to 1.8.x. I just tr= ied them out and found that we also need this one: 15106 We get a kernel panic on AIX as soon as afsd is started without 15106 ap= plied. I pulled in 15106 and it compiles, but then I have another problem that = is that many of the servers coredump immediately: # /opt/openafs/libexec/openafs/vlserver IOT/Abort trap(coredump) # dbx /opt/openafs/libexec/openafs/vlserver core Type 'help' for help. [using memory image in core] reading symbolic information ...warning: no source compiled with -g IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2) 0xd054cb34 (pthread_kill+0xb4) 80410014 lwz r2,0x14(r1) (dbx) where all Thread $t1 _sigsetmask(??, ??, ??) at 0xd054b288 _p_sigaction(??, ??, ??) at 0xd054be68 raise.sigaction(??, ??, ??) at 0xd0120d50 signal(??, ??) at 0xd021f6f4 .() at 0x100708e4 .() at 0x100019fc Thread $t2 pthread_kill(??, ??) at 0xd054cb34 _p_raise(??) at 0xd054bf84 raise.raise(??) at 0xd0121020 abort() at 0xd017ca64 .() at 0x10005f8c .() at 0x10071670 Are we missing another patch as well? Anyone have an idea what it might be= ? Thanks much! -Ben --_000_MWHPR0701MB3674672FE73F702B95F7E6BAA7789MWHPR0701MB3674_ Content-Type: text/html; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable
Hi there-
   I see that the three AIX issues are being pulled up to 1.8.x.&= nbsp; I just tried them out and found that we also need this one:

15106

   We get a kernel panic on AIX as soon as afsd is started withou= t 15106 applied.

   I pulled in 15106 and it compiles, but then I have another pro= blem that is that many of the servers coredump immediately:

# /opt/openafs/libexec/openafs/vlserver
IOT/Abort trap(coredump)
# dbx /opt/openafs/libexec/openafs/vlserver c= ore
Type 'help' for help.
[using memory image in core]
reading symbolic information ...warning: no s= ource compiled with -g


IOT/Abort trap in pthread_kill at 0xd054cb34 = ($t2)
0xd054cb34 (pthread_kill+0xb4) 80410014  = ;          lwz   r2,0x14(r1)
(dbx) where all
Thread $t1
_sigsetmask(??, ??, ??) at 0xd054b288
_p_sigaction(??, ??, ??) at 0xd054be68
raise.sigaction(??, ??, ??) at 0xd0120d50
signal(??, ??) at 0xd021f6f4
.() at 0x100708e4
.() at 0x100019fc
Thread $t2
pthread_kill(??, ??) at 0xd054cb34
_p_raise(??) at 0xd054bf84
raise.raise(??) at 0xd0121020
abort() at 0xd017ca64
.() at 0x10005f8c
.() at 0x10071670


Are we missing another patch as well?  Anyone have an idea what it mig= ht be?

Thanks much!

-Ben


--_000_MWHPR0701MB3674672FE73F702B95F7E6BAA7789MWHPR0701MB3674_-- From cwills@sinenomine.net Tue May 16 14:04:21 2023 From: cwills@sinenomine.net (Cheyenne Wills) Date: Tue, 16 May 2023 07:04:21 -0600 Subject: [OpenAFS-devel] 1.8.x AIX support In-Reply-To: References: Message-ID: <20230516070421.29414c9f.cwills@sinenomine.net> On Tue, 16 May 2023 04:46:40 +0000 Ben Huntsman wrote: > Hi there- > I see that the three AIX issues are being pulled up to 1.8.x. I > just tried them out and found that we also need this one: > > 15106 > > We get a kernel panic on AIX as soon as afsd is started without > 15106 applied. > > I pulled in 15106 and it compiles, but then I have another problem > that is that many of the servers coredump immediately: > > # /opt/openafs/libexec/openafs/vlserver > IOT/Abort trap(coredump) > # dbx /opt/openafs/libexec/openafs/vlserver core > Type 'help' for help. > [using memory image in core] > reading symbolic information ...warning: no source compiled with -g > > > IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2) > 0xd054cb34 (pthread_kill+0xb4) 80410014 lwz r2,0x14(r1) > (dbx) where all > Thread $t1 > _sigsetmask(??, ??, ??) at 0xd054b288 > _p_sigaction(??, ??, ??) at 0xd054be68 > raise.sigaction(??, ??, ??) at 0xd0120d50 > signal(??, ??) at 0xd021f6f4 > .() at 0x100708e4 > .() at 0x100019fc > Thread $t2 > pthread_kill(??, ??) at 0xd054cb34 > _p_raise(??) at 0xd054bf84 > raise.raise(??) at 0xd0121020 > abort() at 0xd017ca64 > .() at 0x10005f8c > .() at 0x10071670 > > > Are we missing another patch as well? Anyone have an idea what it > might be? > > Thanks much! > > -Ben > > Can you try doing a build with --enable-debug as a configure option. That should provide information for the backtrace. -- Cheyenne Wills cwills@sinenomine.net From ben@huntsmans.net Tue May 16 18:21:58 2023 From: ben@huntsmans.net (Ben Huntsman) Date: Tue, 16 May 2023 17:21:58 +0000 Subject: [OpenAFS-devel] 1.8.x AIX support In-Reply-To: <20230516070421.29414c9f.cwills@sinenomine.net> References: <20230516070421.29414c9f.cwills@sinenomine.net> Message-ID: --_000_MWHPR0701MB3674DC7895D57FBA9F7D0DCCA7799MWHPR0701MB3674_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Hi there! Here is the backtrace with a debug build: bash-4.2# dbx /opt/openafs/libexec/openafs/vlserver core Type 'help' for help. [using memory image in core] reading symbolic information ... IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2) 0xd054cb34 (pthread_kill+0xb4) 80410014 lwz r2,0x14(r1) (dbx) where all Thread $t1 _sigsetmask(??, ??, ??) at 0xd054b288 _p_sigaction(??, ??, ??) at 0xd054be68 raise.sigaction(??, ??, ??) at 0xd0120d50 signal(??, ??) at 0xd021f6f4 SetupLogSoftSignals(), line 469 in "serverLog.c" main(argc =3D 0, argv =3D (nil)), line 399 in "vlserver.c" Thread $t2 pthread_kill(??, ??) at 0xd054cb34 _p_raise(??) at 0xd054bf84 raise.raise(??) at 0xd0121020 abort() at 0xd017ca64 opr_AssertionFailed(file =3D (nil), line =3D 0), line 29 in "assert.c" signalHandler(arg =3D (nil)), line 73 in "softsig.c" Hopefully that sheds more light on the situation? Thank you! -Ben ________________________________ From: Cheyenne Wills Sent: Tuesday, May 16, 2023 6:04 AM To: Ben Huntsman Cc: openafs-devel@openafs.org Subject: Re: [OpenAFS-devel] 1.8.x AIX support On Tue, 16 May 2023 04:46:40 +0000 Ben Huntsman wrote: > Hi there- > I see that the three AIX issues are being pulled up to 1.8.x. I > just tried them out and found that we also need this one: > > 15106 > > We get a kernel panic on AIX as soon as afsd is started without > 15106 applied. > > I pulled in 15106 and it compiles, but then I have another problem > that is that many of the servers coredump immediately: > > # /opt/openafs/libexec/openafs/vlserver > IOT/Abort trap(coredump) > # dbx /opt/openafs/libexec/openafs/vlserver core > Type 'help' for help. > [using memory image in core] > reading symbolic information ...warning: no source compiled with -g > > > IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2) > 0xd054cb34 (pthread_kill+0xb4) 80410014 lwz r2,0x14(r1) > (dbx) where all > Thread $t1 > _sigsetmask(??, ??, ??) at 0xd054b288 > _p_sigaction(??, ??, ??) at 0xd054be68 > raise.sigaction(??, ??, ??) at 0xd0120d50 > signal(??, ??) at 0xd021f6f4 > .() at 0x100708e4 > .() at 0x100019fc > Thread $t2 > pthread_kill(??, ??) at 0xd054cb34 > _p_raise(??) at 0xd054bf84 > raise.raise(??) at 0xd0121020 > abort() at 0xd017ca64 > .() at 0x10005f8c > .() at 0x10071670 > > > Are we missing another patch as well? Anyone have an idea what it > might be? > > Thanks much! > > -Ben > > Can you try doing a build with --enable-debug as a configure option. That should provide information for the backtrace. -- Cheyenne Wills cwills@sinenomine.net --_000_MWHPR0701MB3674DC7895D57FBA9F7D0DCCA7799MWHPR0701MB3674_ Content-Type: text/html; charset="us-ascii" Content-Transfer-Encoding: quoted-printable
Hi there!
   Here is the backtrace with a debug build:

bash-4.2# dbx /opt/openafs/libexec/openafs/vlserver core
Type 'help' for help.
[using memory image in core]
reading symbolic information ...

IOT/Abort trap in pthread_kill at 0xd054cb34 = ($t2)
0xd054cb34 (pthread_kill+0xb4) 80410014  = ;          lwz   r2,0x14(r1)
(dbx) where all
Thread $t1
_sigsetmask(??, ??, ??) at 0xd054b288
_p_sigaction(??, ??, ??) at 0xd054be68
raise.sigaction(??, ??, ??) at 0xd0120d50
signal(??, ??) at 0xd021f6f4
SetupLogSoftSignals(), line 469 in "serv= erLog.c"
main(argc =3D 0, argv =3D (nil)), line 399 in= "vlserver.c"
Thread $t2
pthread_kill(??, ??) at 0xd054cb34
_p_raise(??) at 0xd054bf84
raise.raise(??) at 0xd0121020
abort() at 0xd017ca64
opr_AssertionFailed(file =3D (nil), line =3D = 0), line 29 in "assert.c"
signalHandler(arg =3D (nil)), line 73 in &quo= t;softsig.c"


Hopefully that sheds more light on the situation?

Thank you!

-Ben


From: Cheyenne Wills <cw= ills@sinenomine.net>
Sent: Tuesday, May 16, 2023 6:04 AM
To: Ben Huntsman <ben@huntsmans.net>
Cc: openafs-devel@openafs.org <openafs-devel@openafs.org>
Subject: Re: [OpenAFS-devel] 1.8.x AIX support
 
On Tue, 16 May 2023 04:46:40 +0000
Ben Huntsman <ben@huntsmans.net> wrote:

> Hi there-
>    I see that the three AIX issues are being pulled up = to 1.8.x.  I
> just tried them out and found that we also need this one:
>
> 15106
>
>    We get a kernel panic on AIX as soon as afsd is star= ted without
> 15106 applied.
>
>    I pulled in 15106 and it compiles, but then I have a= nother problem
> that is that many of the servers coredump immediately:
>
> # /opt/openafs/libexec/openafs/vlserver
> IOT/Abort trap(coredump)
> # dbx /opt/openafs/libexec/openafs/vlserver core
> Type 'help' for help.
> [using memory image in core]
> reading symbolic information ...warning: no source compiled with -g >
>
> IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2)
> 0xd054cb34 (pthread_kill+0xb4) 80410014     &= nbsp;      lwz   r2,0x14(r1)
> (dbx) where all
> Thread $t1
> _sigsetmask(??, ??, ??) at 0xd054b288
> _p_sigaction(??, ??, ??) at 0xd054be68
> raise.sigaction(??, ??, ??) at 0xd0120d50
> signal(??, ??) at 0xd021f6f4
> .() at 0x100708e4
> .() at 0x100019fc
> Thread $t2
> pthread_kill(??, ??) at 0xd054cb34
> _p_raise(??) at 0xd054bf84
> raise.raise(??) at 0xd0121020
> abort() at 0xd017ca64
> .() at 0x10005f8c
> .() at 0x10071670
>
>
> Are we missing another patch as well?  Anyone have an idea what i= t
> might be?
>
> Thanks much!
>
> -Ben
>
>

Can you try doing a build with --enable-debug as a configure option.
That should provide information for the backtrace.

--
Cheyenne Wills
cwills@sinenomine.net
--_000_MWHPR0701MB3674DC7895D57FBA9F7D0DCCA7799MWHPR0701MB3674_-- From cwills@sinenomine.net Tue May 16 19:10:53 2023 From: cwills@sinenomine.net (Cheyenne Wills) Date: Tue, 16 May 2023 12:10:53 -0600 Subject: [OpenAFS-devel] 1.8.x AIX support In-Reply-To: References: <20230516070421.29414c9f.cwills@sinenomine.net> Message-ID: <20230516121053.52f8925c.cwills@sinenomine.net> On Tue, 16 May 2023 17:21:58 +0000 Ben Huntsman wrote: > Hi there! > Here is the backtrace with a debug build: > > bash-4.2# dbx /opt/openafs/libexec/openafs/vlserver core > Type 'help' for help. > [using memory image in core] > reading symbolic information ... > > IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2) > 0xd054cb34 (pthread_kill+0xb4) 80410014 lwz r2,0x14(r1) > (dbx) where all > Thread $t1 > _sigsetmask(??, ??, ??) at 0xd054b288 > _p_sigaction(??, ??, ??) at 0xd054be68 > raise.sigaction(??, ??, ??) at 0xd0120d50 > signal(??, ??) at 0xd021f6f4 > SetupLogSoftSignals(), line 469 in "serverLog.c" > main(argc = 0, argv = (nil)), line 399 in "vlserver.c" > Thread $t2 > pthread_kill(??, ??) at 0xd054cb34 > _p_raise(??) at 0xd054bf84 > raise.raise(??) at 0xd0121020 > abort() at 0xd017ca64 > opr_AssertionFailed(file = (nil), line = 0), line 29 in "assert.c" > signalHandler(arg = (nil)), line 73 in "softsig.c" > > > Hopefully that sheds more light on the situation? > > Thank you! > > -Ben > > ________________________________ > From: Cheyenne Wills > Sent: Tuesday, May 16, 2023 6:04 AM > To: Ben Huntsman > Cc: openafs-devel@openafs.org > Subject: Re: [OpenAFS-devel] 1.8.x AIX support > > On Tue, 16 May 2023 04:46:40 +0000 > Ben Huntsman wrote: > > > Hi there- > > I see that the three AIX issues are being pulled up to 1.8.x. I > > just tried them out and found that we also need this one: > > > > 15106 > > > > We get a kernel panic on AIX as soon as afsd is started without > > 15106 applied. > > > > I pulled in 15106 and it compiles, but then I have another > > problem that is that many of the servers coredump immediately: > > > > # /opt/openafs/libexec/openafs/vlserver > > IOT/Abort trap(coredump) > > # dbx /opt/openafs/libexec/openafs/vlserver core > > Type 'help' for help. > > [using memory image in core] > > reading symbolic information ...warning: no source compiled with -g > > > > > > IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2) > > 0xd054cb34 (pthread_kill+0xb4) 80410014 lwz r2,0x14(r1) > > (dbx) where all > > Thread $t1 > > _sigsetmask(??, ??, ??) at 0xd054b288 > > _p_sigaction(??, ??, ??) at 0xd054be68 > > raise.sigaction(??, ??, ??) at 0xd0120d50 > > signal(??, ??) at 0xd021f6f4 > > .() at 0x100708e4 > > .() at 0x100019fc > > Thread $t2 > > pthread_kill(??, ??) at 0xd054cb34 > > _p_raise(??) at 0xd054bf84 > > raise.raise(??) at 0xd0121020 > > abort() at 0xd017ca64 > > .() at 0x10005f8c > > .() at 0x10071670 > > > > > > Are we missing another patch as well? Anyone have an idea what it > > might be? > > > > Thanks much! > > > > -Ben > > > > > > Can you try doing a build with --enable-debug as a configure option. > That should provide information for the backtrace. > > -- > Cheyenne Wills > cwills@sinenomine.net Looks like gerrit 14705 (from master) is needed as well -- Cheyenne Wills cwills@sinenomine.net From ben@huntsmans.net Tue May 16 21:28:01 2023 From: ben@huntsmans.net (Ben Huntsman) Date: Tue, 16 May 2023 20:28:01 +0000 Subject: [OpenAFS-devel] 1.8.x AIX support In-Reply-To: <20230516121053.52f8925c.cwills@sinenomine.net> References: <20230516070421.29414c9f.cwills@sinenomine.net> <20230516121053.52f8925c.cwills@sinenomine.net> Message-ID: --_000_MWHPR0701MB3674BBF00BF8617E0514D7D9A7799MWHPR0701MB3674_ Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Hi there! Yes, that was it! I pulled that in and it solved the issue. With 15106= and 14705 applied, pus the other three you already have proposed in gerrit= , the 1.8.x branch works on AIX. Thank you! -Ben ________________________________ From: Cheyenne Wills Sent: Tuesday, May 16, 2023 11:10 AM To: Ben Huntsman Cc: openafs-devel@openafs.org Subject: Re: [OpenAFS-devel] 1.8.x AIX support On Tue, 16 May 2023 17:21:58 +0000 Ben Huntsman wrote: > Hi there! > Here is the backtrace with a debug build: > > bash-4.2# dbx /opt/openafs/libexec/openafs/vlserver core > Type 'help' for help. > [using memory image in core] > reading symbolic information ... > > IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2) > 0xd054cb34 (pthread_kill+0xb4) 80410014 lwz r2,0x14(r1) > (dbx) where all > Thread $t1 > _sigsetmask(??, ??, ??) at 0xd054b288 > _p_sigaction(??, ??, ??) at 0xd054be68 > raise.sigaction(??, ??, ??) at 0xd0120d50 > signal(??, ??) at 0xd021f6f4 > SetupLogSoftSignals(), line 469 in "serverLog.c" > main(argc =3D 0, argv =3D (nil)), line 399 in "vlserver.c" > Thread $t2 > pthread_kill(??, ??) at 0xd054cb34 > _p_raise(??) at 0xd054bf84 > raise.raise(??) at 0xd0121020 > abort() at 0xd017ca64 > opr_AssertionFailed(file =3D (nil), line =3D 0), line 29 in "assert.c" > signalHandler(arg =3D (nil)), line 73 in "softsig.c" > > > Hopefully that sheds more light on the situation? > > Thank you! > > -Ben > > ________________________________ > From: Cheyenne Wills > Sent: Tuesday, May 16, 2023 6:04 AM > To: Ben Huntsman > Cc: openafs-devel@openafs.org > Subject: Re: [OpenAFS-devel] 1.8.x AIX support > > On Tue, 16 May 2023 04:46:40 +0000 > Ben Huntsman wrote: > > > Hi there- > > I see that the three AIX issues are being pulled up to 1.8.x. I > > just tried them out and found that we also need this one: > > > > 15106 > > > > We get a kernel panic on AIX as soon as afsd is started without > > 15106 applied. > > > > I pulled in 15106 and it compiles, but then I have another > > problem that is that many of the servers coredump immediately: > > > > # /opt/openafs/libexec/openafs/vlserver > > IOT/Abort trap(coredump) > > # dbx /opt/openafs/libexec/openafs/vlserver core > > Type 'help' for help. > > [using memory image in core] > > reading symbolic information ...warning: no source compiled with -g > > > > > > IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2) > > 0xd054cb34 (pthread_kill+0xb4) 80410014 lwz r2,0x14(r1) > > (dbx) where all > > Thread $t1 > > _sigsetmask(??, ??, ??) at 0xd054b288 > > _p_sigaction(??, ??, ??) at 0xd054be68 > > raise.sigaction(??, ??, ??) at 0xd0120d50 > > signal(??, ??) at 0xd021f6f4 > > .() at 0x100708e4 > > .() at 0x100019fc > > Thread $t2 > > pthread_kill(??, ??) at 0xd054cb34 > > _p_raise(??) at 0xd054bf84 > > raise.raise(??) at 0xd0121020 > > abort() at 0xd017ca64 > > .() at 0x10005f8c > > .() at 0x10071670 > > > > > > Are we missing another patch as well? Anyone have an idea what it > > might be? > > > > Thanks much! > > > > -Ben > > > > > > Can you try doing a build with --enable-debug as a configure option. > That should provide information for the backtrace. > > -- > Cheyenne Wills > cwills@sinenomine.net Looks like gerrit 14705 (from master) is needed as well -- Cheyenne Wills cwills@sinenomine.net --_000_MWHPR0701MB3674BBF00BF8617E0514D7D9A7799MWHPR0701MB3674_ Content-Type: text/html; charset="us-ascii" Content-Transfer-Encoding: quoted-printable
Hi there!
   Yes, that was it!  I pulled that in and it solved the iss= ue.  With 15106 and 14705 applied, pus the other three you already hav= e proposed in gerrit, the 1.8.x branch works on AIX.

Thank you!

-Ben

From: Cheyenne Wills <cw= ills@sinenomine.net>
Sent: Tuesday, May 16, 2023 11:10 AM
To: Ben Huntsman <ben@huntsmans.net>
Cc: openafs-devel@openafs.org <openafs-devel@openafs.org>
Subject: Re: [OpenAFS-devel] 1.8.x AIX support
 
On Tue, 16 May 2023 17:21:58 +0000
Ben Huntsman <ben@huntsmans.net> wrote:

> Hi there!
>    Here is the backtrace with a debug build:
>
> bash-4.2# dbx /opt/openafs/libexec/openafs/vlserver core
> Type 'help' for help.
> [using memory image in core]
> reading symbolic information ...
>
> IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2)
> 0xd054cb34 (pthread_kill+0xb4) 80410014     &= nbsp;      lwz   r2,0x14(r1)
> (dbx) where all
> Thread $t1
> _sigsetmask(??, ??, ??) at 0xd054b288
> _p_sigaction(??, ??, ??) at 0xd054be68
> raise.sigaction(??, ??, ??) at 0xd0120d50
> signal(??, ??) at 0xd021f6f4
> SetupLogSoftSignals(), line 469 in "serverLog.c"
> main(argc =3D 0, argv =3D (nil)), line 399 in "vlserver.c" > Thread $t2
> pthread_kill(??, ??) at 0xd054cb34
> _p_raise(??) at 0xd054bf84
> raise.raise(??) at 0xd0121020
> abort() at 0xd017ca64
> opr_AssertionFailed(file =3D (nil), line =3D 0), line 29 in "asse= rt.c"
> signalHandler(arg =3D (nil)), line 73 in "softsig.c"
>
>
> Hopefully that sheds more light on the situation?
>
> Thank you!
>
> -Ben
>
> ________________________________
> From: Cheyenne Wills <cwills@sinenomine.net>
> Sent: Tuesday, May 16, 2023 6:04 AM
> To: Ben Huntsman <ben@huntsmans.net>
> Cc: openafs-devel@openafs.org <openafs-devel@openafs.org>
> Subject: Re: [OpenAFS-devel] 1.8.x AIX support
>
> On Tue, 16 May 2023 04:46:40 +0000
> Ben Huntsman <ben@huntsmans.net> wrote:
>
> > Hi there-
> >    I see that the three AIX issues are being pulle= d up to 1.8.x.  I
> > just tried them out and found that we also need this one:
> >
> > 15106
> >
> >    We get a kernel panic on AIX as soon as afsd is= started without
> > 15106 applied.
> >
> >    I pulled in 15106 and it compiles, but then I h= ave another
> > problem that is that many of the servers coredump immediately: > >
> > # /opt/openafs/libexec/openafs/vlserver
> > IOT/Abort trap(coredump)
> > # dbx /opt/openafs/libexec/openafs/vlserver core
> > Type 'help' for help.
> > [using memory image in core]
> > reading symbolic information ...warning: no source compiled with = -g
> >
> >
> > IOT/Abort trap in pthread_kill at 0xd054cb34 ($t2)
> > 0xd054cb34 (pthread_kill+0xb4) 80410014    &n= bsp;       lwz   r2,0x14(r1)
> > (dbx) where all
> > Thread $t1
> > _sigsetmask(??, ??, ??) at 0xd054b288
> > _p_sigaction(??, ??, ??) at 0xd054be68
> > raise.sigaction(??, ??, ??) at 0xd0120d50
> > signal(??, ??) at 0xd021f6f4
> > .() at 0x100708e4
> > .() at 0x100019fc
> > Thread $t2
> > pthread_kill(??, ??) at 0xd054cb34
> > _p_raise(??) at 0xd054bf84
> > raise.raise(??) at 0xd0121020
> > abort() at 0xd017ca64
> > .() at 0x10005f8c
> > .() at 0x10071670
> >
> >
> > Are we missing another patch as well?  Anyone have an idea w= hat it
> > might be?
> >
> > Thanks much!
> >
> > -Ben
> >
> >
>
> Can you try doing a build with --enable-debug as a configure option. > That should provide information for the backtrace.
>
> --
> Cheyenne Wills
> cwills@sinenomine.net


Looks like gerrit 14705 (from master) is needed as well
--
Cheyenne Wills
cwills@sinenomine.net
--_000_MWHPR0701MB3674BBF00BF8617E0514D7D9A7799MWHPR0701MB3674_-- From mmeffie@sinenomine.net Fri May 19 20:13:46 2023 From: mmeffie@sinenomine.net (Michael Meffie) Date: Fri, 19 May 2023 15:13:46 -0400 Subject: [OpenAFS-devel] OpenAFS Release Team weekly meeting Message-ID: <20230519151346.62ebfe6e@lana> OpenAFS Release Team weekly meeting Date: May 18, 2023 Participants: - Stephan Wiesand, OpenAFS Release Manager - Ben Kaduk - Cheyenne Wills - Michael Meffie The OpenAFS Release Team meetings are held each Thursday at 12:00pm Eastern, 9:00am Pacific, on IRC channel #openafs-releaseteam of Libra.Chat. Stable (1.8.x) ============== Stephan reports the first draft of the 1.8.10 release notes is ready for review in gerrit. Stephan is targeting next week to release 1.8.10pre1. * 15440 Update NEWS for 1.8.10pre1 Two more fixes for AIX support have been merged to master and have been submitted to gerrit for 1.8.x: * 15425 AIX: Free pinned_heap during shutdown to prevent kernel panic * 15442 rxevent: fix mismatched #endif The latest AIX patch that Ben Huntsman submitted is for new compiler support. The others (15422/15424/15245 - on 1.8.x and 15106/14705) address build and runtime problems. Cheyenne reports success with Linux 6.4rc on master and openafs-stable-1_8_x. Discussion if 15423 can be submitted for 1.8.x. Stephan approved pushing it to gerrit for 1.8.x for review and discussion. * 15423 LINUX: Set .getattr for symlinks Development (1.9.x/master) ========================== Ben has some outstanding questions for Mark on gerrit 15428. 15428 vol: Don't leak volume bitmaps Recent Changes ============== Updated for 'openafs-stable-1_8_x' branch since 2023-05-12: 15443 LINUX: Set .getattr for symlinks 15425 AIX: Free pinned_heap during shutdown to prevent kernel panic 15442 rxevent: fix mismatched #endif 15441 opr: Avoid sigwait on SIGWAITING 15440 Update NEWS for 1.8.10pre1 15424 AIX: AFS_SMALLOCSIZ too small on newer OS releases 15422 Retire AFS_MOUNT_AFS Merged onto 'master' branch since 2023-05-12: 15404 afs: Check UHasTokens in afs_GCUserData 15427 rxkad: Free memory used to check rxkad response 15391 LINUX: Make 'fs flush*' invalidate dentry 15423 LINUX: Set .getattr for symlinks 12586 bozo: Do not create client directory and symlinks 15351 Add command fallback to server config 15419 AIX: Free pinned_heap during shutdown to prevent kernel panic Updated for 'master' branch since 2023-05-12: 15335 cmd: Do not leak tokens in cmd_ParseLine() 15089 bozo: Parse command lines with cmd_Tokenize() and cmd_Split() 15088 cmd: Introduce cmd_Tokenize() and cmd_Split() 15087 tests: Add cmd_ParseLine() checks 15086 cmd: Do not leak param in cmd_Parse() 15436 AIX: Detect IBM Open XL C 17.1+ (clang) on AIX 7.2+ 15450 AIX: Use MT_CC for libuafs CC 15449 export: Modernize code 15448 tsm41: Fix various errors in aix_aklog.c 15447 AIX: Don't specify -qlanglvl=stdc99 for libuafs 15446 AIX: Declare syscall() 15445 AIX: Avoid COMPAT_43 for clang 15444 AIX: Use 'static inline' with clang-ish compilers 15432 Remove almost all bcopy/bzero/bcmp calls 15435 dumpscan: Include time.h 15434 bozo: Declare tweak_config() void 15433 afsd: Properly declare aix_vmount and vmountdata 15430 lwp: Declare LWP_CreateProcess2 for AIX 15431 kauth: Do not redefine vfprintf 15438 tests: Avoid realpath(x, NULL) 15437 tests: Standardize src/tests includes 15428 vol: Don't leak volume bitmaps 15342 tests: Make src/tests buildable 15395 LINUX: Re-evaluate dentry if d_time > parent DV 15217 Fix various warnings on RHEL5 15429 macOS: Remove duplicated signing files from PKGROOT 15417 Linux: Fix cast-function-type w/disable-checking 12744 Do not merge: Check buildbot verification From Tracy.DiMarcoWhite@gs.com Wed May 24 15:09:21 2023 From: Tracy.DiMarcoWhite@gs.com (Di Marco White, Tracy J) Date: Wed, 24 May 2023 14:09:21 +0000 Subject: [OpenAFS-devel] 2023 AFS Technologies Workshop - virtual Message-ID: Hi all, I'd like to invite anyone interested in AFS to consider registering for the= 2023 AFS Technologies Workshop on June 12th, 13th, & 14th. The workshop is= again this year being hosted on Zoom, and registration is open at https://= events.zoom.us/e/view/vqOro4A_RfGeHNSVWD4Yvw with the list of talks and tim= ings listed there as well. I will work to also have the schedule on the wor= kshop website as soon as I can. The main talks will run from 9:30am until 3= pm Eastern time, with themed community discussion or social time for an hou= r before the start each day, and an hour after the talks end each day.. If you have any questions, please let us know. Tracy Di Marco White On behalf of the 2023 Workshop organizers ________________________________ Your Personal Data: We may collect and process information about you that m= ay be subject to data protection laws. For more information about how we us= e and disclose your personal data, how we protect your information, our leg= al basis to use your information, your rights and who you can contact, plea= se refer to: www.gs.com/privacy-notices From mmeffie@sinenomine.net Thu May 25 21:09:32 2023 From: mmeffie@sinenomine.net (Michael Meffie) Date: Thu, 25 May 2023 16:09:32 -0400 Subject: [OpenAFS-devel] OpenAFS Release Team weekly meeting Message-ID: <20230525160932.1e1e57e4@lana> --MP_/djBC=IP=+JMHNFordJUzrAS Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Content-Disposition: inline OpenAFS Release Team weekly meeting Date: May 25, 2023 Participants: - Stephan Wiesand, OpenAFS Release Manager - Cheyenne Wills - Michael Meffie - Mark Vitale The OpenAFS Release Team meetings are held each Thursday at 12:00pm Eastern, 9:00am Pacific, on IRC channel #openafs-releaseteam of Libra.Chat. Stable (1.8.x) ============== * Stephen merged the four changes for AIX and updated the NEWS file for 1.8.10pre1. * Two more changes to be merged to 1.8.x: - 15422, required for AIX 7 - 15443, fix for symlink ownership cache coherency * Stephan to release 1.8.10pre1 this week with those final changes. * Cheyenne reports success building 1.8.x for Linux 6.4rc. * Stephan pointed out the freebsd12 buildbot worker is failing with an out of disk space error. Mike to contact the owner and disable the gerrit builder until fixed. Development (1.9.x/master) ========================== No report this week. Recent Changes ============== Merged onto 'openafs-stable-1_8_x' branch since 2023-05-18: 15443 LINUX: Set .getattr for symlinks 15422 Retire AFS_MOUNT_AFS 15442 rxevent: fix mismatched #endif 15441 opr: Avoid sigwait on SIGWAITING 15425 AIX: Free pinned_heap during shutdown to prevent kernel panic 15424 AIX: AFS_SMALLOCSIZ too small on newer OS releases Updated for 'openafs-stable-1_8_x' branch since 2023-05-18: 15440 Update NEWS for 1.8.10pre1 Merged onto 'master' branch since 2023-05-18: 15404 afs: Check UHasTokens in afs_GCUserData 15427 rxkad: Free memory used to check rxkad response Updated for 'master' branch since 2023-05-18: 15446 AIX: Declare syscall() 15458 INSTALL: Update AIX notes 15462 clang: Fix redefined macro if fortify_source=2 15448 tsm41: Fix various errors in aix_aklog.c 15447 AIX: Don't specify -qlanglvl=stdc99 for libuafs 15457 AIX: Search harder for compiler path 15454 export: Use XCFLAGS64 for -q64 15436 AIX: Detect IBM Open XL C 17.1+ (clang) on AIX 7.2+ 15463 libadmin: Correctly fall back to rxnull in afscp 15430 lwp: Declare LWP_CreateProcess2 for AIX 15438 tests: Avoid realpath(x, NULL) 15444 AIX: Use 'static inline' with clang-ish compilers 15432 Remove almost all bcopy/bzero/bcmp calls 15456 cf: Set CC before calling AC_PROG_CC 15415 audit: Make audit operations threadsafe 15460 audit: Call osi_audit_check at initialization 15459 audit: Refactor osi_audit_check 14760 vos: Check for tty in vos restore 15089 bozo: Parse command lines with cmd_Tokenize() and cmd_Split() 15088 cmd: Introduce cmd_Tokenize() and cmd_Split() 15335 cmd: Do not leak tokens in cmd_ParseLine() 15087 tests: Add cmd_ParseLine() checks 15437 tests: Standardize src/tests includes 15405 RedHat: Support building RPMs with alternate CellServDB 15452 macos: Update SDK for Ventura 13 15450 AIX: Use MT_CC for libuafs CC 15449 export: Modernize code 15406 RedHat: Add RPM_RELNOTES, RPM_CHANGELOG, and RPM_CELLSERVDB 15445 AIX: Avoid COMPAT_43 for clang 15431 kauth: Do not redefine vfprintf 15433 afsd: Properly declare aix_vmount and vmountdata 15435 dumpscan: Include time.h 15434 bozo: Declare tweak_config() void 14711 vos: Check start-of-dump magic in vos restore 14710 vos: Check end-of-dump magic before deleting volume 15069 vos: Open dump file before deleting volume in vos restore 14778 vos: Avoid dumping volume to tty 14758 vos: Check end of dump magic when file is seekable 14777 usd: Add USD_IOCTL() is seekable check 14756 vos: Fix vos dump and restore dump file close error messages 15453 macos: Remove vestigial AFS_MOUNT_AFS references 15414 audit: Update comments 15399 doc: Move man page generation from regen.sh to make-release 15365 doc: Build man pages when pod2man is found 15455 Kill Irix pre-6.5 15363 doc: Fix the AFS::ukernel man page title 15364 doc: Generate man pages in batches 15387 doc: Stage man pages before installing 15461 vlserver: Implement GetCapabilities() RPC 15429 macOS: Remove duplicated signing files from PKGROOT 15086 cmd: Do not leak param in cmd_Parse() 15428 vol: Don't leak volume bitmaps --MP_/djBC=IP=+JMHNFordJUzrAS Content-Type: text/plain Content-Transfer-Encoding: 7bit Content-Disposition: attachment; filename=2023-05-25.txt OpenAFS Release Team weekly meeting Date: May 25, 2023 Participants: - Stephan Wiesand, OpenAFS Release Manager - Cheyenne Wills - Michael Meffie - Mark Vitale The OpenAFS Release Team meetings are held each Thursday at 12:00pm Eastern, 9:00am Pacific, on IRC channel #openafs-releaseteam of Libra.Chat. Stable (1.8.x) ============== * Stephen merged the four changes for AIX and updated the NEWS file for 1.8.10pre1. * Two more changes to be merged to 1.8.x: - 15422, required for AIX 7 - 15443, fix for symlink ownership cache coherency * Stephan to release 1.8.10pre1 this week with those final changes. * Cheyenne reports success building 1.8.x for Linux 6.4rc. * Stephan pointed out the freebsd12 buildbot worker is failing with an out of disk space error. Mike to contact the owner and disable the gerrit builder until fixed. Development (1.9.x/master) ========================== No report this week. Recent Changes ============== Merged onto 'openafs-stable-1_8_x' branch since 2023-05-18: 15443 LINUX: Set .getattr for symlinks 15422 Retire AFS_MOUNT_AFS 15442 rxevent: fix mismatched #endif 15441 opr: Avoid sigwait on SIGWAITING 15425 AIX: Free pinned_heap during shutdown to prevent kernel panic 15424 AIX: AFS_SMALLOCSIZ too small on newer OS releases Updated for 'openafs-stable-1_8_x' branch since 2023-05-18: 15440 Update NEWS for 1.8.10pre1 Merged onto 'master' branch since 2023-05-18: 15404 afs: Check UHasTokens in afs_GCUserData 15427 rxkad: Free memory used to check rxkad response Updated for 'master' branch since 2023-05-18: 15446 AIX: Declare syscall() 15458 INSTALL: Update AIX notes 15462 clang: Fix redefined macro if fortify_source=2 15448 tsm41: Fix various errors in aix_aklog.c 15447 AIX: Don't specify -qlanglvl=stdc99 for libuafs 15457 AIX: Search harder for compiler path 15454 export: Use XCFLAGS64 for -q64 15436 AIX: Detect IBM Open XL C 17.1+ (clang) on AIX 7.2+ 15463 libadmin: Correctly fall back to rxnull in afscp 15430 lwp: Declare LWP_CreateProcess2 for AIX 15438 tests: Avoid realpath(x, NULL) 15444 AIX: Use 'static inline' with clang-ish compilers 15432 Remove almost all bcopy/bzero/bcmp calls 15456 cf: Set CC before calling AC_PROG_CC 15415 audit: Make audit operations threadsafe 15460 audit: Call osi_audit_check at initialization 15459 audit: Refactor osi_audit_check 14760 vos: Check for tty in vos restore 15089 bozo: Parse command lines with cmd_Tokenize() and cmd_Split() 15088 cmd: Introduce cmd_Tokenize() and cmd_Split() 15335 cmd: Do not leak tokens in cmd_ParseLine() 15087 tests: Add cmd_ParseLine() checks 15437 tests: Standardize src/tests includes 15405 RedHat: Support building RPMs with alternate CellServDB 15452 macos: Update SDK for Ventura 13 15450 AIX: Use MT_CC for libuafs CC 15449 export: Modernize code 15406 RedHat: Add RPM_RELNOTES, RPM_CHANGELOG, and RPM_CELLSERVDB 15445 AIX: Avoid COMPAT_43 for clang 15431 kauth: Do not redefine vfprintf 15433 afsd: Properly declare aix_vmount and vmountdata 15435 dumpscan: Include time.h 15434 bozo: Declare tweak_config() void 14711 vos: Check start-of-dump magic in vos restore 14710 vos: Check end-of-dump magic before deleting volume 15069 vos: Open dump file before deleting volume in vos restore 14778 vos: Avoid dumping volume to tty 14758 vos: Check end of dump magic when file is seekable 14777 usd: Add USD_IOCTL() is seekable check 14756 vos: Fix vos dump and restore dump file close error messages 15453 macos: Remove vestigial AFS_MOUNT_AFS references 15414 audit: Update comments 15399 doc: Move man page generation from regen.sh to make-release 15365 doc: Build man pages when pod2man is found 15455 Kill Irix pre-6.5 15363 doc: Fix the AFS::ukernel man page title 15364 doc: Generate man pages in batches 15387 doc: Stage man pages before installing 15461 vlserver: Implement GetCapabilities() RPC 15429 macOS: Remove duplicated signing files from PKGROOT 15086 cmd: Do not leak param in cmd_Parse() 15428 vol: Don't leak volume bitmaps --MP_/djBC=IP=+JMHNFordJUzrAS--