[OpenAFS] Re: RHEL 7.5 beta / 3.10.0-830.el7.x86_66 kernel lock up

Stephan Wiesand stephan.wiesand@desy.de
Thu, 1 Feb 2018 17:11:24 +0100


Comparing the 1.6.22.2 module builds from the SL packaging, where the =
kABI hashes of the used symbols are stored as a requirement, is seems =
none of those hashes changed between -693 and -830.

There are two differences in the configure results:

-ac_cv_linux_header_sched_signal_h=3Dno
+ac_cv_linux_header_sched_signal_h=3Dyes

-ac_cv_linux_struct_file_operations_has_iterate=3Dno
+ac_cv_linux_struct_file_operations_has_iterate=3Dyes

And there's quite a bit of churn in include/linux.fs.h (and some in =
key.h).

> On 1. Feb 2018, at 16:58, Gary Gatling <gsgatlin@ncsu.edu> wrote:
>=20
> Ok. This gets weirder. Any directory under /afs says Not a directory. =
But I can read files like
>=20
> /afs/eos.ncsu.edu/software/inventory/software_inventory
>=20
> just fine.=20
>=20
> On Thu, Feb 1, 2018 at 10:55 AM, Gary Gatling <gsgatlin@ncsu.edu> =
wrote:
> I don't get a kernel panic but instead I get:
>=20
> [gsgatlin@localhost ~]$ ls /afs/
> ls: reading directory /afs/: Not a directory
> [gsgatlin@localhost ~]$=20
>=20
>=20
> which is pretty weird. I don't see anything in the syslog about =
problems with openafs
>=20
> Feb  1 10:44:24 localhost systemd: Starting OpenAFS Client Service...
> Feb  1 10:44:24 localhost kernel: libafs: loading out-of-tree module =
taints kernel.
> Feb  1 10:44:24 localhost kernel: libafs: module license =
'http://www.openafs.org/dl/license10.html' taints kernel.
> Feb  1 10:44:24 localhost kernel: Disabling lock debugging due to =
kernel taint
> Feb  1 10:44:24 localhost kernel: libafs: module verification failed: =
signature and/or required key missing - tainting kernel
> Feb  1 10:44:24 localhost kernel: Key type afs_pag registered
> Feb  1 10:44:24 localhost kernel: enabling dynamically allocated =
vcaches
> Feb  1 10:44:24 localhost kernel: Starting AFS cache scan...Memory =
cache: Allocating 1600 dcache entries...found 0 non-empty cache files =
(0%).
> Feb  1 10:44:24 localhost afsd: afsd: All AFS daemons started.
> Feb  1 10:44:24 localhost afsd: afsd: All AFS daemons started.
> Feb  1 10:44:24 localhost systemd: Started OpenAFS Client Service.
>=20
> I am using openafs-1.6.22
>=20
>=20
> with
>=20
> correct-m4-conditionals-in-curses.m4.patch
> linux-test-for-vfswrite-rather-than-vfsread.patch
> linux-use-kernelread-kernelwrite-when-vfs-varian.patch
>=20
> from the arch linux distro in my rpm packages.
>=20
> Anyone know what=20
>=20
> ls: reading directory /afs/: Not a directory
>=20
> means and is there some way around it?
>=20
> Also, is 1.6.22.2 coming out soon?
>=20
> Thanks so much,
>=20
> On Wed, Jan 31, 2018 at 9:43 AM, Kodiak Firesmith =
<kfiresmith@gmail.com> wrote:
> https://photos.app.goo.gl/WgPsSUCLK5ojxIuH3
>=20
>=20
> On Wed, Jan 31, 2018 at 9:41 AM, Kodiak Firesmith =
<kfiresmith@gmail.com> wrote:
> Folks, re-sending this because the first try never hit the list - =
perhaps mail with attachments are silently dropped or held for manual =
moderation?  I'd originally attached an image of the stack trace.  I'll =
host it and reply to this with a  URL link in case that would also =
result in a drop or moderation.
>=20
>=20
>=20
> Anyhow: =20
>=20
> In testing the new RHEL 7.5 beta, we've discovered that hosts using =
AFS fail to boot after the upgrade, with Openafs 1.6.22.1 installed. =20
>=20
> We are wondering if some of the non-guaranteed kernel ABIs that =
OpenAFS uses might have changed with the latest kernel provided in RHEL =
7. =20
>=20
> I've attached a picture of the trace.
>=20
> Anyone else kicking the tires on the new RHEL yet?
>=20
> Thanks!
>=20
>=20
>=20
>=20

--=20
Stephan Wiesand
DESY -DV-
Platanenallee 6
15738 Zeuthen, Germany