[OpenAFS] getcwd() error for RHEL 7.4 kernel

Benjamin Kaduk kaduk@mit.edu
Wed, 18 Oct 2017 18:21:23 -0500


On Tue, Oct 17, 2017 at 11:55:27AM -0400, Jacob Bonek wrote:
> Hello,
> 
> We're having some strange issues with OpenAFS lately.
> 
> It started after installing the base RHEL 7.4 kernel, 3.10.0-693.el7.x86_64
> back in August, with the latest version of OpenAFS client at the time,
> 1.6.21. We've tried using the now latest version, 1.6.21.1, and still have
> the same issues. This happens with all the subsequent RHEL 7.4 kernels as
> well, including the latest kernel, 3.10.0-693.2.2.el7.x86_64.
> 
[...]
> 
> This is a major issue that has caused us to have to stay at the latest
> pre-RHEL 7.4 kernel for a long time now while this issue has existed. This
> may be related to previous issues with getcwd() but something in the RHEL
> 7.4 kernel seems to have made it much worse. Simply rebooting a system does
> not fix it, nor does clearing the AFS cache.
> 
> Has anyone else experienced this issue with RHEL 7.4? Is there anything
> that we can do to narrow down what is causing this?

I think we've seen another report or two, but it's always been hard to
reproduce.  That said, with the specifics you've offered about the kernel
version that introduced the issue, we've got a couple folks trying to
reproduce in a controlled environment.

In the meantime, could you post an (openafs) config.log from one of the
affected systems?  It's pretty long, so maybe as an attachment for
mail to openafs-bugs@openafs.org is best.

Thanks,

Ben
OpenAFS Guardian