[OpenAFS] getcwd() error for RHEL 7.4 kernel

Matt Vander Werf mvanderw@nd.edu
Wed, 15 Nov 2017 13:02:15 -0500


--f4030438f0f02e559d055e09509e
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Hello,

Are there any updates or progress on a potential fix for this issue?
Anything we can do to help figure things out?

We are running into more and more users encountering the issue on systems
we have updated, forcing us to have to downgrade the kernel on them yet as
well (including the system we were able to reproduce it on and test with
before). Is there any other information we might provide before we do that?

Thank you.

--
Matt Vander Werf
HPC System Administrator
University of Notre Dame
Center for Research Computing - Union Station
506 W. South Street
South Bend, IN 46601
Phone: (574) 631-0692

On Wed, Nov 1, 2017 at 1:19 PM, Matt Vander Werf <mvanderw@nd.edu> wrote:

> Thanks for the update!
>
> Let us know if there's anything else you need from us. We're happy to tes=
t
> out any potential fixes, if you'd like more testing done.
>
> Thanks.
>
> --
> Matt Vander Werf
> HPC System Administrator
> University of Notre Dame
> Center for Research Computing - Union Station
> 506 W. South Street
> South Bend, IN 46601
> Phone: (574) 631-0692
>
> On Mon, Oct 30, 2017 at 11:15 AM, Mark Vitale <mvitale@sinenomine.net>
> wrote:
>
>> Matt,
>>
>> > On Oct 28, 2017, at 9:38 AM, Matt Vander Werf <mvanderw@nd.edu> wrote:
>> >
>> > Attached is the output from running the command 'lsof /afs' after
>> running both 'echo 2 > /proc/sys/vm/drop_caches' and 'fs flushall' on th=
e
>> system we're testing with the updated kernel. Is this what you were look=
ing
>> for? Let me know if you were wanting something different at all.
>> Thank you for the lsof output, it=E2=80=99s helpful.
>>
>> >
>> > We do still have users actively using AFS on the system in question
>> (who are not having any issues, as far as we know).
>> >
>> > Did the straces I provided help at all? Or did they not give any usefu=
l
>> details?
>> The straces provided confirmation of the getcwd() ENOENT symptom.
>>
>> > Let me know if there=E2=80=99s anything else we can try or any other i=
nfo we
>> can provide.
>> I am now able to reproduce the problem at will on my test system, so I
>> don=E2=80=99t expect I will need anything more from you for now.  Thank =
you for all
>> you provided, and we will keep you informed if there is any progress.
>>
>> Regards,
>> =E2=80=94-
>> Mark Vitale
>> OpenAFS release team
>>
>
>

--f4030438f0f02e559d055e09509e
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div><div>Hello,<br><br></div>Are there any updates or pro=
gress on a potential fix for this issue? Anything we can do to help figure =
things out?<br></div><div><br></div><div>We are running into more and more =
users encountering the issue on systems we have updated, forcing us to have=
 to downgrade the kernel on them yet as well (including the system we were =
able to reproduce it on and test with before). Is there any other informati=
on we might provide before we do that?<br></div><div><br></div>Thank you.<b=
r><div><div class=3D"gmail_extra"><br clear=3D"all"><div><div class=3D"m_-1=
79848824641958541gmail_signature" data-smartmail=3D"gmail_signature"><div d=
ir=3D"ltr"><div>--<br></div><div>Matt Vander Werf<br>HPC System Administrat=
or<br>University of Notre Dame<br>Center for Research Computing - Union Sta=
tion<br>506 W. South Street<br>South Bend, IN 46601<br></div>Phone: <a href=
=3D"tel:(574)%20631-0692" value=3D"+15746310692" target=3D"_blank">(574) 63=
1-0692</a></div></div></div>
<br><div class=3D"gmail_quote">On Wed, Nov 1, 2017 at 1:19 PM, Matt Vander =
Werf <span dir=3D"ltr">&lt;<a href=3D"mailto:mvanderw@nd.edu" target=3D"_bl=
ank">mvanderw@nd.edu</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_qu=
ote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex=
"><div dir=3D"ltr"><div><div>Thanks for the update!<br><br></div>Let us kno=
w if there&#39;s anything else you need from us. We&#39;re happy to test ou=
t any potential fixes, if you&#39;d like more testing done.<br></div><div><=
br></div>Thanks.<br></div><div class=3D"gmail_extra"><span><br clear=3D"all=
"><div><div class=3D"m_-179848824641958541m_-5787757725593875639gmail_signa=
ture" data-smartmail=3D"gmail_signature"><div dir=3D"ltr"><div>--<br></div>=
<div>Matt Vander Werf<br>HPC System Administrator<br>University of Notre Da=
me<br>Center for Research Computing - Union Station<br>506 W. South Street<=
br>South Bend, IN 46601<br></div>Phone: <a href=3D"tel:(574)%20631-0692" va=
lue=3D"+15746310692" target=3D"_blank">(574) 631-0692</a></div></div></div>
<br></span><div><div class=3D"m_-179848824641958541h5"><div class=3D"gmail_=
quote">On Mon, Oct 30, 2017 at 11:15 AM, Mark Vitale <span dir=3D"ltr">&lt;=
<a href=3D"mailto:mvitale@sinenomine.net" target=3D"_blank">mvitale@sinenom=
ine.net</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" style=3D=
"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Matt,<br>
<span><br>
&gt; On Oct 28, 2017, at 9:38 AM, Matt Vander Werf &lt;<a href=3D"mailto:mv=
anderw@nd.edu" target=3D"_blank">mvanderw@nd.edu</a>&gt; wrote:<br>
&gt;<br>
&gt; Attached is the output from running the command &#39;lsof /afs&#39; af=
ter running both &#39;echo 2 &gt; /proc/sys/vm/drop_caches&#39; and &#39;fs=
 flushall&#39; on the system we&#39;re testing with the updated kernel. Is =
this what you were looking for? Let me know if you were wanting something d=
ifferent at all.<br>
</span>Thank you for the lsof output, it=E2=80=99s helpful.<br>
<span><br>
&gt;<br>
&gt; We do still have users actively using AFS on the system in question (w=
ho are not having any issues, as far as we know).<br>
&gt;<br>
&gt; Did the straces I provided help at all? Or did they not give any usefu=
l details?<br>
</span>The straces provided confirmation of the getcwd() ENOENT symptom.<br=
>
<span><br>
&gt; Let me know if there=E2=80=99s anything else we can try or any other i=
nfo we can provide.<br>
</span>I am now able to reproduce the problem at will on my test system, so=
 I don=E2=80=99t expect I will need anything more from you for now.=C2=A0 T=
hank you for all you provided, and we will keep you informed if there is an=
y progress.<br>
<br>
Regards,<br>
=E2=80=94-<br>
<div class=3D"m_-179848824641958541m_-5787757725593875639HOEnZb"><div class=
=3D"m_-179848824641958541m_-5787757725593875639h5">Mark Vitale<br>
OpenAFS release team</div></div></blockquote></div><br></div></div></div>
</blockquote></div><br></div></div></div>

--f4030438f0f02e559d055e09509e--