[OpenAFS] disk cache read error in CacheItems

Martin Flemming martin.flemming@desy.de
Tue, 23 Oct 2018 07:35:55 +0200 (CEST)


Hi !

In the last few days we've observed an increasing number of Nodes,
which are no longer be reached and have to be rebooted

In the /var/log/messages we see a lot of lines with e.g.

Oct 22 18:48:26 bird858 kernel: afs: disk cache read error in CacheItems slot 25254 off 2020340/13880020 code -5/80
Oct 22 18:48:26 bird858 kernel: afs: disk cache read error in CacheItems slot 25253 off 2020260/13880020 code -5/80
Oct 22 18:48:26 bird858 kernel: afs: disk cache read error in CacheItems slot 25252 off 2020180/13880020 code -5/80
Oct 22 18:48:26 bird858 kernel: afs: disk cache read error in CacheItems slot 25251 off 2020100/13880020 code -5/80

till nothing happens anymore ...

The clients are  Centos 7.5 , 3.10.0-862.14.4.el7.x86_64, OpenAFS 1.6.23 built 2018-09-12 (289.sl7.862.11.6@fnal.gov)

Any hints for the possible reason ?

Thanks & Cheers,

        Martin