[OpenAFS] openafs-server does not recover from crash

Pascal Salet pascal.salet@wu.ac.at
Thu, 10 Mar 2022 13:28:48 +0100


Am 09.03.22 um 19:09 schrieb Mark Vitale:
> Pascal,
>=20
>> On 9 Mar 2022, at 11:26 AM, Pascal Salet <pascal.salet@wu.ac.at> wrote=
:
>>
>> our openafs-server has stopped working after a crash.
>>
>> "bos status" shows all services online for all fileservers and DBserve=
rs.
>>
>> udebug port 7003 works correctly from all fileservers and DBservers.
>>
>> However, "strace -p $(pidof salvageserver)" shows an error:
>> connect(6, {sa_family=3DAF_UNIX, sun_path=3D"/var/lib/openafs/local/fs=
sync.sock"}, 110) =3D -1 ECONNREFUSED (Connection refused)
>>
>> SalsrvLog:
>> Wed Mar 09 16:09:31 2022 @(#)OpenAFS 1.8.2-1-debian 2018-09-12
>=20
> Unfortunately this version of OpenAFS has the Rx CID bug:
>    http://openafs.org/frameset/dl/openafs/1.8.7/RELNOTES-1.8.7
>=20
> This bug may only become apparent on the first restart after Jan 15, 20=
21.
> You must upgrade all your clients and servers to OpenAFS 1.8.7 or highe=
r.
>=20
> Regards,
> --
> Mark Vitale
> mvitale@sinenomine.net


Mark, thank you very much for your advice!

Upgrading to OpenAFS 1.8.6-5 (Debian) solved the problem.

Pascal

--=20
Pascal Salet
IT-Services / Server Infrastructure
Wirtschaftsuniversit=C3=A4t Wien / Vienna University of Economics and=20
Business / Austria
pascal.salet@wu.ac.at / +43-676-8213-5375