[OpenAFS-devel] anyone tried linux > 2.4.14?
Neulinger, Nathan
nneul@umr.edu
Tue, 27 Nov 2001 11:14:51 -0600
UP kernel didn't help, same hang.
-- Nathan
------------------------------------------------------------
Nathan Neulinger EMail: nneul@umr.edu
University of Missouri - Rolla Phone: (573) 341-4841
Computing Services Fax: (573) 341-4216
> -----Original Message-----
> From: Neulinger, Nathan
> Sent: Tuesday, November 27, 2001 11:08 AM
> To: 'openafs-devel@openafs.org'
> Subject: [OpenAFS-devel] anyone tried linux > 2.4.14?
>
>
> I'm getting mysterious AFS lockups with 2.4.15 or 2.4.16. Nothing gets
> put in D state; the processes just appear to hang.
>
> It looks to me like the process is spinning in a read and sucking CPU
> like mad, but the syscall never completes. strace shows no further
> output. The process is definitely not killable.
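
One way to confirm the "not in D state but burning CPU" observation is to sample /proc/<pid>/stat twice and compare the CPU tick counters. The following is only a minimal sketch, not something from the original report: the function names are hypothetical, and it assumes the usual field layout of /proc/<pid>/stat (state in field 3, utime and stime in fields 14 and 15).

```python
import time


def parse_proc_stat(line):
    """Return (state, utime_ticks, stime_ticks) from one /proc/<pid>/stat line."""
    # The command name is wrapped in parentheses and may itself contain
    # spaces or ')', so cut at the last ')' instead of splitting blindly.
    rest = line[line.rindex(")") + 1:].split()
    # rest[0] is field 3 (state); utime and stime are fields 14 and 15,
    # which land at indexes 11 and 12 of the remainder.
    return rest[0], int(rest[11]), int(rest[12])


def read_proc_stat(pid):
    """Read and parse /proc/<pid>/stat for a running process."""
    with open("/proc/%d/stat" % pid) as f:
        return parse_proc_stat(f.read())


def cpu_delta(pid, interval=1.0):
    """Sample the process twice; return (state, user_ticks, system_ticks) used."""
    _, u1, s1 = read_proc_stat(pid)
    time.sleep(interval)
    state, u2, s2 = read_proc_stat(pid)
    return state, u2 - u1, s2 - s1
```

Calling cpu_delta(806) on the hung process would be expected to show state R with the system tick count climbing if it really is spinning inside the kernel, whereas a process blocked on I/O would show state D with little or no change.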
>
> Interestingly, other AFS accesses on the machine appear fine. However,
> if you try to ls the same directory that was being read when the
> process hung, the ls will hang as well. So it's possible that a
> directory read is what is failing, but it appears not:
>
> Here is the tail end of an strace:
>
>
> 806 fstat64(3, {st_dev=makedev(0, 9), st_ino=2050626764, st_mode=S_IFREG|0755, st_nlink=1, st_uid=1, st_gid=0, st_blksize=4096, st_blocks=12, st_size=6083, st_atime=2001/10/26-09:24:37, st_mtime=2001/10/26-09:24:37, st_ctime=2001/10/26-09:24:37}) = 0
> 806 close(3) = 0
> 806 lstat64("/umr/s/openafs/.oldfiles/openafs/src/WINNT/doc/install/Documentation/ja_JP/html/CmdRef/auarf131.htm", {st_dev=makedev(0, 9), st_ino=2050626766, st_mode=S_IFREG|0755, st_nlink=1, st_uid=1, st_gid=0, st_blksize=4096, st_blocks=18, st_size=8538, st_atime=2001/10/26-09:24:37, st_mtime=2001/10/26-09:24:37, st_ctime=2001/10/26-09:24:37}) = 0
> 806 open("/umr/s/openafs/.oldfiles/openafs/src/WINNT/doc/install/Documentation/ja_JP/html/CmdRef/auarf131.htm", O_RDONLY|O_LARGEFILE) = 3
> 806 write(1, "umr/s/openafs/.oldfiles/openafs/src/WINNT/doc/install/Documentation/ja_JP/html/CmdRef/auarf131.htm\n", 99) = 99
> 806 read(3,
>
>
> Note the hang in the read. If I reboot and restart the tar, it will
> hang somewhere else, usually further along.
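
For anyone trying to reproduce this without tar, the lstat/open/read pattern in the strace can be approximated with a short script. This is a rough sketch only, not the tar invocation from the report: read_tree and its chunk_size parameter are made-up names, and it assumes the hang depends only on reading file data from AFS, which the report does not establish.

```python
import os
import stat


def read_tree(root, chunk_size=64 * 1024):
    """lstat, open, and fully read every regular file under root.

    Returns the number of files read to completion.
    """
    files_read = 0
    for dirpath, dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            st = os.lstat(path)
            if not stat.S_ISREG(st.st_mode):
                continue  # skip symlinks, devices, and other non-regular files
            # Print the name before reading, so the last line of output
            # identifies the file whose read() never returned.
            print(path, flush=True)
            with open(path, "rb") as f:
                while f.read(chunk_size):
                    pass
            files_read += 1
    return files_read
```

Running read_tree("/umr/s/openafs/.oldfiles") would walk the same tree as in the strace; if it hangs, the last path printed is the file being read at the time.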
>
> This is on a clean 2.4.16 with only tiny local patches that shouldn't
> have any impact (console blanking, semopm, and numfiles). It is
> configured with highmem-4gb support, and is an SMP kernel running on a
> UP box.
>
> I'm going to try a UP kernel and see if that makes any difference...
> (We've been trying to standardize on a single kernel build. Makes
> managing lots of machines easier.)
>
> Any ideas?
>
> -- Nathan