Joshua Baker-LePain wrote:
Do you have a test case that would show this up? I've been testing a
centos-4 based server with the RH-derived 2.6.9-based kernel tweaked to
disable 4K stacks and enable XFS and haven't run into any issues yet.
This includes running the parallel IOR benchmark from 10 clients (and
getting 200MiB/s throughput on reads).
For Jakob,
Note that the last OOPS I posted was for 2.6.11.10.
For Joshua,
We first saw the problem after 5 days in production, but since then
we took the server out of production and used the script
nfs_fsstress.sh located in this package:
http://prdownloads.sourceforge.net/ltp/ltp-full-20050505.tgz?download
We run the script on 5 client machines that are running RedHat 9
with kernel-smp-2.4.20-20.9 and nfs-utils-1.0.1-3.9.1.legacy and
are NFS mounting our 2.6 kernel server. The longest time to OOPS
has been about 8 hours. We have not tried the parallel IOR
benchmark. (Where can we get that?)
You didn't mention if you are using md at all. We have a
software RAID-0 of 4 x 3ware 8506-4 controllers running the
latest 3ware driver from their site. The filesystem is XFS.
The network driver is e1000 (two interfaces, not bonded). The
system is a dual Xeon. We upped the number of NFS daemons
from 8 to 64. The nfs_fsstress.sh client mounts the servers
both UDP and TCP, and our in-production oops likely happened
with a combination of both protocols in use simultaneously as
well. We've seen the OOPS with both the default and with 32K
read and write NFS block sizes. The machine was stable for
over a year with RedHat 9 and 2.4.20.
I'm grasping for any subtle details that might help...
Here is our list of loaded modules:
Our server configuration is
Module Size Used by
nfsd 185569 65
exportfs 9921 1 nfsd
lockd 59625 2 nfsd
md5 8001 1
ipv6 236769 16
parport_pc 29701 1
lp 15405 0
parport 37129 2 parport_pc,lp
sunrpc 135077 28 nfsd,lockd
xfs 487809 1
dm_mod 57925 0
video 19653 0
button 10577 0
battery 13253 0
ac 8773 0
uhci_hcd 33497 0
hw_random 9429 0
i2c_i801 11981 0
i2c_core 24513 1 i2c_i801
e1000 84629 0
bonding 59817 0
floppy 56913 0
ext3 117961 2
jbd 57177 1 ext3
raid0 11840 1
3w_xxxx 30561 4
sd_mod 20545 4
scsi_mod 116033 2 3w_xxxx,sd_mod
Let me know if there is anything else I can provide.
Thanks.
Greg
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
[Index of Archives]
[Kernel Newbies]
[Netfilter]
[Bugtraq]
[Photo]
[Stuff]
[Gimp]
[Yosemite News]
[MIPS Linux]
[ARM Linux]
[Linux Security]
[Linux RAID]
[Video 4 Linux]
[Linux for the blind]
[Linux Resources]