Re: I/O Scheduling results in poor responsiveness

Nathan Grennan wrote:
Why is the command below all that is needed to bring the system to its knees? Why does the I/O scheduler, CFQ, which is supposed to be all about fairness, starve other processes? For example, if I open a new file in vim and hold down "i" while this is running, the display of new "i"s will pause for seconds, sometimes until the dd write is completely finished. Another example: applications like firefox, thunderbird, xchat, and pidgin will stop refreshing for 10+ seconds.

 dd if=/dev/zero of=test-file bs=2M count=2048

I understand the main difference between using oflag=direct or not relates to whether the I/O scheduler is involved and whether the file is cached. I can see this clearly by watching "cached" rise without oflag=direct, stay the same with it, and drop way down when I delete the file after running dd without oflag=direct.
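
(For anyone who wants to watch the same thing, one way is to poll /proc/meminfo while dd runs; this is just an illustration, and the exact set of fields varies a little by kernel version:

  watch -n 1 'grep -E "^(Cached|Dirty|Writeback):" /proc/meminfo'

Cached climbs as the page cache fills, while Dirty and Writeback show how much data is still waiting to reach disk.)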

The system in question is running Fedora 8. It has an E6600, 4 GB of memory, and 2x300 GB Seagate SATA drives. The drives are set up with md RAID 1, and the filesystem is ext3. But I also see this on plenty of other systems with more CPU, less CPU, less memory, RAID, and no RAID.

I have tried various tweaks to the sysctl vm settings, and tried changing the scheduler to as or deadline. Nothing seems to get it to behave, other than oflag=direct.
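
(The vm settings in question are the dirty-page thresholds under /proc/sys/vm; the values below are only examples of the kind of tweak meant, not a recommendation:

  sysctl -w vm.dirty_background_ratio=5     # start background writeback sooner
  sysctl -w vm.dirty_ratio=10               # cap how much dirty data can accumulate

Lowering them limits how far writeback can fall behind before writers are throttled.)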

Using dd if=/dev/zero is just an easy test case. I see this when copying large files, creating large files, and using virtualization software that does heavy I/O on large files.



The command below seems to result in 0% CPU idle and 100% I/O wait, as shown by "vmstat 1".

dd if=/dev/zero of=test-file bs=2M count=2048

2048+0 records in
2048+0 records out
4294967296 bytes (4.3 GB) copied, 94.7903 s, 45.3 MB/s


The command below seems to work much better for responsiveness: the CPU idle will be around 50% and the I/O wait around 50%.

dd if=/dev/zero of=test-file2 bs=2M count=2048 oflag=direct

2048+0 records in
2048+0 records out
4294967296 bytes (4.3 GB) copied, 115.733 s, 37.1 MB/s


CFQ is optimized for throughput, not latency. When you're doing dd without oflag=direct, you're dirtying memory faster than it can be written to disk, so pdflush will spawn up to 8 threads (giving it 8 threads' worth of CFQ time), which starves out vim's extremely frequent syncing of its swap file.
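
You can watch this happen while the dd is running; the exact numbers depend on your kernel, so treat this as a rough way to see the effect rather than a precise measurement:

  ps ax | grep '[p]dflush'                  # extra pdflush threads appear under write load
  grep -E '^(Dirty|Writeback):' /proc/meminfo

Dirty will climb toward the limit set by /proc/sys/vm/dirty_ratio, and that backlog is what the pdflush threads are flushing while vim's sync waits its turn in the scheduler.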

The 8-thread behavior of pdflush is a bit of a hack, and upstream is working on pageout improvements that should obviate it, but that work is still experimental.

vim's behavior is a performance/robustness tradeoff, and is expected to be slow when the system is doing a lot of I/O.

As for your virtualization, this is why most virtualization software (including Xen and KVM) allows you to use a block device, such as a logical volume, to which it can do direct I/O, which takes pdflush out of the picture.
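
For example, KVM/qemu (depending on the version) can point a guest at a logical volume and request direct I/O with cache=none, which opens the device with O_DIRECT; the volume path here is made up:

  qemu-kvm -m 1024 -drive file=/dev/VolGroup00/guest1,cache=none

The rough equivalent under Xen is backing a domU with a phy: block device instead of a file.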

Ultimately, if latency is a high priority for you, you should switch to the deadline scheduler.
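
You can switch at runtime, per disk; with md RAID 1 you would repeat it for each member device:

  cat /sys/block/sda/queue/scheduler        # current scheduler is shown in brackets
  echo deadline > /sys/block/sda/queue/scheduler

To make it stick across reboots, add elevator=deadline to the kernel command line.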

-- Chris

