[ILUG] Disk throughput on current generation server hardware
Niall O Broin
niall at linux.ie
Wed Oct 28 08:32:54 GMT 2009
I have a few HP DL380 servers, with 2 x quad core 2.66 GHz Xeons, 12
GB RAM, and 8 x 300 GB 10K RPM SAS disks in a RAID6 array on a Smart
Array P410i controller. The servers were originally deployed with
SLES10SP2 where they proved very unreliable - sustained disk I/O would
kill them.
After spending a long time with HP on support tickets, where we were
told to do such inane things as update the ILO firmware (because of
course that's intimately involved in disk I/O) we eventually installed
SLES10SP3 on one of them and SLES11 on another, and they seem to have
ceased dying, which is of course A Good Thing. However, the disk
throughput is now a bit disappointing. The simple test we were using
to exercise the disks was to run in a loop the following:
dd if=/dev/cciss/c0d0p4 of=/opt/dd_test count=20000 bs=1M
I know we could do much 'better' tests with bonnie or some such but
our initial objective was to provoke the failures which we were seeing
in normal use, and which we suspected were disk I/O related, and the
dd test did that just nicely. It gave us a means of relatively quickly
seeing if HP's latest suggestion was of any use.
The dd test gave the following approximate throughput figures:
OS release Kernel version Throughput
SLES10SP2 2.6.16.60-0.21-smp 100 MB/s
SLES10SP3 2.6.16.60-0.54.5-smp 50 MB/s
SLES11 2.6.27.19-5-default 30 MB/s
SLES10SP2 is of course the outright winner here - unfortunately that
speed came at the cost of stability.
The reason for this post is to get people's views as to what kind of
throughput I should reasonably be able to expect on that class of
hardware.
Niall
More information about the ILUG
mailing list