[ILUG] Systems crashing on disk activity

kevin lyda kevin+dated+1082727834.777d81 at ie.suberic.net
Sun Apr 18 14:43:45 IST 2004


On Sun, Apr 18, 2004 at 02:34:50PM +0100, Niall O Broin wrote:
> So, tomorrow I'll be having a rather fraught (I imagine) telecon with
> the people from Rackspace and I'm wondering what to say to them.

what does /var/log/messages say?  how about the console?

> It would seem that the chances of getting 3 servers deployed, all of
> which has a similar hardware fault, is very small (of course, I could be
> after stumbling on a bad mother board batch - I'm assuming that these
> boxes have mobo integrated RAID controllers). That leaves a kernel
> problem. The kernel is 2.4.21-9.0.1.EL which hopefully means that we'll
> be able to utilise Red Hat's support to help investigate the kernel if
> that becomes necessary.
> 
> Do any of you have any ideas about this, or have encountered anything
> remotely similar?

i worked in an office once where a slew of unixware boxes atarted
dying.  in the end they all had the same model and revision number on
their scsi disks.  so we swapped them all out and the crashes stopped.

i'd try canceling out one of the two major variables: either use
different hardware or a different kernel.  and go nuts on the different
hardware - different cpu, different raid card, different disks.  and if
rackspace can't do that, do it somewhere else.

kevin

-- 
kevin at ie.suberic.net  ~  "you're either with us or against us." --gw bush.
                         in that simplistic world-view liberals and
                         progressives are "us" and bush and bin laden are
                         "against us."
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://mail.linux.ie/pipermail/ilug/attachments/20040418/fcf9b919/attachment.pgp


More information about the ILUG mailing list