[ILUG] Booting a broken raid array

Kevin Philp kevin at cybercolloids.net
Fri Jan 16 12:04:47 GMT 2009


Conor Wynne wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Kenn Humborg wrote:
>   
>>> - partition as follows:
>>>
>>>   - /boot goes onto the compact flash card, set as the booting device
>>> in the bios
>>>   - create a slice of swap on each of the disks (say 500MB - 1GB each)
>>> at the start of the disk
>>>   - fill the rest of the disc with a software raid partition
>>>   - repeat for additional disks
>>>   - build desired raid level MD device on the partitions
>>>   - put a single LVM volume on top of that (for future expansion)
>>>       
>> If I read this right, your swap space is not on mirrored or
>> raided storage.  So, when one disk fails, the system won't
>> be able to swap data back in from that disk.  
>>
>> While this shouldn't cause a kernel crash in Linux (since
>> the kernel never swaps out its own data), it will crash
>> any user-land process that needs to swap in from that swap 
>> space.
>>     
>
> it's like when the OOM killer invokes, avoid a panic by killing off
> unimportant processes like your databases etc :-)
>
> To me that's just as bad as a panic as your going to end up rebooting
> either way.
>
>   
>> Your setup is protecting the data on the disks, but is not
>> really giving you any higher availability.
>>     
>
> Are we talking about a production environment or a home setup here?
> I assume it's home, and in which case all I would bother RAIDing
> anything other than the data. Even then I wouldn't bother wasting the
> power. I would use an offline backup like a USBHDD jobby. Only switching
> it on when required for a quick rsync, you'll find the disk lasts longer
> that way too!
>   
Its a small office setup - About 10 people - data files shared by NFS 
from the server. Currently the files are synced off-site every night 
using rsync over ssh and then once a week copied to an external hard 
drive, also held off site. So I can restore the system if it fails, 
given time. We have had to do this in the past when the server hard 
drive failed so I am quite happy our back up system works. However it 
takes time and also relies on me being there - as the only vaguely 
computer literate person in the company. As the company expands we need 
a better system.

So to make things a bit more robust I want to migrate the server to a 
RAID array for starters - I reckon hard drive failure is our most likely 
failure point. At a later stage we are thinking of having a second 
server on stand by as a backup.

Any suggestions welcome - I am no expert here.


> By all means, backup your /etc (it's generally tiny) and the data.
> How long does it take to re-install and restore data? If it's for home
> use, time is on your side. Production is a another matter.
>
>   
>> Later,
>> Kenn
>>
>>
>>     
>
> Conor.
>
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v1.4.7 (MingW32)
> Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
>
> iD8DBQFJcHIl0c4nF4jv4ukRAlW2AJ0d+4Q1cyZWvINpFFOu4iN09UkUsgCeNZk5
> SR6VqU3nH+jA31FrZzZU0C0=
> =V3v2
> -----END PGP SIGNATURE-----
>
>
>   




More information about the ILUG mailing list