A serious gotcha with growing software RAID devices

May 12, 2009

Suppose that you have a fully mirrored system and you want to expand your available space by migrating to a pair of larger hard drives. The obvious way to do this is fairly straightforward, assuming that you can connect all of your drives at once: partition the new drives (with larger partitions for at least some of your software RAID devices), add the new partitions to the appropriate MD device, let things resync, and then detach the old disks (and eventually physically remove them).

(Hopefully the obvious way is also the right way.)

One of the things that you will probably have to do in order to do this is to grow the number of devices in your mirrors. This is simple to do; 'mdadm -G -n 4 /dev/md0' will make it so that /dev/md0 can now have a four-way mirror (instead of the two-way one it's probably set up with).

If you do this, your system will probably fail to come up the next time you reboot it.

(Specifically, it will report that it is unable to set up the mirror for your root filesystem.)

In the old days, the Linux kernel automatically assembled software RAID devices itself. These days, this is done by your initrd running mdadm to assemble the root mirror, which uses information from an embedded copy of /etc/mdadm.conf. One of the pieces of information in mdadm.conf is how many devices each mirror has.

(You can see where this is going.)

mdadm will refuse to assemble a mirror, like say the one for your root filesystem, if the RAID superblocks on the disks say that the mirror has a different number of devices than the configuration file says it has, even if everything else matches. When you added more devices to your root mirror you probably didn't update /etc/mdadm.conf, and especially you probably didn't rebuild your initrd after doing that. And who does? The whole point of modern Linux initrds is that you don't have to rebuild them.

In my barely printable opinion, this behavior of mdadm is asinine, especially in combination with an uninformative initrd environment; it is robotically correct but practically useless. In an initrd environment, the only things mdadm should care about are whether the array UUID matches and the device can be assembled intact, and I am not entirely convinced about the UUID.

(Disclaimer: I know this happens on Fedora 8. It is possible that more recent versions of mdadm have fixed this, although I don't see anything in the changelog.)


Comments on this page:

From 71.250.234.178 at 2009-05-12 10:45:48:

This is the perfect justification for checklists when doing anything remotely complicated. I feel stupid doing things by rote when walking down a checklist, but I'm glad pilots do it on the planes that I fly in. I see the value of them, for sure.

Matt Simmons
http://standalone-sysadmin.blogspot.com

By cks at 2009-05-12 11:09:37:

The problem isn't remembering to change /etc/mdadm.conf and rebuild your initrd, the problem is knowing that you have to do it in the first place.

Written on 12 May 2009.
« What affects how fast you can restore backups
Booting a Linux system without a root mirror »

Page tools: View Source, View Normal.
Search:
Login: Password:

Last modified: Tue May 12 00:27:56 2009
This dinky wiki is brought to you by the Insane Hackers Guild, Python sub-branch.