Scaling up a Graphite cluster, I was restarting the carbon-cache
nodes on more powerful EC2 instances. All was going well until I hit
a box which came back without all its storage. Turns out that the
previous sysadmin (or devops helicopter pilot or whatever he calls
himself) hadn’t automated all of the things quite to the extent
where a box could restore its own disks. The mdadm
config hadn’t
been persisted to disk.
Setting aside for the moment how painful it is for me to have to do disk and filesystem manipulation on Linux, with its awful 1980s ideas and interfaces, here’s what I did to fix it. I’ll never remember this.
To get mdadm.conf
updated from the live config:
# /usr/share/mdadm/mkconf force-generate /etc/mdadm/mdadm.conf
(This can take a little while: don’t panic.)
Then, put the new config file into the initramfs
:
# update-initramfs -u
I also had to update the fstab
because that hadn’t been done
either.
/dev/md0 /opt/graphite/storage auto defaults,nobootwait 0 2