Go to file
Krzysztof Wojcik bb025c2f22 Add raid10 -> raid0 takeover support
The patch introduces takeover from level 10 to level 0 for imsm
metadata. This patch contains procedures connected with preparing
and applying metadata update during 10 -> 0 takeover.
When performing takeover 10->0 mdmon should update the external
metadata (due to disk slot and level changes).
To achieve that mdadm calls reshape_super() and prepare
the "update_takeover" metadata update type.
Prepared update is processed by mdmon in process_update().

Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com>
Signed-off-by: NeilBrown <neilb@suse.de>
2011-01-26 08:50:37 +10:00
misc mdadm-1.8.0 2004-11-01 04:49:34 +00:00
tests Enable tests for OLCE, takeover, migrations for imsm metadata 2010-12-26 21:59:14 +11:00
.gitignore .gitignore update 2009-04-14 10:19:04 +10:00
ANNOUNCE-3.0 Release mdadm-3.0 2009-06-02 15:37:56 +10:00
ANNOUNCE-3.1 Release 3.1 2009-10-22 14:07:05 +11:00
ANNOUNCE-3.1.1 Release mdadm-3.1.1 2009-11-19 16:10:58 +11:00
ANNOUNCE-3.1.2 Add ANNOUNCE-3.1.2 2010-04-07 09:13:16 +10:00
ANNOUNCE-3.1.3 Release mdadm-3.1.3 2010-08-06 16:55:23 +10:00
ANNOUNCE-3.1.4 Release mdadm-3.1.4 2010-08-31 17:21:13 +10:00
ANNOUNCE-3.0.1 Release mdadm-3.0.1 2009-09-25 17:08:19 +10:00
ANNOUNCE-3.0.2 Release mdadm-3.0.2 2009-09-25 18:19:07 +10:00
ANNOUNCE-3.0.3 Release 3.0.3 2009-10-22 12:05:22 +11:00
Assemble.c Assemble: allow to assemble spares on their own 2011-01-05 13:54:18 +11:00
bitmap.c Compile with -Wextra by default 2010-08-05 13:13:02 +10:00
bitmap.h Remove spaces/tabs from ends of lines. 2007-12-14 20:13:43 +11:00
Build.c Improve type names for mddev_dev 2010-11-22 20:58:05 +11:00
ChangeLog Release mdadm-3.1.4 2010-08-31 17:21:13 +10:00
config.c Remove content from mddev_dev 2010-11-22 20:58:05 +11:00
COPYING mdctl-0.6 2002-03-06 23:17:40 +00:00
crc32.c Add crc32 files. 2008-05-15 16:48:10 +10:00
crc32.h Add crc32 files. 2008-05-15 16:48:10 +10:00
Create.c Create/grow: improve checks on number of devices. 2010-12-01 14:51:27 +11:00
Detail.c imsm: set imsm spare uuid to 0 2010-12-26 21:59:31 +11:00
dlink.c mdadm-1.5.0 2004-01-22 02:10:29 +00:00
dlink.h This is to avoid gcc warnings when building with strict-aliasing optimization 2006-05-29 02:06:32 +00:00
Examine.c Don't close fds in write_init_super 2011-01-25 07:56:53 +11:00
external-reshape-design.txt Refactor reshape monitoring. 2011-01-06 15:58:32 +11:00
Grow.c Add raid10 -> raid0 takeover support 2011-01-26 08:50:37 +10:00
Incremental.c Incremental: move suitable spares to container when subarrays started. 2011-01-05 14:42:27 +11:00
INSTALL mdadm-0.8.1 2002-04-05 22:00:28 +00:00
inventory Release mdadm-3.1.4 2010-08-31 17:21:13 +10:00
kernel-patch-2.6.18 Add new mode: --incremental 2006-12-21 17:10:52 +11:00
kernel-patch-2.6.18.6 Add new mode: --incremental 2006-12-21 17:10:52 +11:00
kernel-patch-2.6.19 Add new mode: --incremental 2006-12-21 17:10:52 +11:00
kernel-patch-2.6.25 Fix kernel patch 2008-07-12 20:27:39 +10:00
kernel-patch-2.6.27 mdmon: periodically retry to create the socket 2008-10-15 14:15:52 -07:00
Kill.c open_subarray: pass subarray name as explicit arg. 2010-11-22 19:35:25 +11:00
makedist Release mdadm-3.1.3 2010-08-06 16:55:23 +10:00
Makefile extension of IncrementalRemove to store location (path-id) of removed device 2010-11-22 20:58:06 +11:00
Manage.c Don't close fds in write_init_super 2011-01-25 07:56:53 +11:00
managemon.c Detect level change 2011-01-06 19:17:29 +11:00
mapfile.c Improve mddev_ident type definitions. 2010-11-22 20:58:05 +11:00
md_p.h FIX: Bad block verification during assembling array 2010-12-26 21:41:57 +11:00
md_u.h Remove spaces/tabs from ends of lines. 2007-12-14 20:13:43 +11:00
md.4 md.4: various improvements to new section on scrubbing. 2010-01-29 10:21:56 +11:00
md5.h mdadm fix compilation for uClibc 2009-02-02 09:53:51 +11:00
mdadm.8.in Allow --update=devicesize with --re-add 2010-12-09 13:06:29 +11:00
mdadm.c Allow --update=devicesize with --re-add 2010-12-09 13:06:29 +11:00
mdadm.conf-example mdadm.conf: fix AUTO typo 2010-06-05 08:02:11 +10:00
mdadm.conf.5 config: add 'homehost' option to 'AUTO' line. 2010-03-03 14:33:55 +11:00
mdadm.h Add 'restart' arg to various functions used for reshaping. 2011-01-17 09:53:56 +11:00
mdadm.spec Release mdadm-3.1.4 2010-08-31 17:21:13 +10:00
mdassemble.8 Release mdadm-3.1.4 2010-08-31 17:21:13 +10:00
mdassemble.c Improve mddev_ident type definitions. 2010-11-22 20:58:05 +11:00
mdmon-design.txt mdmon-design.txt 2010-12-16 22:12:26 +11:00
mdmon.8 Release mdadm-3.1.4 2010-08-31 17:21:13 +10:00
mdmon.c Make child_monitor a candidate for ->manage_reshape 2011-01-12 14:46:17 +11:00
mdmon.h mdmon: when a reshape is detected, add any newly added devices to the array. 2010-12-16 09:07:52 +11:00
mdopen.c Free some malloced memory that wasn't being freed. 2009-10-22 11:00:56 +11:00
mdstat.c FIX: Position calculation in mdstat_by_subdev 2011-01-06 16:07:20 +11:00
mkinitramfs Guides on how to use mdadm with initramfs 2005-12-05 05:56:42 +00:00
monitor.c FIX: sync_completed == 0 causes reshape cancellation in metadata 2011-01-17 12:44:52 +11:00
Monitor.c fix: Monitor: min_size must be set to 0 2011-01-17 12:46:14 +11:00
msg.c Remove stray 'free' in block_monitor. 2010-12-21 09:14:10 +11:00
msg.h block monitor: freeze spare assignment for external arrays 2010-11-23 15:00:54 +11:00
part.h Add mbr pseudo metadata handler. 2010-09-06 11:26:28 +10:00
platform-intel.c Compile with -Wextra by default 2010-08-05 13:13:02 +10:00
platform-intel.h create: Check with OROM limit before setting default chunk size 2010-06-15 18:41:53 -07:00
policy.c Policy is aware of metadata disk's controller domains. 2010-11-22 20:58:07 +11:00
probe_roms.c Compile with -Wextra by default 2010-08-05 13:13:02 +10:00
probe_roms.h platform: relax rom scanning alignment for ahci platforms 2009-07-31 17:11:41 -07:00
pwgr.c Improve compiling for static binaries. 2006-05-29 04:09:21 +00:00
Query.c get_info_super: report which other devices are thought to be working/failed. 2010-11-22 19:35:25 +11:00
raid5extend.c Remove spaces/tabs from ends of lines. 2007-12-14 20:13:43 +11:00
ReadMe.c Assemble: allow an array undergoing reshape to be started without backup file 2010-12-01 11:47:32 +11:00
README.initramfs Guides on how to use mdadm with initramfs 2005-12-05 05:56:42 +00:00
restripe.c FIX: Do not use layout for raid4 and raid0 while geo map computing 2010-12-03 15:03:25 +11:00
sg_io.c update copyright headers 2008-10-28 10:55:29 -07:00
sha1.c Make homehost information appear in superblock. 2006-05-19 06:56:06 +00:00
sha1.h Make homehost information appear in superblock. 2006-05-19 06:56:06 +00:00
super-ddf.c Don't close fds in write_init_super 2011-01-25 07:56:53 +11:00
super-gpt.c Remove subarray detection from load_super. 2010-11-22 20:24:50 +11:00
super-intel.c Add raid10 -> raid0 takeover support 2011-01-26 08:50:37 +10:00
super-mbr.c Remove subarray detection from load_super. 2010-11-22 20:24:50 +11:00
super1.c Don't close fds in write_init_super 2011-01-25 07:56:53 +11:00
super0.c Don't close fds in write_init_super 2011-01-25 07:56:53 +11:00
swap_super.c Getting ready for 2.0 release... 2005-08-26 02:26:37 +00:00
sysfs.c Fix some issues with setting 'new' state of a reshape 2011-01-26 08:50:28 +10:00
test Enable tests for OLCE, takeover, migrations for imsm metadata 2010-12-26 21:59:14 +11:00
TODO Initial DDF support code. 2008-05-15 16:48:14 +10:00
udev-md-raid.rules Update of udev rules to support IMSM devices 2010-11-22 20:58:06 +11:00
util.c Use one function chosing spares from container 2011-01-05 14:34:14 +11:00

Assembling md arrays at boot time.
---------------------------------
December 2005

These notes apply to 2.6 kernels only and, in some cases,
to 2.6.15 or later.

Md arrays can be assembled at boot time using the 'autodetect' functionality
which is triggered by storing components of an array in partitions of type
'fd' - Linux Raid Autodetect.
They can also be assembled by specifying the component devices in a
kernel parameter such as
  md=0,/dev/sda,/dev/sdb
In this case, /dev/md0 will be assembled (because of the 0) from the listed
devices.

These mechanisms, while useful, do not provide complete functionality
and are unlikely to be extended.  The preferred way to assemble md
arrays at boot time is using 'mdadm' or 'mdassemble' (which is a
trimmed-down mdadm).  To assemble an array which contains the root
filesystem, mdadm needs to be run before that filesystem is mounted,
and so needs to be run from an initial-ram-fs.  It is how this can
work that is the primary focus of this document.

It should be noted up front that only the array containing the root
filesystem should be assembled from the initramfs.  Any other arrays
should be assembled under the control of files on the main filesystem
as this enhanced flexibility and maintainability.

A minimal initramfs for assembling md arrays can be created using 3
files and one directory.  These are:

/bin           Directory
/bin/mdadm     statically linked mdadm binary
/bin/busybox   statically linked busybox binary
/bin/sh        hard link to /bin/busybox
/init          a shell script which call mdadm appropriately.

An example init script is:

==============================================
#!/bin/sh

echo 'Auto-assembling boot md array'
mkdir /proc
mount -t proc proc /proc
if [ -n "$rootuuid" ]
then arg=--uuid=$rootuuid
elif [ -n "$mdminor" ]
then arg=--super-minor=$mdminor
else arg=--super-minor=0
fi
echo "Using $arg"
mdadm -Acpartitions $arg --auto=part /dev/mda
cd /
mount /dev/mda1 /root ||  mount /dev/mda /root
umount /proc
cd /root
exec chroot . /sbin/init < /dev/console > /dev/console 2>&1
=============================================

This could certainly be extended, or merged into a larger init script.
Though tested and in production use, it is not presented here as
"The Right Way" to do it, but as a useful example.
Some key points are:

  /proc needs to be mounted so that /proc/partitions can be accessed
  by mdadm, and so that /proc/filesystems can be accessed by mount.

  The uuid of the array can be passed in as a kernel parameter
  (rootuuid).  As the kernel doesn't use this value, it is made available
  in the environment for /init

  If no uuid is given, we default to md0, (--super-minor=0) which is a
  commonly used to store the root filesystem.  This may not work in
  all situations.

  We assemble the array as a partitionable array (/dev/mda) even if we
  end up using the whole array.  There is no cost in using the partitionable
  interface, and in this context it is simpler.

  We try mounting both /dev/mda1 and /dev/mda as they are the most like
  part of the array to contain the root filesystem.

  The --auto flag is given to mdadm so that it will create /dev/md*
  files automatically.  This is needed as /dev will not contain
  and md files, and udev will not create them (as udev only created device
  files after the device exists, and mdadm need the device file to create
  the device).  Note that the created md files may not exist in /dev
  of the mounted root filesystem.  This needs to be deal with separately
  from mdadm - possibly using udev.

  We do not need to create device files for the components which will
  be assembled into /dev/mda.  mdadm finds the major/minor numbers from
  /proc/partitions and creates a temporary /dev file if one doesn't already
  exist.

The script "mkinitramfs" which is included with the mdadm distribution
can be used to create a minimal initramfs.  It creates a file called
'init.cpio.gz' which can be specified as an 'initrd' to lilo or grub
(or whatever boot loader is being used).




Resume from an md array
-----------------------

If you want to make use of the suspend-to-disk/resume functionality in Linux,
and want to have swap on an md array, you will need to assemble the array
before resume is possible.
However, because the array is active in the resumed image, you do not want
anything written to any drives during the resume process, such as superblock
updates or array resync.

This can be achieved in 2.6.15-rc1 and later kernels using the
'start_readonly' module parameter.
Simply include the command
  echo 1 > /sys/module/md_mod/parameters/start_ro
before assembling the array with 'mdadm'.
You can then echo
  9:0
or whatever is appropriate to /sys/power/resume to trigger the resume.