mdadm

Author	SHA1	Message	Date
Dan Williams	1ce0101c9a	imsm: defend against unsupported migrations (temporary) Until support for higher order migrations (online capacity expansion, raid level migration, chunk size migration...) are implemented do not allow arrays in these states to be assembled. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-04-12 00:58:28 -07:00
Dan Williams	1484e72797	imsm: add 'verify', 'verify with fixup', and 'general' migration types imsm distinguishes parity initialization from parity checking in the metadata. Older option roms marked the repair operation with the 'verify' type and a 'with fixup' flag in the raid device 'status' field. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-04-12 00:58:27 -07:00
Dan Williams	ff5963088d	imsm: fix imsm_map.num_domains 'num_domains' is the number of parity domains. I.e. 2 in the raid10 case (2-mirrors), while raid0 through raid5 have 1 parity domain (even though raid0 does not have parity). Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-04-12 00:58:27 -07:00
Dan Williams	1f45a8ad20	imsm: ensure mpb buffer is zeroed Don't leak unitialized data into the mpb. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-04-12 00:58:27 -07:00
Dan Williams	9d84c8eac2	imsm: support --examine --export Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-04-11 21:53:25 -07:00
Dan Williams	ae2bfd4e13	imsm: make uuid separator consistent with ddf '-' to ':' Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-04-11 21:53:25 -07:00
Dan Williams	316e2bf426	imsm: extract right-most whitespace stripped serial number According to new documentation the metadata expects that all whitespace (characters <= 0x20) are stripped from the incoming serial number. If the length remains longer than MAX_RAID_SERIAL_LEN then only the right-most characters are preserved. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-04-08 11:41:51 -07:00
NeilBrown	b9d77223eb	Release mdadm-3.0-devel3	2009-03-10 16:59:57 +11:00
Dan Williams	8be094f0ee	imsm: display supported chunk sizes in --detail-platform Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-27 15:35:20 -07:00
Dan Williams	efb30e7f1e	imsm: auto layout In support of auto-layout: 1/ collect and merge all extents to find the largest common-start free region 2/ verify that we meet the "all volumes must use the same set of disks" 2/ mark the disks to be added in add_to_super_imsm_volume Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-24 18:45:57 -07:00
Dan Williams	dab4a5134e	sysfs: allow sysfs_read to detect and drop removed disks All operations that rely on loading from an existing container (like --add) will fail after a disk has been removed. Provide an option to skip missing / offline disks rather than abort. We attempt to do this in the load_super_{imsm,ddf}_all cases when mdmon is running i.e. we already have a consitent version of the metadata running in the system. Otherwise, we fail as normal and let the administrator fix up the container. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-24 18:45:56 -07:00
Dan Williams	db575f3b9e	imsm: retry load_imsm_mpb if we suspect mdmon has made modifications If the checksum verification fails and mdmon is running we retry the load to get a consistent snapshot of the mpb. Found by tests/08imsm-overlap. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-24 18:45:56 -07:00
Dan Williams	ecf45690f2	imsm: verify single sector mpb checksums If the mpb is only one sector do not skip the checksum verification. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-24 18:45:56 -07:00
Dan Williams	0556e1a2b1	imsm: fix mark_failure / introduce mark_missing Actually, rename mark_failure to mark_missing and then implement the correct mark_failure which according to new documentation is to: 1/ Set the FAILED status bit 2/ Set IMSM_ORD_REBUILD to mark the disk out of sync 3/ Set map->failed_disk_num if this is the first failure detected failure (it is ~0 otherwise) Previously the assumption was that IMSM_ORD_REBUILD only appeared in map[1], so all routines that care about out-of-sync disks need to be updated. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-24 18:45:56 -07:00
Dan Williams	620b171338	imsm: introduce get_imsm_disk_slot Implement a common disk index to disk slot routine and replace open coded versions. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-24 18:45:56 -07:00
Dan Williams	df4746577e	imsm: fix activate spare to ignore foreign disks A foreign disk is one that all other drives believe is not-in-sync but does not have the 'failed' status bit set. This also reverts, because that commit is addressing the wrong problem. Ideally mdmon would kick "non-fresh" drives like the kernel does at native-md activation time, but that is too awkward to implement at the moment because mdadm owns container manipulations. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-23 23:06:24 -07:00
Dan Williams	7a70e8aa8d	imsm: fixup container spare uuids by default Spares in the imsm case are marked with the "match-all" uuid of ffffffff-ffffffff-ffffffff-ffffffff. When performing incremental assembly we need to associate such devices with a populated container uuid. Also when performing --detail on a container with only spares present we can make an attempt to return a real uuid. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-23 23:06:24 -07:00
Dan Williams	689c9bf3c3	imsm: fix missing initializations of the per-disk extents pointer Fixes a glibc assertion when trying to free a pointer that was not malloc'd. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-23 23:06:24 -07:00
Dan Williams	cceebc67f1	imsm: provide a simulated option-rom for regression tests IMSM_NO_PLATFORM turns off checks that should be tested, so provide a IMSM_TEST_OROM variable to allow testing the orom constraints in the mdadm regression suite. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-23 14:26:10 -07:00
Dan Williams	5a03814040	imsm: block creation of devices with identical names Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-02 15:01:13 -07:00
Dan Williams	78757ce8a5	imsm: don't check raid1 chunk size mdadm -C /dev/md/r1d2n1s0-5 -amd -l1 --size 5242880 -n 2 /dev/sdb /dev/sdc -R -f -v -c 64 mdadm: chunk size ignored for this level mdadm: super0.90 cannot open /dev/sdb: Device or resource busy mdadm: super1.x cannot open /dev/sdb: Device or resource busy mdadm: platform does not support a chunk size of: 0 mdadm: device /dev/sdb not suitable for any style of array Reported-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Tested-by: Jacek Danecki <jacek.danecki@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-02 10:55:31 -07:00
Dan Williams	caf8d23175	imsm: fix failed disks are allowed back into the container Failed disks do not have valid serial numbers which means we will not pick up the 'failed' status bit from the metadata entry. Check for dl->index == -2 to prevent failed disks from being incorporated into the container. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-01-23 15:45:34 -07:00
Dan Williams	5615172f1d	Create: warn when a metadata format's platform components are missing If the metadata handler can not find its platform support components then there is no way for it to verify that the raid configuration will be supported by the option-rom. Provide a generic method for metadata handlers to warn the user that the array they are about to create may not work as intended with a given platform. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-01-20 01:36:51 -07:00
Dan Williams	a20d2ba5f3	imsm: enforce "all member disks must be members of all arrays" This is a key orom-compatibility constraint. A nice side effect is that it precludes the corner case of 'create' racing against 'spare activate' since the create will fail to convert a spare into an array member. At create time we check if this is the first member array in the container if it is than all disks are possible candidates, if it is not then only current members are permitted. A bit hairier is spare-activation handling in the presence of this constraint. It is difficult because spare handling is per array. The approach taken is to: 1/ check that a new spare can cover all defined arrays in the container 2/ ensure that partially assimilated spares are the first candidates when looking for a spare region to activate. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-01-20 01:36:51 -07:00
Dan Williams	1c556e92ba	imsm: enforce num_disks constraints RAID1 == 2 disks RAID5 >= 3 disks RAID10 == 4 disks Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-01-20 01:36:50 -07:00
Dan Williams	35f81cbbc5	imsm: rename vprintf macro to pr_vrb Don't redefine standard library calls unecessarily... Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-01-20 01:36:50 -07:00
Dan Williams	a18a888ea7	Create: allow per-metadata default layouts Let handlers specifiy their own defaults, specifically needed for the imsm-raid5 case where mdadm defaults to 'ls' and imsm to 'la'. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-01-20 01:36:50 -07:00
Dan Williams	03cd4cc810	imsm: imsm_read_serial check for zero-length response VMWare virtual disks successfully run the inquiry but return a zero response. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-01-20 01:33:56 -07:00
Dan Williams	be2c0e387b	imsm: fix dev_open return value handling dev_open returns an fd Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-01-20 00:29:34 -07:00
NeilBrown	45b662b611	Merge branch 'devel' of git://git.kernel.org/pub/scm/linux/kernel/git/djbw/mdadm into devel-3.0	2008-12-18 16:58:25 +11:00
Dan Williams	4025c288b2	imsm: don't take chunk_size into account for raid1 Results in chopping off usable parts of the requested size. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-12-08 16:59:18 -07:00
Dan Williams	c8151cbc42	imsm: reverse swapped arguments to posix_memalign in imsm_prepare_update Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-12-08 16:59:18 -07:00
Dan Williams	ba2de7ba05	imsm: convert dev_tbl to devlist ...to facilitate testing arbitrary numbers of raid devices Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-12-08 16:59:18 -07:00
Dan Williams	d665cc31e7	imsm: provide a detail_platform method Dump the orom capabilities and hardware disk configuration. This code relies on the name of scsi_host objects to determine the hardware port number. Hopefully this information is stable... Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-12-08 16:59:18 -07:00
Dan Williams	4cce406959	introduce --detail-platform to display platform raid capabilities Metadata formats like imsm work in concert with platform firmware and hardware, so provide a way for mdadm to display this info to the user. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-12-08 16:59:18 -07:00
Dan Williams	88c32bb1ec	imsm: validate arrays being created against firmware capabilities These checks are only enabled when platform support for imsm is found, i.e. ahci driver is loaded and talking to an Intel(R) controller, and the option rom header is located. They can be turned off by setting the environment variable IMSM_NO_PLATFORM to 1. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-12-08 16:59:18 -07:00
Dan Williams	54c2c1ea23	imsm: pass disk info in create message We may be creating on spare disks in which case we need to know which disk goes in which slot. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-12-08 16:59:17 -07:00
Dan Williams	0dcecb2e2d	imsm: correct start offset handling at create time imsm metadata requires all members of a raid volume to start at the same offset. So, incrementally build a composite disk from all the candidates passed to ->validate_geometry. After each disk is added merge the extents and search for a common start offset that satisfies the requested raid device size. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-12-08 16:59:17 -07:00
Dan Williams	03bcbc654f	imsm: fix setting of device size for raid1 When chunksize is 0 in the raid1 case we need to use info_to_blocks_per_member() to calculate the array size. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-12-08 16:59:17 -07:00
NeilBrown	8592f29d64	Create: support autolayout when creating in a DDF If, when creating an array, a signal target device is given which is a container, then allow the metadata handler to choose which devices to use. This is currently only supported for DDF. Signed-off-by: NeilBrown <neilb@suse.de>	2008-12-04 16:08:33 +11:00
NeilBrown	e46273ebe4	Change 'size' argument to validate_geometry to be sectors, not K That way it is the same a *freesize, and generally less confusing. Signed-off-by: NeilBrown <neilb@suse.de>	2008-12-04 15:47:57 +11:00
Dan Williams	dda5855f96	imsm: fix metadata reservation 1/ When truncating the space reserved for the metadata round down to an even numbered sector count to avoid an off-by-one error when sysfs_add_disk rounds up. 2/ Set the current metadata parameter block size as a floor. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-11-27 15:41:03 +11:00
NeilBrown	208933a7a8	Tidy error messages for add_to_super failure. Make sure every failure from add_to_super prints a suitable error message, and then don't print any error in the caller. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-27 15:39:59 +11:00
Dan Williams	f20c396836	allow add_to_super to return errors Prepare add_to_super to validate disks against the platform capabilities Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-11-27 15:30:39 +11:00
Dan Williams	92bd8f8d3f	imsm: fix uuid_from_super given 'signature' is not constant The version portion of the signature changes depending on the contents of the container. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-11-08 16:03:07 -07:00
Dan Williams	4d1313e901	imsm: compatibility fixes for creating imsm arrays When creating an imsm array use the lowest possible feature set to maximize compatibility. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-11-08 16:03:07 -07:00
Dan Williams	f2f27e63c4	imsm: fixup disk status definition endianess Change the multibyte disk status field definitions to imsm byte-order (little-endian) to match other multibyte field definitions. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-11-08 16:02:56 -07:00
Dan Williams	fe7ed8cb4f	imsm: add definitions for recent imsm versions Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-11-08 15:47:39 -07:00
Dan Williams	e3bba0e010	imsm: cleanup migration definitions and usage imsm_set_array_state need not look at the map_state when failed==0 Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-11-07 15:57:31 -07:00
Dan Williams	5115ca67fd	imsm: cleanup ->match_home and comment on return value Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-11-07 15:08:09 -07:00
NeilBrown	97f734fde2	A couple of bugfixes found by suse autobuilding: 1/ ia64 appear to have __clone2, not clone. 2/ Including "++" in the arg to a macro is a bad thing to do. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-07 14:46:30 +11:00
Dan Williams	3ebe00a1e2	imsm: display container uuid in detail_super Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 20:50:39 +11:00
Dan Williams	44470971ce	imsm: display member array uuid in examine_super_imsm Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 20:50:39 +11:00
NeilBrown	cf8de6913b	Don't give array name in --examine --brief output if it is doubtful. Now that mdadm.conf doesn't need an array name, we don't need to give one if the array cannot reliably provide one.	2008-11-04 20:50:38 +11:00
NeilBrown	40ebbb9cfe	util: make env checking more generic Change the "env_check_mdmon" function to be more generic, accepting and environment variable name, as soon we will have a new use for it. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 10:35:43 +11:00
NeilBrown	d9b420a5cd	intel: Avoid 'may be used before initialised' warning. When compile with -Os, the compile doesn't work out that the variable is always initialised before usage, so we tell it.	2008-11-04 10:35:40 +11:00
Dan Williams	1e7bc0ed08	imsm: include members in ->brief_examine A prerquisite for getting imsm arrays assembled by mdadm -As.	2008-10-28 10:55:31 -07:00
Dan Williams	78d30f94c4	imsm: copy raid device info when associating spares If a spare is included in the list of examined disks we need to copy in at least enough information to get the uuid of the populated container. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-28 10:55:31 -07:00
Dan Williams	a575e2a7cd	imsm: return associated uuid for spares This prevents a uuid of all f's from being displayed when an imsm spare is listed along with active disks for mdadm -Eb. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-28 10:55:31 -07:00
Dan Williams	032e9e2953	Examine: fix MD_DISK_SYNC is a bit not a flag Examine() is actually looking at the ACTIVE bit. This happened to work for imsm spares but now it needs to be fixed up. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-28 10:55:31 -07:00
Dan Williams	072b727f72	imsm: update metadata immediately on "add spare" events ...without this the spare record is delayed until the next metadata event. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-28 10:55:31 -07:00
Dan Williams	a54d52625a	update copyright headers Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-28 10:55:29 -07:00
Dan Williams	57ed8c9155	Treat all devices at the container level as spares Raid disk and disk number information is not relevant at the container level, especially for imsm. So arrange for getinfo_super_imsm() to always publish devices as spares and report the number of spares at Assemble() time. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:43:57 -07:00
Dan Williams	36ba7d4849	Allow a uuid of all f's to always match The uuid returned for an imsm spare device will never match the uuid of an active disk. So make mdadm interpret a uuid of all f's as "match any". Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:43:57 -07:00
Dan Williams	27fd627414	imsm: show uuid in ->examine_super() ...and add "auto=md" to the brief output. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:43:56 -07:00
Dan Williams	792449393d	non-trivial warn_unused_result fixes, activate_spare Both super-ddf and super-intel ignore memory allocation failures during ->activate_spare. Fix these up by cancelling the activation. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:52 -07:00
Dan Williams	3f6efecc4c	imsm: determine failed indexes from the most up-to-date disk load_imsm_disk() currently notices if spares missed their activation update, but we allow a stale failed disk back in to the array because its serial number is clobbered in the most up-to-date disk. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	47ee5a4566	imsm: manage a list of missing disks If a drive is removed while mdmon is not running we need a way to identify what is missing and mark that disk as failed in the metadata. At ->load_super() time create a list of missing disks defined as a disk that is marked in-sync yet does not appear in super->disks. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	1ee1e9fc62	imsm: fix mpb_size calculation in write_super_imsm Spotted a thinko... raid devices are dynamically sized, disks are not. The space for disks is always mpb->num_disks * sizeof(struct imsm_disk). Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	f8f603f133	imsm: enable checkpointing of migration (resync/rebuild) When the array is shutdown, or when mdadm --wait-clean is called, any active resync process will be idled allowing mdmon to record the current resync position. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	593add1b56	monitor: protect against CONFIG_LBD=n md/resync_start reports different terminal values depending on kernel configuration (~0UL versus ~0ULL). Make detection of the resync-complete state more robust by comparing against array size. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	14e8215b1b	imsm: trust sector reservation from metadata On ich6r the option-rom appears to reserve only 432 sectors rather than the 418+4096 of newer implementations. For compatibility trust the metadata in these cases. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	c92a2527e1	imsm: confirm raid10 layout, fix up handling raid10 failures 1/ near-2 indeed matches how the Windows driver lays out the data 2/ update imsm_check_degraded to check for rebuilding disks in the raid10 case Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:47 -07:00
Dan Williams	5c3db629a6	imsm: more serial handling fixups zero-initialize the serial buffer to handle cases where the response is less than MAX_RAID_SERIAL_LEN. Tested-by: Jacek Danecki <jacek.danecki@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 13:12:17 -07:00
NeilBrown	ff54de6e47	Report uuid in --detail --brief for ddf and intel The uuid is slightly fictitious but needed for array matching.	2008-09-18 16:11:40 +10:00
NeilBrown	51006d8586	Add uuid support for super-intel. 'imsm' does not provide any real uuid, so we synthesise one from various stable bits of the superblock.	2008-09-18 16:07:32 +10:00
NeilBrown	9362c1c80c	Allow metadata handler to report that it doesn't record homehost. For now, this means that the lack of a homehost doesn't always prevent assembly. Soon we will allow assembly anyway, but have different messages if homehost isn't supported.	2008-09-18 16:06:41 +10:00
NeilBrown	c5afc314e2	Lots of fixes to make incremental assembly of containers work. So: mdadm -I /dev/whatever will (if appropriate) add whatever to a container, then start any arrays inside the container.	2008-09-18 16:03:05 +10:00
NeilBrown	352452c364	Handle incremental assembly of containers. mdadm -I /dev/part-of-container should add that to a container, creating if it needed, and then try to assemble any arrays in the container. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 16:01:57 +10:00
NeilBrown	f35f252592	Move calls to SET_ARRAY_INFO to common helper. When we assemble an array, there are three different approaches depending on whether metadata is internal or external, and on kernel version. Move all this to a common helper instead of duplicating in 3 places. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 16:01:55 +10:00
NeilBrown	7801ac2092	Factor out add-disk code The variety of approaches to 'add_disk' are factored out into a separate function, and Incremental mode benefits by being closer to supporting the assembly of containers. Also remove the adding-to-array-data-structure out of sysfs_add_disk and into add_disk. And add some tests for --incremental mode to make sure we don't break it. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 15:13:32 +10:00
NeilBrown	0e60042683	Compile fixes, particularly moving more stuff under MDASSEMBLE Now 'make everything' works again.	2008-09-18 15:04:47 +10:00
NeilBrown	a8473e68c7	Fix compile warning/error. gcc said: error: large integer implicitly truncated to unsigned type Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 14:10:42 +10:00
Dan Williams	e553d2a458	imsm: allow a failed disk to be readded Allow the following sequence to rebuild the array mdadm --fail /dev/md/r1 /dev/disk mdadm --remove /dev/imsm /dev/disk mdadm --add /dev/imsm /dev/disk Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00
Dan Williams	301406c9fd	imsm: use ->getinfo_super() in ->container_content() * allows container_content() to pick up the safemode_delay * removes some duplicate code * fixes an endian bug setting info->array.chunk_size Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00
Dan Williams	a67dd8cc58	Allow metadata handlers to communicate desired safemode delay via mdinfo Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00
Dan Williams	1f24f03530	imsm: fix up serial handling * Trim trailing and leading whitespace * Allow unterminated serial numbers up to MAX_RAID_SERIAL_LEN Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:41 -07:00
Dan Williams	f9ba0ff124	imsm: only use the device name as a fallback when IMSM_DEVNAME_AS_SERIAL=1 Also ensure that the serial buffer is initialized. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:41 -07:00
Dan Williams	0c046afd06	imsm: rectify map handling The secondary map is used to reflect the migration state of the array i.e. from dev->vol.map[1] to dev->vol.map[0]. Ensure a rebuilding / initializing array is marked in the second map, while normal status is reflected in the first map. Also mark rebuilding drives with IMSM_ORD_REBUILD. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:41 -07:00
Dan Williams	24565c9a99	imsm: fix imsm_delete() * fix breakage from last merge (infinite loop in imsm_process_update()) * add ability to delete by index Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:41 -07:00
Dan Williams	b10b37b839	imsm: use IMSM_ORD_REBUILD instead of USABLE flag IMSM_ORD_REBUILD is the 'insync' flag in MD terms. USABLE is a flag to opt-in disks for use with the Windows driver. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:41 -07:00
Dan Williams	be73972fac	imsm: introduce set_imsm_ord_tbl_ent() Collapse all the open coded occurrences. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:41 -07:00
Dan Williams	fb49eef264	imsm: cleanup arguments to imsm_check_degraded Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:41 -07:00
Dan Williams	ff077194a1	imsm: cleanup get_imsm_disk_idx(), unify with get_imsm_ord_tbl_ent() Save some unnecessary calls to get_imsm_map() by teaching get_imsm_disk_idx() to retrieve the map. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:41 -07:00
Dan Williams	3e372e5a72	imsm: fix up compare_super_imsm() to match family_num for populated mpb's This allows spares to be associated with any family while not allowing disks from different families to be assembled. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:40 -07:00
Dan Williams	e0783b419d	imsm: fix up spare handling holdover in update_create_array We used to leave SPARE_DISK unset to indicate it was available to be assimilated into other arrays. Now we explicitly check the size. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:55:40 -07:00
Dan Williams	8796fdc4cd	imsm: mark failures like the Matrix driver * Truncate the first character of the serial number * Set 'scsi_id' to all f's * Expect to find disk entries with unmatchable serial numbers, i.e. expect get_imsm_disk() to return NULL in some situations * Allow discrepencies between mpb->num_disks and len(super->disks) Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:55:34 -07:00
Dan Williams	4d7b1503a7	imsm: provide for a larger mpb buffer when necessary Ensure that the mpb buffer is large enough to hold the extra imsm_map's of migrating arrays and dynamically created raid devices. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:55:34 -07:00
Dan Williams	fb9bf0d3e7	imsm: fix logic inversion in get_imsm_ord_tbl_ent() Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:55:30 -07:00
Dan Williams	6c386dd368	imsm: allow container assembly in the presence of failed disks For example, this allows one to still say mdadm -A /dev/sd[b-e] even though /dev/sde has replaced /dev/sdd. Otherwise mdadm will say: mdadm: superblock on /dev/sdd doesn't match others - assembly aborted Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-19 17:55:15 +10:00
Dan Williams	43dad3d6fb	mdadm: add device to a container Adding a device updates the container and then mdmon takes action upon noticing a change in devices. This reuses the container version of add_to_super to create a new record for the device. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2008-08-19 17:19:51 +10:00
Dan Williams	ae6aad8239	imsm: delete kicked disks When we have determined that a disk is no longer of any value, remove it from the data structure. This is now safe because the manager will back off while any metadata update is pending in the monitor. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2008-08-19 14:55:10 +10:00
NeilBrown	01f157d74a	Extra option for set_array_state: you choose dirty or clean. When we first start an array, it might be good to start recovery straight away. That requires setting the array to 'dirty', but only the metadata handler can know if that is required or not. So have a third possible 'consistent' option to set_array_state. Either 'no' or 'yes' or 'you choose'. Return value indicates what was chosen. '1' (no) should be chosen unless there is a good reason. Signed-off-by: NeilBrown <neilb@suse.de>	2008-08-19 14:54:55 +10:00
Dan Williams	fcb844757f	imsm: include not synced disks in imsm_count_failed Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-15 10:58:42 -07:00
Dan Williams	7eef045331	imsm: use disk_ord_tbl to identify rebuilding disks Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-15 10:57:19 -07:00
Dan Williams	9a1608e5d0	imsm: fix up assembly of disks that are not in-sync 1/ Do not assemble !in_sync or failed devices in container_content. 2/ Prevent activation of failed or configured devices in activate_spare. 3/ Be sure to avoid dirty degraded if the array was shutdown cleanly. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-12 02:25:49 -07:00
Dan Williams	6a3e913ee9	imsm: fix create by mdmon-update imsm_dev dynamically grows, so dev_idx needs to be moved up in the definition to avoid getting clobbered. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-12 02:25:49 -07:00
Dan Williams	e74255d907	imsm: write_super return 0 on success Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-12 02:25:49 -07:00
Dan Williams	a48ac0a8d6	imsm: update mpb_size in write_super_imsm With dev->vol.map and mpb->disk entries entering and leaving the parameter block write_super_imsm needs to update the size before writeback. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-12 02:25:49 -07:00
Dan Williams	272906ef49	mdmon: use activate spare for re-add Disks that are not in-sync or failed are not assembled into member arrays by mdadm. Teach mdmon to resolve this situation by checking for spares at start. imsm_activate_spare() is updated to prefer devices that can be re-added versus new spares. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-12 02:25:46 -07:00
Dan Williams	3393c6af8b	imsm: fix handling of the 'migr_state' and 'migr_type' bits The option-rom and the Matrix driver mark resyncs/rebuilds with the migrate state bits. Update sizeof_imsm_dev to allow allocation of imsm_dev entries large enough to grow if migr_state is later set. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-12 02:05:20 -07:00
Dan Williams	a965f303c7	imsm: add get_imsm_map and sizeof_imsm_map retrieve map entries from a imsm_dev, and cleanup imsm_copy_dev Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-11 01:16:24 -07:00
Dan Williams	828408ebef	imsm: drop 'external' from imsm_examine_brief Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-11 01:16:24 -07:00
Dan Williams	19859edc2d	imsm: ensure 'usable' remains clear until the disk is in_sync Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-11 01:16:24 -07:00
Dan Williams	d23fe9472d	imsm: spare devices are represented as single disk containers This poses a small problem for the case of handling multiple raid1 arrays across separate disk pairs i.e. 2 mirrors on 4 disks. The option-ROM will configure this as two containers. We may need the capability for one container to ask for an unused spare in another container. For now spares will just maintain the affinity established at assemble time. To support this configuration spare devices must be allowed to be assembled into the container even though the metadata indicates the disk belongs to a different family. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-09 13:37:54 -07:00
Dan Williams	b9f594fea0	imsm: store imsm_disk entries in the super->disks list removes the need to lookup the disk by index in a few cases and is a preparation step for tracking spares outside the current anchor. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-09 13:37:54 -07:00
Jacek Danecki	604b746f7c	imsm: bad block management (phase1) This is the initial defensive implementation of bad block management support. It simply precludes assembly if there are entries in the bad block logs. This is sufficient for now as the conditions that lead to an entry in the bad block log would cause the array to be failed by MD (as of 2.6.27). [dan.j.williams@intel.com: general cleanups] Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-06 23:55:53 -07:00
Dan Williams	115c380305	imsm: do not mark arrays 'clean' if resync still in progress Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-06 23:53:44 -07:00
Dan Williams	a380c027e3	imsm: raid5 layout is left-asymmetric Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-06 23:53:44 -07:00
Dan Williams	9ca2c81c0f	imsm: trim some memory wastage drop the previously loaded data as load_imsm_super_all loops through the disks. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-25 17:45:13 -07:00
Dan Williams	949c47a0fa	imsm: refactor mpb handling into parse and coalesce Maintaining a single global buffer is unwieldly when extending/rewriting sections of the metadata. Parse the metadata into component data structures upon reading and coalesce to a coherent buffer before writing. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-25 16:59:47 -07:00
Dan Williams	f54e6321a2	imsm: use component_size not total array size in container_content_imsm Also kill setting ->array.size as nothing appears to be using it. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-24 17:26:24 -07:00
Dan Williams	fcfd95998b	imsm: take chunksize into account when calculating blocks_per_member Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-24 17:26:24 -07:00
Dan Williams	755c99faf2	sysfs: deprecate sysfs_disk_to_sg The cmd_filter patch merged for 2.6.27 broke retrieving the serial number via an ioctl to /dev/sgN. In debugging this I found that other utilities like sdparm simply run the ioctl on /dev/sdX. So just convert to that for protection in numbers, but scream on the mailing list for the inconvenience grr... Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-24 17:26:24 -07:00
Dan Williams	eb7ea4630f	imsm: fix store_zero_imsm breakage from posix_memalign conversion Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-24 17:26:23 -07:00
Dan Williams	0030e8d690	imsm: debug environment variable to use 'device name' as 'serial number' Facilitate using loopback devices for the imsm regression suite Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-24 17:26:23 -07:00
Dan Williams	e9d82038f8	imsm: allow --examine to succeed with damaged metadata limp along after seeing mismatched serial numbers Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-24 17:26:23 -07:00
Dan Williams	5802a8118e	imsm: handle degraded->normal transitions in set_disk Removes the need for the call to ->set_array_state when sync_action transitions from 'recover' to 'idle'. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-24 17:26:23 -07:00
Dan Williams	8273f55e4f	imsm: create array via update to mdmon	2008-07-24 17:26:19 -07:00
Dan Williams	ef649044b7	imsm: remove some casts Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-21 17:42:10 -07:00
Dan Williams	5257ce365b	imsm: activate_spare: fix handling of data_offset Neil rightly points out that imsm_activate_spare may skip valid free space on a spare, fix this up. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-21 11:08:20 -07:00
Dan Williams	e8319a19f2	imsm: process update Handle 'activate_spare' events	2008-07-14 14:10:02 -07:00
Dan Williams	88758e9daa	imsm: activate_spare Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-14 14:09:48 -07:00
Dan Williams	2da8544a39	imsm: metadata only supports a global 'data_offset' When creating a volume a drive must have free space starting at the same location as all the other drives in the array.	2008-07-14 13:59:56 -07:00
Dan Williams	ef6ffaded4	imsm: fix raid1 creation Need to supply dummy values for the chunk size and number of data stripes Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-14 13:58:19 -07:00
Dan Williams	0372d5a2f4	imsm: fail ->open_new if container_member is out of range Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-14 13:57:33 -07:00
Dan Williams	a3d6757fbf	imsm: kill obsolete call to fsync() in store_imsm_mpb Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-14 13:56:42 -07:00
Dan Williams	2c092cada6	imsm: fix insufficient mpb buffer allocation Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-14 13:56:28 -07:00
Dan Williams	ad97895ec8	imsm: catch posix_memalign allocation failures Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-14 13:55:12 -07:00
Dan Williams	bf5a934aff	imsm: remove extra superswitches Following the lead of 75ede16d. This incidentally fixes creation of a second array by gating call to getinfo_super_imsm_volume with a valid ->current_vol. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-14 13:55:12 -07:00
Dan Williams	99e2926423	imsm: fix store_imsm_mpb() alignment for O_DIRECT Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-14 13:55:02 -07:00
Neil Brown	2c514b7120	Pass 'verbose' flag to validate_geometry That way it can be silent when we are just trying to figure out which metadata to use, and noisy when detecting a real problem.	2008-07-12 20:28:38 +10:00
Neil Brown	6416d5275d	Use O_DIRECT for all IO to devices. Using buffered IO risks non-atomic updates to parts of the device that we don't actually want to write to. This isn't in general safe. So switch to O_DIRECT for all that IO and make sure we have properly aligned buffers.	2008-07-12 20:28:33 +10:00
Neil Brown	d2ca644994	Remove getinfo_super_n and do some other cleaning up. Getting close to a sensible description of what some of the superswitch methods are supposed to do!	2008-07-12 20:27:39 +10:00
Neil Brown	0d481d3723	External metadata shouldn't set array.*_version It doesn't mean anything and is never used.	2008-07-12 20:27:39 +10:00
Neil Brown	f7e7067b47	Add subarray field to supertype. When loading the metadata for a subarray (super_by_fd), we set ->subarray to be the name read from md/metadata_version so that getinfo_super can return info about the correct array. With this we can differentiate between a container and an array within the container by looking at ->subarray[0].	2008-07-12 20:27:38 +10:00
Neil Brown	0063ecba3d	Hide subordinate superswitch structures. Only one superswitch should be externally visible for each general type. Others which handle different flavours (e.g. container/data-array) should be internal only.	2008-07-12 20:27:38 +10:00
Neil Brown	b8ac196795	Remove 'major' from superswitch. It isn't generally meaningful.	2008-07-12 20:27:37 +10:00
Neil Brown	ef60947720	Always initialise a struct super_type to zero	2008-07-12 20:27:36 +10:00
Dan Williams	4e6e574a3e	mdmon: add debug print statements for profiling mdmon for development only as console output can block leading to monitor deadlocks in low mem situations Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-16 15:50:07 -07:00
Dan Williams	a862209dc3	imsm: set map state depending on sync state Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-16 15:36:41 -07:00
Dan Williams	272f648fb0	imsm: write anchor last Ensure that newly written anchors reflect a consistent metadata block. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-16 15:36:41 -07:00
Dan Williams	551c80c1c1	imsm: implement store_zero_imsm and init_zero_imsm Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-16 15:36:40 -07:00
Dan Williams	4a04ec6ca1	imsm: ->getinfo_super set raid_disk to disk.number A spare device by definition will have raid_disk set to -1, but when assembling the container we want this disk to by included. Fixes a SIGSEGV when doing: mdadm -A /dev/imsm -e imsm /dev/sd[b-e] ...where /dev/sde is marked as a global spare device Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-16 15:36:40 -07:00
Dan Williams	c2c087e631	imsm: add 'Create' support Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-16 15:36:38 -07:00
Dan Williams	8b35327854	imsm: 'volume' is the proper name for imsm container members Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-13 17:42:09 -07:00
Dan Williams	f4d11639d1	imsm: ->container_dev needs to be set in load_super_imsm_all It is used later by container_content_imsm to determine set the text_version of the member arrays. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-13 17:42:09 -07:00
Dan Williams	87eb16dfa9	imsm: add 'mpb_size' to --examine output Break out sector_count() for use elsewhere. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-13 17:42:09 -07:00
Dan Williams	90c8b70714	sysfs: provide a helper function for locating scsi_generic interfaces imsm records and validates this data in its metadata Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-13 17:27:30 -07:00
Neil Brown	2e735d1982	Allow passing metadata update to the monitor. Code in manager can now just call queue_metadata_update with a (freeable) buf holding the update, and it will get passed to the monitor and written out.	2008-06-12 10:13:23 +10:00
Neil Brown	cba0191bad	Parse the 'instance' part of external:/mdXX/INST in metadata handler. This give more flexability.	2008-05-27 09:18:57 +10:00
Neil Brown	159c3a1a77	Remove st->text_version in favour of info->text_version I want the metadata handler to have more control over the 'version', particularly for arrays which are members of containers. So discard st->text_version and instead use info->text_version which getinfo_super can initialise.	2008-05-27 09:18:55 +10:00
Neil Brown	ed9d66aade	Change mark_clean to set_array_state. DDF needs more fine grained understanding of the array state.	2008-05-27 09:18:54 +10:00
Neil Brown	4e5528c6f7	Implement mark_clean for ddf and remove mark_dirty and mark_sync mark_dirty is just a special case of mark_clean - with sync_pos == 0. mark_sync is not required. We don't modify the metadata when sync finishes. Only when the array becomes non-writeable at which point we use mark_clean to record how far the resync progressed.	2008-05-27 09:18:38 +10:00
Dan Williams	8d45d1969b	handle disk failures From: Dan Williams <dan.j.williams@intel.com> Added curr_state as a parameter to set_disk. Handlers look at this to record components failures, and set global 'degraded' or 'failed' status. When reading the state as faulty: 1/ mark the disk failed in the metadata 2/ write '-blocked' to the rdev state to allow the kernel's failure mechanism to advance 3/ the kernel will take away the drive's role in remove_and_add_spares() 4/ once the disk no longer has a role writing 'remove' to the rdev state will get the disk out of array. There is a window after writing '-blocked' where the kernel will return -EBUSY to remove requests. We rely on the fact that the disk will continue to show faulty so we lazily wait until the kernel is ready to remove the disk. If the manager thread needs to get the disk out of the way it can ping the monitor and wait, just like the replace_array() case. [buglet fix: swap the parameters of attr_match in read_dev_state] Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:49 +10:00
Dan Williams	c2a1e7dad7	imsm: update metadata for dirty -> clean or resync -> idle events From: Dan Williams <dan.j.williams@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:47 +10:00
Dan Williams	3dbccbcf5f	keep member disk fd(s) for updates From: Dan Williams <dan.j.williams@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:46 +10:00
Dan Williams	fd7cde1bf0	handle resync completion From: Dan Williams <dan.j.williams@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:42 +10:00
Neil Brown	845dea950c	Add mdmon functions to super-intel.	2008-05-15 16:48:41 +10:00
Dan Williams	0fd5c350e5	set resync_start in Incremental_container From: Dan Williams <dan.j.williams@intel.com> Metadata handlers set mdinfo.resync_start depending on the state of the array. By default mdadm assumes the array is dirty and needs a full resync. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:33 +10:00
Dan Williams	4f5bc4542c	imsm: assemble raid devices From: Dan Williams <dan.j.williams@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:27 +10:00
Dan Williams	f352c54527	imsm: assemble container support From: Dan Williams <dan.j.williams@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:24 +10:00
Dan Williams	cdddbdbca0	imsm: initial Intel(R) Matrix Storage Manager support From: Dan Williams <dan.j.williams@intel.com> The following now work: --examine --examine --brief Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:22 +10:00

... 2 3 4 5 6 ...

323 Commits