mdadm

Author	SHA1	Message	Date
NeilBrown	d2db304558	Add action=spare-same-slot policy. When "mdadm -I" is given a device with no metadata, mdadm tries to add it as a 'spare' somewhere based on policy. This patch changes the behaviour in two ways: 1/ If the device is at a 'path' where a previous device was removed from an array or container, then we preferentially add the spare to that array or container. 2/ Previously only 'bare' devices were considered for adding as spares. Now if action=spare-same-slot is active, we will add non-bare devices, but only if the path was previously in use for some array, and the device will only be added to that array. Based on code From: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:06 +11:00
NeilBrown	625b25071b	incr/spare: recheck allowed action for each metadata. The current act_spare tests only test if it is allowed for some metadata. As we check each array or partitioning type, we need to double-check that sparing is allowed for that array or partitioning type. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:06 +11:00
NeilBrown	6e57f80a90	Incr/spare: make sure failure to identify metadata if handled gracefully. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:06 +11:00
NeilBrown	aaccda4406	Incr: fix up return value in try_spare We only want to try partition_try_spare if array_try_spare failed. If it succeeded, there is nothing more to try. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:06 +11:00
NeilBrown	52e965c296	Factor out is_bare test. Instead of open coding (and using horrible gotos), make this a separate function. Also fix the check for end of device - SEEK_END doesn't work on block devices. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:06 +11:00
Przemyslaw Czarnowski	403410eb97	extension of IncrementalRemove to store location (path-id) of removed device If the disk is taken out from its port this port information is lost. Only udev rule can provide us with this information, and then we have to store it somehow. This patch adds writing 'cookie' file in /dev/.mdadm/failed-slots directory in form of file named with value of f<path-id> containing the metadata type and uuid of the array (or container) that the device was a member of. The uuid is in exactly the same format as in the mapfile. FAILED_SLOTS_DIR constant has been added to hold the location of cookie files. Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:06 +11:00
NeilBrown	08387a0473	Teach IncrementalRemove about containers. When we -I -R a device in a container, we must first fail it from each member array before we can remove it from the container. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:06 +11:00
Przemyslaw Czarnowski	950bc34477	added --path <path_id> to give the information on the 'path-id' of removed device <path-id> allows to identify the port to which given device is plugged in. In case of hot-removal, udev can pass this information for future use (eg. write this name as 'cookie' allowing to detect the fact of reinserting device to the same port). --path <path-id> parameter has been added to device removal handle (and char *path has been added to IncrementalRemove() to pass this value) in order to pass path-id to this handler. Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:06 +11:00
NeilBrown	3a3716107b	Add must_be_container helper. This checks a block device to see if it could be a container, and in particular cannot be a member device. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:06 +11:00
NeilBrown	a655e55064	Improve type names for mddev_dev Remove the _t pointer typedef and remove the _s suffix for the structure, These things do not help readability. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:05 +11:00
NeilBrown	fa56eddbd1	Improve mddev_ident type definitions. Remove the _t typedef and remove the _s suffix from the struct name. These things do not help readability. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:05 +11:00
NeilBrown	47c74f3f50	Use load_container in Incremental assembly. We more clearly separate out -I on a container, and use load_container in that case and load_super only for true members. This removes another use of loaded_container. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:57:58 +11:00
NeilBrown	3a97f21010	Incremental: Factor out search of mdstat As we will soon use it in two places. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:24:50 +11:00
NeilBrown	7d91c3f547	Make Incremental_container static as it is only used in Incremental.c Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:24:50 +11:00
NeilBrown	00bbdbdac6	Add subarray arg to container_content. This allows the info for a single array to be extracted, so we don't have to write it into st->subarray. For consistency, implement container_content for super0 and super1, to just return the mdinfo for the single array. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 19:35:26 +11:00
NeilBrown	1d1a9f87a4	Incremental - fix small bug in count_active. If the first device found has a much smaller event count than a subsequent device, that device will not be entered in the 'avail' array properly. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 19:35:25 +11:00
NeilBrown	a5d85af748	get_info_super: report which other devices are thought to be working/failed. To accurately detect when an array has been split and is now being recombined, we need to track which other devices each thinks is working. We should never include a device in an array if it thinks that the primary device has failed. This patch just allows get_info_super to return a list of devices and whether they are thought to be working or not. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 19:35:25 +11:00
NeilBrown	4e8d9f0a16	Convert 'auto' config line to policy statements	2010-09-06 11:26:28 +10:00
NeilBrown	61018da020	Add support for auto-partitioning base devices. If a device is bare and policy suggests that it can be used as a spare for virtual 'partitions' array, find an appropriate partition table and write it to the device. Signed-off-by: NeilBrown <neilb@suse.de>	2010-09-06 11:26:28 +10:00
NeilBrown	56e8be854a	First steps to supporting auto-spare-add to groups of partitioned devices. Adding a spare to a group of partitioned devices is quite different from adding one to an array. So detect which option is worth trying based on policy and then try one or the other - or possibly both - as appropriate. Signed-off-by: NeilBrown <neilb@suse.de>	2010-09-06 11:26:28 +10:00
NeilBrown	0f22b998fb	Add mbr pseudo metadata handler. To support incorpating a new bare device into a collection of arrays - one partition each - mdadm needs a modest understanding of partition tables. The main needs to be able to recognise a partition table on one device and copy it onto another. This will be done using pseudo metadata types 'mbr' and 'gpt'. Signed-off-by: NeilBrown <neilb@suse.de>	2010-09-06 11:26:28 +10:00
NeilBrown	f08605b3ad	Allow --incremental to add a device as a spare if policy allows. If policy allows act_spare or act_force_spare, -I will add a bare device as a spare to an appropriate array. We don't support adding non-bare devices as spares yet. Signed-off-by: NeilBrown <neilb@suse.de>	2010-09-06 11:26:27 +10:00
NeilBrown	7e83544bc4	Use action policy to keep recently-disconnected devices in the array. When we find a device that was recently part of the array but is now out of date (based on the event count) we might want to add it back in (like --re-add) if the likely cause was a connection problem or we might not if the likely cause was device failure. So make this a policy issue: if action=re-add or better, try to re-add any device that looks like it might be part of the array. This applies: when we assemble the array: old devices will be evicted by the kernel and need to be re-added. when we assemble the array during --incr for the same reason. when we find a device that could be added to a running array. This doesn't affect arrays with external metadata at all. For such arrays: When the container is assembled, the most recent instance of each device is included without reference to whether it is too old or not. Then the metadata handler must which slices of which devices to include in which array and with what state. So the ->container_content should probably check the policy and compare the sequence numbers/event counts. When a device is added (--add) to a container with active arrays we only add as a 'spare'. --re-add doesn't seem to be an option. When a device is added with -I ->container_content gets another chance to assess things again. So again it should check the policy. Signed-off-by: NeilBrown <neilb@suse.de>	2010-09-06 11:26:27 +10:00
NeilBrown	15d4a7e447	Introduce single-exit pattern for Incremental All exits should goto the end for clean-up. Signed-off-by: NeilBrown <neilb@suse.de>	2010-09-06 11:26:27 +10:00
NeilBrown	ef83fe7cba	Allow --incremental to add spares to an array. Commit `3a6ec29ad5` stopped us from adding apparently-working devices to an active array with --incremental as there is a good chance that they are actually old/failed devices. Unfortunately it also stopped spares from being added to an active array, which is wrong. This patch refines the test to be more careful. Reported-by: <fibreraid@gmail.com> Analysed-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-08-12 11:41:41 +10:00
Dan Williams	fd4c9ba491	Incremental: return success in 'container not enough' case Commit `97b4d0e9` "Incremental: honor an 'enough' flag from external handlers" introduced a regression in that it changed the error return code for successful invocations. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Reported-by: Ignacy Kasperowicz <ignacy.kasperowicz@intel.com>	2010-08-10 08:44:45 -07:00
Doug Ledford	93c861ee57	Add warnings if we ever fail to get a lock on the mapfile. Signed-off-by: Doug Ledford <dledford@redhat.com>	2010-07-22 10:16:31 -04:00
NeilBrown	8562409dd1	Merge branch 'master' of git://github.com/djbw/mdadm	2010-07-22 17:43:35 +10:00
Dan Williams	1dccfff910	Incremental: restore assembly for inactive containers, block active GET_ARRAY_INFO always succeeds on an inactive container, so we need to be a bit more diligent about adding a disk to an active container. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-07-19 14:59:25 -07:00
Przemyslaw Czarnowski	aae3cdc35a	fix: IncrementalRemove leaves open handle Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com<mailto:przemyslaw.hawrylewicz.czarnowski@intel.com>>	2010-07-06 16:47:02 +10:00
NeilBrown	1538aca5cb	Merge branch 'master' of git://github.com/djbw/mdadm	2010-07-06 14:46:47 +10:00
NeilBrown	7d2e6486e3	Add --test option to --re-add and similar --test can be given in Manage mode. This can be used when there is an attempt to fail or remove 'faulty', 'failed' or 'detached' devices, or to re-add 'missing' devices. If no devices were failed, removed, or re-added, then mdadm will exit with status '2'. Signed-off-by: NeilBrown <neilb@suse.de>	2010-07-06 12:07:07 +10:00
NeilBrown	3a6ec29ad5	Don't let incremental add devices to active arrays. Adding devices to active arrays in --incremental is a bit dubious. Normally the array won't be activated until all expected devices are present, so this situation would mean that the given device is not expected, so is probably failed. In that case it should only be added by explicit sysadmin request. However if --run was given, then quite possibly the array was assembled earlier when not complete, so it is less clear whether it is wrong to add this device or not. In that case add it as that is generally safest. It would be nice to allow policy for this to be explicitly given by sysadmin. Signed-off-by: NeilBrown <neilb@suse.de>	2010-07-06 12:04:40 +10:00
NeilBrown	29ba480497	Add -fail support to --incremental This can be used for hot-unplug. When a device has been remove, udev can call mdadm --incremental --fail sda and mdadm will find the array holding sda and remove sda from the array. Based on code from Doug Ledford <dledford@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-06-30 16:55:17 +10:00
Dan Williams	b526e52dc7	Always assume SKIP_GONE_DEVS behaviour and kill the flag ...i.e. GET_DEVS == (GET_DEVS\|SKIP_GONE_DEVS) A null pointer dereference in Incremental.c can be triggered by replugging a disk while the old name is in use. When mdadm -I is called on the new disk we fail the call to sysfs_read(). I audited all the locations that use GET_DEVS and it appears they can tolerate missing a drive. So just make SKIP_GONE_DEVS the default behaviour. Also fix up remaining unchecked usages of the sysfs_read() return value. Reported-by: Dave Jiang <dave.jiang@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-16 17:26:04 -07:00
Dan Williams	3288b419b9	Revert "Incremental: honor --no-degraded to delay assembly" This reverts commit `fdb482f99b`. Now that containers can report state for ->container_enough we can automatically determine when the array can be started, and no longer need the --no-degraded hammer. Conflicts: Incremental.c Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-05-26 13:25:47 -07:00
Dan Williams	97b4d0e971	Incremental: honor an 'enough' flag from external handlers This is needed for imsm where: 1/ we want to report raid_disks as zero to allow mdadm -As to incorporate all spares 2/ we can't determine stale disks by looking at the event counts. 3/ we can't see per-subarray expectations with the info returned from the container level ->getinfo_super() Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-05-26 13:22:36 -07:00
NeilBrown	d1d3482b56	config: add 'homehost' option to 'AUTO' line. This allows basing auto-assembly decisions on whether the array is recorded as belonging to this host or not. Signed-off-by: NeilBrown <neilb@suse.de>	2010-03-03 14:33:55 +11:00
NeilBrown	e736b62389	Update copyright dates and remove references to @cse.unsw.edu.au Also removed 'paper' addresses. Signed-off-by: NeilBrown <neilb@suse.de>	2009-06-02 14:35:45 +10:00
NeilBrown	2400e6eb21	Incr: use devname_matches to when looking in mdadm.conf for bitmap file This is more likely to always do the right thing than a strcmp. Signed-off-by: NeilBrown <neilb@suse.de>	2009-05-11 15:47:11 +10:00
NeilBrown	ac7de9d97a	Incremental: fix uninitialised variable. st2 might not be initialised at this point. So use the more correct 'st'. Signed-off-by: NeilBrown <neilb@suse.de>	2009-05-11 15:47:10 +10:00
NeilBrown	339c2d6c5e	Incr: cope better with possibility that mp->path might be NULL Signed-off-by: NeilBrown <neilb@suse.de>	2009-05-11 15:47:10 +10:00
NeilBrown	7cdc087234	Be more consistent about keeping the host: prefix on array names. If an array name contains a "hostname:" prefix, then --assemble will tend to leave it there, while --incremental will strip it off (when chosing a device name during auto-assembly). Make this more consistent: strip the name off if we decide that the name will be treated as 'local'. Leave it on if it will be treated as 'foreign'. Signed-off-by: NeilBrown <neilb@suse.de>	2009-05-11 15:47:10 +10:00
NeilBrown	0ac91628b9	Allow homehost to be largely ignored when assembling arrays. If mdadm.conf contains HOMEHOST <ignore> or commandline contains --homehost=<ignore> then the check that array metadata mentions the given homehost is replace by a check that the name recorded in the metadata is not already used by some other array mentioned in mdadm.conf. This allows more arrays to use their native name rather than having an _NN suffix added. This should only be used during boot time if all arrays required for normal boot are listed in mdadm.conf. If auto-assembly is used to find all array during boot, then the HOMEHOST feature should be used to ensure there is no room for confusion in choosing array names, and so it should not be set to <ignore>. Signed-off-by: NeilBrown <neilb@suse.de>	2009-05-11 15:46:46 +10:00
NeilBrown	05833051ee	Assemble/Incr : minor tidy up of setting 'trustworthy'. Signed-off-by: NeilBrown <neilb@suse.de>	2009-05-11 15:19:30 +10:00
NeilBrown	31015d5798	conf/assemble: new config line "auto". The line 'auto' in mdadm.conf can be used to disable assembly of specific metadata types, or of all arrays. This does not affect assembly of arrays listed in mdadm.conf or on command line. auto -all will disable all auto-assembly. auto -ddf will cause mdadm to ignore ddf arrays that are not explicitly mentioned, and auto assemble anything else it finds. Signed-off-by: NeilBrown <neilb@suse.de>	2009-05-11 15:17:33 +10:00
NeilBrown	112cace627	config: support "ARRAY <ignore> ..." lines in mdadm.conf Sometimes we want to ensure particular arrays are never assembled automatically. This might include an array made of devices that are shared between hosts. To support this, allow ARRAY lines in mdadm.conf to use the word "ignore" rather than a device name. Arrays which match such lines are never automatically assembled (though they can still be assembled by explicitly giving identification information on the mdadm command line. Signed-off-by: NeilBrown <neilb@suse.de>	2009-05-11 15:17:05 +10:00
NeilBrown	745f72f61a	assemble: support arrays created with --homehost=any If an array is created with --homehost=any, then --assemble and --incremental will treat it as being local to 'this' host, no matter what the name of this host is. This is useful for array that will be given unique names and be moved between machines. This needs to be documented. Signed-off-by: NeilBrown <neilb@suse.de>	2009-05-11 15:16:49 +10:00
NeilBrown	7c5483270d	Incremental - avoid NULL dereference. There structure returned by sysfs_read might not have any 'devs', don't assume it does. Signed-off-by: NeilBrown <neilb@suse.de>	2009-04-07 17:54:09 +10:00
NeilBrown	03b7f6c6bd	Incremental: be more relaxed about member arrays not completely assembling. During incremental assembly, if the member array doesn't assemble properly (yet), that isn't an error. Signed-off-by: NeilBrown <neilb@suse.de>	2009-04-07 17:49:05 +10:00

1 2 3

108 Commits