mdadm

Commit Graph

Author	SHA1	Message	Date
Lukasz Dorau	6d43efb59b	Manage.c: fix make everything compilation error This patch fixes the following make everything compilation error: Manage.c: In function ‘Manage_add’: Manage.c:538: error: ‘dev_st’ may be used uninitialized in this function make: *** [mdadm.Os] Error 1 Signed-off-by: Lukasz Dorau <lukasz.dorau@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-08-14 09:55:29 +10:00
NeilBrown	d070235d3f	Manage_subdevs: factor out Manage_delete Now Manage_subdevs is now small enough to be manageable. Signed-off-by: NeilBrown <neilb@suse.de>	2012-08-13 08:00:21 +10:00
NeilBrown	38aeaf3af6	Manage_subdevs: split most of 'add' handling into Manage_add. This makes Manage_subdevs smaller, and makes the error-path handling for Manage_add much cleaner and probably less buggy. Signed-off-by: NeilBrown <neilb@suse.de>	2012-08-13 08:00:21 +10:00
NeilBrown	abe94694da	Manage: split out attempt_re_add. The indent level is way too deep here, and this is a well defined task, so split it out to a separate function. Signed-off-by: NeilBrown <neilb@suse.de>	2012-08-13 08:00:21 +10:00
NeilBrown	46d475beb4	Manage_subdev: give 'st' a better name and narrower focus. 'st' is use to examine the metadata on the device being added to see if a 're-add' is possible. However it is loaded long before the 're-add' attempt is made. So move the 'load_super' closer to were it is used - allowing us to discard a number of 'free_super' call - and rename it to 'dev_st' to emphasize that it related to the current device. Signed-off-by: NeilBrown <neilb@suse.de>	2012-08-13 08:00:20 +10:00
NeilBrown	7bd04da926	Manage: minor cosmetic fixes. Signed-off-by: NeilBrown <neilb@suse.de>	2012-08-13 08:00:20 +10:00
NeilBrown	1d9976430c	Manage: simplify device searches in Manage_subdevs We currently have rather hard-to-follow loop to iterate through all the matches for 'missing' or 'faulty' or 'detached'. Simplify it by creating a list of possible devices for each of those and splicing the new list into the device list. This removes the need for 'jnext' and 'next' and various other hacks. Signed-off-by: NeilBrown <neilb@suse.de>	2012-07-09 17:22:16 +10:00
NeilBrown	ba728be72f	Convert 'quiet' to 'not verbose' in various places. If we change some functions to accept 'verbose', where <0 means to be quiet, in place of 'quiet', then we will be able to merge 'quiet' and 'verbose' together for simplicity. Signed-off-by: NeilBrown <neilb@suse.de>	2012-07-09 17:18:09 +10:00
NeilBrown	503975b9d5	Remove scattered checks for malloc success. malloc should never fail, and if it does it is unlikely that anything else useful can be done. Best approach is to abort and let some super-daemon restart. So define xmalloc, xcalloc, xrealloc, xstrdup which don't fail but just print a message and exit. Then use those removing all the tests for failure. Also replace all "malloc;memset" sequences with 'xcalloc'. Signed-off-by: NeilBrown <neilb@suse.de>	2012-07-09 17:14:16 +10:00
NeilBrown	c8e1a230b7	Remove re_add flag in favour of new disposition. Instead of disposition == 'a' re_add == 1 use disposition == 'A' to record that a re-add was requested. Signed-off-by: NeilBrown <neilb@suse.de>	2012-07-09 17:14:16 +10:00
NeilBrown	e7b84f9d50	Introduce pr_err for printing error messages. 'pr_err("' is a lot shorter than 'fprintf(stderr, Name ": ' cont_err() is also available. Signed-off-by: NeilBrown <neilb@suse.de>	2012-07-09 17:14:16 +10:00
NeilBrown	0a999759b5	Relax restrictions on when --add is permitted. The restriction that --add was not allowed on a device which looked like a recent member of an array was overly harsh. The real requirement was to avoid using --add when the array had failed, and the device being added might contain necessary information which can only be incorporated by stopping and re-assembling with --force. So change the test to reflect the need. Reported-by: Doug Ledford <dledford@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2012-04-18 14:19:49 +10:00
NeilBrown	480f356641	Raid limit of 1024 when scanning for devices. When we can for devices using GET_DISK_INFO we currently limit to 1024. But some arrays can have more than this. So raise it to 4096 and make the constant a #define. Signed-off-by: NeilBrown <neilb@suse.de>	2012-04-18 09:06:02 +10:00
NeilBrown	3556c2fafb	Fix typo: wan -> want Signed-off-by: NeilBrown <neilb@suse.de>	2012-04-04 14:02:00 +10:00
NeilBrown	9f58469128	Manage: freeze recovery while adding multiple devices. If the kernel supports it, freeze recovery over multiple adds, so that they can all be added to the array at the same time and be recovered in parallel. Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-22 16:15:03 +11:00
NeilBrown	bcbb3112d2	Manage: replace 'return 1' with 'goto abort'. This will allow exit processing in next patch Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-22 16:07:02 +11:00
NeilBrown	c69ffac0d6	Manage: allow --re-add to failed array. If both "legs" of a RAID1 (or equivalent in RAID10) fail, then one of the becomes available again it maybe appropriate to re-add the failed device(s). So remove the restriction that an array must has 'enough' devices before being re-added, and if there is no-where to read a superblock from for matching, then assume the kernel will do necessary checks. Signed-off-by: NeilBrown <neilb@suse.de>	2012-03-07 15:25:57 +11:00
Jes Sorensen	1471b8b14b	Manage_ro(): Check pointer rather than dereferencing it Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-11-03 08:09:41 +11:00
Jes Sorensen	bccd8153fa	Manage_runstop(): Avoid memory leak Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-11-02 10:48:53 +11:00
Jes Sorensen	b73e45ae6a	Managa_ro(): free() mdi before exiting Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-11-02 10:48:53 +11:00
Jes Sorensen	093d918759	Manage_subdevs(): avoid leaking super Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-11-02 10:48:53 +11:00
Jes Sorensen	d9ca03e9c3	remove_devices(): readlink returns -1 on error Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-11-02 10:48:53 +11:00
Doug Ledford	16715c01f7	Fix readding of a readwrite drive into a writemostly array If you create a two drive raid1 array with one device writemostly, then fail the readwrite drive, when you add a new device, it will get the writemostly bit copied out of the remaining device's superblock into it's own. You can then remove the new drive and readd it as readwrite, which will work for the readd, but it leaves the stale WriteMostly1 bit in devflags resulting in the device going back to writemostly on the next assembly. The fix is to make sure that A) when we readd a device and we might have filled the st->sb info from a running device instead of the device being readded, then clear/set the WriteMostly1 bit in the super1 struct in addition to setting the disk state (ditto for super0, but slightly different mechanism) and B) when adding a clean device to an array (when we most certainly did copy the superblock info from an existing device), then clear any writemostly bits. Signed-off-by: Doug Ledford <dledford@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-09-19 13:06:38 +10:00
NeilBrown	11b391ece9	Discourage large devices from being added to 0.90 arrays. 0.90 arrays can only use up to 4TB per device. So when a larger device is added, complain a bit. Still allow it if --force is given as there could be a valid use. Signed-off-by: NeilBrown <neilb@suse.de>	2011-09-08 13:05:31 +10:00
NeilBrown	9e6d929127	Check all member devices in enough_fd The loop over all member devices in enough_fd could easily stop before it had found all devices. This would cause --re-add to fail incorrectly. So change the loop to be based on the reported number of devices in the device - with a safe-guard limit of 1024. Change some other loops to be more careful too. Reported-by: "Schmidt, Annemarie" <Annemarie.Schmidt@stratus.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-05-23 17:21:35 +10:00
NeilBrown	873eec468c	Manage: minor fix to add/re-add handling. If using an old kernel we should still check if a re-add might be intended, so we can refuse and require a '--zero' first if it is not possible. Signed-off-by: NeilBrown <neilb@suse.de>	2011-05-10 16:20:25 +10:00
NeilBrown	51d9a2ce33	Merge branch 'master' into devel-3.2 Conflicts: Incremental.c Manage.c ReadMe.c inventory mdadm.8.in mdadm.spec mdassemble.8 mdmon.8	2011-03-24 12:00:55 +11:00
NeilBrown	fb0d4b9ca2	--stop: separate 'is busy' test for 'did it stop properly'. Stopping an md array requires that there is no other user of it. However with udev and udisks and such there can be transient other users of md devices which can interfere with stopping the array. If there is a transient users, we really want "mdadm --stop" to wait a little while and retry. However if the array is genuinely in-use (e.g. mounted), then we don't want to wait at all - we want to fail immediately. So before trying to stop, re-open device with O_EXCL. If this fails then the device is probably in use, so give up. If it succeeds, but a subsequent STOP_ARRAY fails, then it is possibly a transient failure, so try again for a few seconds. Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-23 15:42:24 +11:00
Adam Kwolek	c0f8269d57	FIX: Add spare throws exception (v2) sync_metadata() requires st->sb to be loaded, otherwise exception is generated. This fails expansion, because spares cannot be added. metadata update uses tst instead st pointer, it is better than loading anchor for st as I proposed previously. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-20 15:47:17 +11:00
Krzysztof Wojcik	1ae42d9d99	Retry writing 'inactive' state during stopping array Issue observed: Sporadicaly stopping arrays using "mdadm -Ss" command does not succeded. Cause: Writting "inactive" to the array state not succeded- array is busy (accessed by udev, blkid etc.) Resolution: If writing 'inactive' fails, wait and retry again (because it is possibly a transient failure) Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-18 12:42:17 +11:00
Adam Kwolek	983fff45a1	FIX: ping_monitor() usage causes memory leaks When for ping_monitor() input devnum2devname() is used, received string pointer should be passed to free() for memory release. It is not made in several places. This use case should have function to avoid memory leak. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-18 12:32:16 +11:00
NeilBrown	d6221e667f	Manage: fix the mess I made in earlier patch. When I separated the 'native metadata' case more cleanly from the "external metadata" case for adding a drive, I left some 'external' code in the 'native' case, and didn't copy it to the 'external' case. When - in the external case - we add to super, we much check for mdmon first, so we know whether to do the metadata update ourselves or not, then afterwards call either flush_metadata_updates (to send to mdmon) or sync_metadata (to do it directly). Reported-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-18 12:31:45 +11:00
NeilBrown	eb0af52689	--stop: separate 'is busy' test for 'did it stop properly'. Stopping an md array requires that there is no other user of it. However with udev and udisks and such there can be transient other users of md devices which can interfere with stopping the array. If there is a transient users, we really want "mdadm --stop" to wait a little while and retry. However if the array is genuinely in-use (e.g. mounted), then we don't want to wait at all - we want to fail immediately. So before trying to stop, re-open device with O_EXCL. If this fails then the device is probably in use, so give up. If it succeeds, but a subsequent STOP_ARRAY fails, then it is possibly a transient failure, so try again for a few seconds. Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-17 13:35:10 +11:00
NeilBrown	88b496c269	Merge branch 'master' into devel-3.2 Conflicts: Manage.c managemon.c super-ddf.c super-intel.c	2011-03-15 15:35:04 +11:00
NeilBrown	02c39ab1d5	Manage/external: for external metadata, add_to_super needs lock on container. add_to_super could use information from the current superblock (ddf does), so add_to_super for external metadata should be called with the O_EXCL lock held on the container to ensure the update is complete before any other process tries to make any changes (like adding another device to array). Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-15 14:48:20 +11:00
NeilBrown	d6508f0cfb	Manage: be more careful about --add attempts. If an --add is requested and a re-add looks promising but fails or cannot possibly succeed, then don't try the add. This avoids inadvertently turning devices into spares when an array is failed but the devices seem to actually work. Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-10 17:25:40 +11:00
Czarnowska, Anna	0081eb007c	modified message on failure to read metadata in Manage Loading container may fail if e.g. one of the disks in container has been detached but udev has not realized the change. Addition to such array will fail because reading superblock from one of disks in array fails. Current message is a bit confusing. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-21 16:43:41 +11:00
NeilBrown	47573b0015	Fix regression with removing 'failed' and 'detached' devices. If a request to remove all 'failed' or 'detached' devices chooses to remove the first device, it will not actually try the removal and will skip any following devices. This fixes it. Reported-by: Rémi Rérolle <rrerolle@lacie.com> Tested-by: Rémi Rérolle <rrerolle@lacie.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-15 10:45:42 +11:00
NeilBrown	5b660791b4	Fix regression with removing 'failed' and 'detached' devices. If a request to remove all 'failed' or 'detached' devices chooses to remove the first device, it will not actually try the removal and will skip any following devices. This fixes it. Reported-by: Rémi Rérolle <rrerolle@lacie.com> Tested-by: Rémi Rérolle <rrerolle@lacie.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-15 10:45:01 +11:00
NeilBrown	562e70e4c4	Call free_super before attempting to add a new device Now that write_init_super doesn't close fds any more, we need to call free_super before the ADD_NEW_DISK ioctl. Also call free_super before some error returns, for cleanliness. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 13:53:35 +11:00
NeilBrown	1cc7f4feb9	Don't close fds in write_init_super We previously closed all 'fds' associated with an array in write_init_super .. sometimes, and sometimes at bad times. This isn't neat and free_super is a better place to close them. So make sure free_super always closes the fds that the metadata manager kept hold of, and stop closing them in write_init_super. Also add a few more calls to free_super to make sure they really do get closed. Reported-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-25 07:56:53 +11:00
Adam Kwolek	73cb8d43f4	Add spares to raid0 in mdadm When user wants to add spares to container with raid0 arrays only it is not possible to update metadata due to lack of running mdmon. To allow for this direct metadata update by mdadm is used in such case. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-06 18:42:53 +11:00
Anna Czarnowska	d52bb542d4	move_spare function modified and moved to Manage.c It will also be needed for Incremental. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-05 14:34:32 +11:00
NeilBrown	833bb0f8f6	Allow --update=devicesize with --re-add This is useful with 1.1 and 1.2 metadata to update the metadata if the device size has changed. The same functionality can be achieved by writing to the device size in sysfs after re-adding normally, but in some cases this might be easier. Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-09 13:06:29 +11:00
Dan Williams	9ea5a25217	Manage: allow manual control of external raid0 readonly flag mdadm --readwrite <subarray> will clear the external readonly flag ('-' to '/'), but only for redudant arrays. Allow raid0 arrays as well so the user has a simple helper to control this flag. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 15:08:19 +11:00
NeilBrown	0fb69d1dae	Replace various load_super calls with load_container When we call load_super expecting to find a container, we now just call load_container directly. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:06 +11:00
NeilBrown	a655e55064	Improve type names for mddev_dev Remove the _t pointer typedef and remove the _s suffix for the structure, These things do not help readability. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:05 +11:00
NeilBrown	fa56eddbd1	Improve mddev_ident type definitions. Remove the _t typedef and remove the _s suffix from the struct name. These things do not help readability. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:05 +11:00
NeilBrown	a951a4f78f	Pass subarray arg explicitly to ->update_subarray. This is better than hiding it in the supertype structure where we are never quite sure who needs it. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:24:50 +11:00
NeilBrown	4725bc31fb	super_by_fd: return subarray info explicitly. Rather than hiding this in the 'st', return it explicitly. In the one case we still need it, copy it into st where needed. This will disappear in a future patch. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 19:35:25 +11:00
NeilBrown	feab51f8f7	open_subarray: pass subarray name as explicit arg. Rather than hiding this arg in the 'st' structure, pass it explicitly. This is a first step to getting rid of 'subarray' from 'supertype'. The strcpy in open_subarray should have better error checking, but it will disappear soon so there is little point. Signed-off-by: NeilBrown <neilb@suse.de.	2010-11-22 19:35:25 +11:00
NeilBrown	a5d85af748	get_info_super: report which other devices are thought to be working/failed. To accurately detect when an array has been split and is now being recombined, we need to track which other devices each thinks is working. We should never include a device in an array if it thinks that the primary device has failed. This patch just allows get_info_super to return a list of devices and whether they are thought to be working or not. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 19:35:25 +11:00
NeilBrown	8453e70430	Manage: be more careful about --add attempts. If an --add is requested and a re-add looks promising but fails or cannot possibly succeed, then don't try the add. This avoids inadvertently turning devices into spares when an array is failed but the devices seem to actually work. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 19:35:25 +11:00
NeilBrown	93601c439c	Fix spare migration. Spare migration uses major:minor device names. When we added support for kernel style names, we broke that. Signed-off-by: NeilBrown <neilb@suse.de>	2010-08-31 17:21:11 +10:00
NeilBrown	b1702f4826	Don't remove md devices with standard names. If udev is not in use, we create device in /dev when assembling arrays and remove them when stopping the array. However it may not always be correct to remove the device. If the array was started with kernel auto-detect, them mdadm didn't create anything and so shouldn't remove anything. We don't record whether we created things, so just don't remove anything with a 'standard' name. Only remove symlinks to the standard name as we almost certainly created those. Reported-by: Petre Rodan <petre.rodan@avira.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-08-31 15:21:40 +10:00
NeilBrown	f21e18ca89	Compile with -Wextra by default This produced lots of warning, some of which pointed to actual bugs. Signed-off-by: NeilBrown <neilb@suse.de>	2010-08-05 13:13:02 +10:00
Doug Ledford	cfad27a937	Two Minor bug fixes to incremental support One: a single character typo (of instead of or in an error printout) Two: Audited usage of tfd file descriptor. Make sure that the tfd file is always closed after usage, and that the tfd variable is reset to -1 if we are going to continue in our loop (not necessary if we know we will return from our function without going through the dv loop again). Signed-off-by: Doug Ledford <dledford@redhat.com>	2010-07-22 10:16:31 -04:00
NeilBrown	1538aca5cb	Merge branch 'master' of git://github.com/djbw/mdadm	2010-07-06 14:46:47 +10:00
NeilBrown	7d2e6486e3	Add --test option to --re-add and similar --test can be given in Manage mode. This can be used when there is an attempt to fail or remove 'faulty', 'failed' or 'detached' devices, or to re-add 'missing' devices. If no devices were failed, removed, or re-added, then mdadm will exit with status '2'. Signed-off-by: NeilBrown <neilb@suse.de>	2010-07-06 12:07:07 +10:00
NeilBrown	a4e13010df	Add support for "--re-add missing" If the device name "missing" is given for --re-add, then mdadm will attempt to find any device which should be a member of the array but currently isn't and will --re-add it to the array. This can be useful if a device disappeared due to a cabling problem, and was then re-connected. The appropriate sequence would be mdadm /dev/mdX --fail detached mdadm /dev/mdX --remove detached mdadm /dev/mdX --re-add missing Signed-off-by: NeilBrown <neilb@suse.de>	2010-07-06 12:06:11 +10:00
NeilBrown	b3b4e8a7a2	Avoid skipping devices where removing all faulty/detached devices. When using 0.90 metadata, devices can be renumbered when earlier devices are removed. So when iterating all devices looking for 'failed' or 'detached' devices, we need to re-check the same slot we checked last time to see if maybe it has a different device now. Reported-by: Jim Paris <jim@jtan.com> Resolves-Debian-Bug: 587550 Signed-off-by: NeilBrown <neilb@suse.de>	2010-06-30 17:20:38 +10:00
NeilBrown	29ba480497	Add -fail support to --incremental This can be used for hot-unplug. When a device has been remove, udev can call mdadm --incremental --fail sda and mdadm will find the array holding sda and remove sda from the array. Based on code from Doug Ledford <dledford@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-06-30 16:55:17 +10:00
NeilBrown	98d27e3964	Support fail/remove using kernel name Allow kernel names like "sda" and "hdb1" to be used to fail/remove devices from an array. This is useful as after a device has been removed it can be difficult to get the major/minor number. Signed-off-by: NeilBrown <neilb@suse.de>	2010-06-30 16:55:17 +10:00
Dan Williams	aa534678ba	Rename subarray v2 Allow the name of the array stored in the metadata to be updated. In some cases the metadata format may not be able to support this rename without modifying the UUID. In these cases the request will be blocked. Otherwise we allow the rename to take place, even for active arrays. This assumes that the user understands the difference between the kernel node name, the device node symlink name, and the metadata specific name. Anticipating further need to modify subarrays in-place, introduce the ->update_subarray() superswitch method. A future potential use case is setting storage pool (spare-group) identifiers. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-22 16:30:59 -07:00
Dan Williams	4eb269706f	Create: cleanup after failed create in duplicated array member case mdadm prevents creation when device names are duplicated on the command line, but leaves the partially created array intact. Detect this case in the error code from add_to_super() and cleanup the partially created array. The imsm handler is updated to report this conflict in add_to_super_imsm_volume(). Note that since neither mdmon, nor userspace for that matter, ever saw an active array we only need to perform a subset of the cleanup actions. So call ioctl(STOP_ARRAY) directly and arrange for Create() to cleanup the map file rather than calling Manage_runstop(). Reported-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-04-19 15:28:07 +10:00
NeilBrown	430ea469ad	Stop: done stop a container which still have members active. Doing that is just confusing... Signed-off-by: NeilBrown <neilb@suse.de>	2010-03-09 16:32:04 +11:00
NeilBrown	5a9de8db04	Manage: fix regression on removing detached devices. If /dev is static, a name may remain there after the device has been detached from the system. Using 'mdadm' to remove such a device from the array should still work (even though "mdadm --remove detached" might be preferred). So when processing a device for '-r', don't insist on being able to open the device. Signed-off-by: NeilBrown <neilb@suse.de>	2010-03-03 13:16:24 +11:00
NeilBrown	c1e3ab8c1e	Merge branch 'master' of git://github.com/djbw/mdadm	2009-12-30 13:42:37 +11:00
Dan Williams	2904b26f05	Support external metadata recovery-resume Minimal changes needed to permit reassembling partially recovered external metadata arrays. The biggest logical change is that ->container_content() can now surface partially rebuilt members rather than omitting them from the disk list. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-12-21 12:51:57 -07:00
Dan Williams	d23534e464	Teach sysfs_add_disk() callers to use ->recovery_start versus 'insync' parameter Also fixup 'in_sync' versus 'insync' typo. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-12-21 11:26:21 -07:00
NeilBrown	5c64fcb535	Don't attempt a re-add if the device is marked as faulty. If a device is marked as faulty, then a re-add will cause it to be added as a faulty drive, which is not what it wanted. So just refuse to try to re-add a device which is marked 'faulty'. Signed-off-by: NeilBrown <neilb@suse.de>	2009-12-08 17:26:31 +11:00
NeilBrown	751fd6c093	Don't silently map --re-add to --add As --add can destroy important data on a disk, and --re-add is not suppose to, it is wrong to silently try --add if --re-add fails. So print a message and abort instead. Signed-off-by: NeilBrown <neilb@suse.de>	2009-11-17 13:15:34 +11:00
NeilBrown	4a997737a1	Merge branch 'master' into devel-3.1	2009-10-22 11:13:13 +11:00
NeilBrown	0eb26465c0	Free some malloced memory that wasn't being freed. As mdadm is normally a short-lived program it isn't always necessary to free memory that was allocated, as the 'exit()' call will automatically free everything. But it is more obviously correct if the 'free' is there. So this patch add a few calls to 'free' Signed-off-by: NeilBrown <neilb@suse.de>	2009-10-22 11:00:56 +11:00
NeilBrown	ca4f89a3b7	Merge branch 'master' into devel-3.1 Conflicts: mdadm.8	2009-10-01 16:58:40 +10:00
NeilBrown	d823a6c872	Remove Manage_reconfing in favour of Grow_reshape Bother Manage_reconfig and Grow_reshape provide for changing the 'layout' of a faulty array. This is no necessary. So discard Manage_reconfig and just use Grow_reshape Signed-off-by: NeilBrown <neilb@suse.de>	2009-07-14 12:11:31 +10:00
NeilBrown	3b435195fc	Merge branch 'master' into devel-3.0 Conflicts: super0.c super1.c	2009-06-02 15:28:36 +10:00
NeilBrown	e736b62389	Update copyright dates and remove references to @cse.unsw.edu.au Also removed 'paper' addresses. Signed-off-by: NeilBrown <neilb@suse.de>	2009-06-02 14:35:45 +10:00
NeilBrown	c7b474474c	Monitor: support spare-group manipulation for 1.x metadata. The code for moving spares around a spare-group currently only works for 0.90 metadata. Generalise it for 1.x metadata as well. Reported-by: "Garth Snyder" <garth@grsweb.us> Signed-off-by NeilBrown <neilb@suse.de>	2009-05-12 09:49:45 +10:00
NeilBrown	8320878543	Merge branch 'master' into devel-3.0 Conflicts: Build.c mdadm.c mdadm.h super1.c	2009-05-11 16:05:41 +10:00
NeilBrown	c9f39c1b9b	re-add error check	2009-05-05 21:13:29 +10:00
NeilBrown	462906cdee	incremental_container: preserve 'in_sync' flag when adding to existing array. When building container members with -IR, we need to ensure that devices added to an active array preserve the 'in_sync' status so they don't needlessly get rebuilt. So allow sysfs_add_disk to do this (only works in kernels since 2.6.30) and pass the relevant flag down. Signed-off-by: NeilBrown <neilb@suse.de>	2009-04-14 10:19:02 +10:00
Dan Williams	85f9b5f798	Manage: permit '--remove detached' for containers Skip the unique holder check in the detached case... pretty sure no one is holding on to it if open() returns ENXIO. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-24 18:45:57 -07:00
NeilBrown	208933a7a8	Tidy error messages for add_to_super failure. Make sure every failure from add_to_super prints a suitable error message, and then don't print any error in the caller. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-27 15:39:59 +11:00
Dan Williams	f20c396836	allow add_to_super to return errors Prepare add_to_super to validate disks against the platform capabilities Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-11-27 15:30:39 +11:00
Dan Williams	ef70b9f4ef	fix remove_devices() Don't write to 'pe' if 'path' was not specified Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-11-08 16:03:07 -07:00
NeilBrown	d49410d3ab	Stop: drop any partitions that may be associated with an array when stopping it. Not all kernels automatically discard partitions when the array is stopped, so call the RRPART ioctl to force it. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 21:56:42 +11:00
NeilBrown	4ccad7b163	Manage: when stopping an array, delete all names from /dev. This only applies if udev isn't installed or is disabled by MDADM_NO_UDEV We try to remove partitions too. We find names to remove by looking in /var/run/mdadm/map Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 20:50:39 +11:00
NeilBrown	9759037678	Generate 'change' uevents when arrays change in non-obvious ways. When a 'container' gets started, we need udev to notice, but the kernel has no way of knowing that a KOBJ_CHANGE event is needed. So send one directly via the 'uevent' sysfs attribute. Also, uevents don't get generated when md arrays are stopped (prior to 2.6.28) so send 'change' events then too. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 20:50:39 +11:00
NeilBrown	b01b06bda8	Merge branch 'master' into devel-3.0 Conflicts: Create.c Manage.c	2008-10-27 10:10:08 +11:00
NeilBrown	b3d3195538	Allow WRITEMOSTLY to be cleared on --readd using --readwrite. Previously it was possible to set the WRITEMOSTLY flag when adding a device to an array, but not to clear the flag when re-adding. This is now possible with --readwrite. Signed-off-by: NeilBrown <neilb@suse.de>	2008-10-25 18:20:49 +11:00
NeilBrown	492350045c	Merge branch 'master' into devel-3.0 Conflicts: Manage.c	2008-10-17 12:46:23 +11:00
NeilBrown	2a528478c7	Manage: allow adding device that is just large enough to v1.x array. When adding a device to an array, we check that it is large enough. Currently the check makes sure there is also room for a reasonably sized bitmap. But if the array doesn't have a bitmap, then this test might be too restrictive. So when adding, only insist there is enough space for the current bitmap. When Creating, still require room for the standard sized bitmap. This resolved Debian Bug 500309	2008-10-13 16:15:16 +11:00
NeilBrown	0e60042683	Compile fixes, particularly moving more stuff under MDASSEMBLE Now 'make everything' works again.	2008-09-18 15:04:47 +10:00
Dan Williams	313a4a82f1	ping_manager() to prevent 'add' before 'remove' completes It is currently possible to remove a device and re-add it without the manager noticing, i.e. without detecting a mdstat->devcnt container->devcnt mismatch. Introduce ping_manager() to arrange for mdmon to run manage_container() prior to mdadm dropping the exclusive open() on the container. Despite these precautions sysfs_read() may still fail. If this happens invalidate container->devcnt to ensure manage_container() runs at the next event. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:43 -07:00
Dan Williams	c94709e83f	Add ping_monitor() to mdadm --wait The action we are waiting for may not be complete until the monitor has had a chance to take action on the result. The following script can now remove the device on the first attempt, versus a few attempts with the original Wait(): #!/bin/bash #export MDADM_NO_MDMON=1 export IMSM_DEVNAME_AS_SERIAL=1 ./mdadm -Ss ./mdadm --zero-superblock /dev/loop[0-3] echo 2 > /proc/sys/dev/raid/speed_limit_max ./mdadm --create /dev/imsm /dev/loop[0-3] -n 4 -e imsm -a md ./mdadm --create /dev/md/r1 /dev/loop[0-3] -n 4 -l 5 --force -a mdp ./mdadm --fail /dev/md/r1 /dev/loop3 ./mdadm --wait /dev/md/r1 x=0 while ! ./mdadm --remove /dev/imsm /dev/loop3 > /dev/null 2>&1 do x=$((x+1)) done echo "removed after $x attempts" ./mdadm --add /dev/imsm /dev/loop3 Include 2 small cleanups: * remove the almost open coded fd2devnum() in Wait() by introducing a new utility routine stat2devnum() * teach connect_monitor() to parse the container device from a subarray string Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00
NeilBrown	e9dd159873	Allow an externally managed array to be marked readonly If the metadata_version is -mdXXX/whatever rather than /mdXXX/whatever then the array is readonly and should be left alone by mdmon. Signed-off-by: NeilBrown <neilb@suse.de>	2008-08-19 17:55:15 +10:00
NeilBrown	3c558363a1	Factor out test for subarray version string. We are about to change the syntax of the version string for 'subarray's. So factor out the test into a single function. Signed-off-by: NeilBrown <neilb@suse.de>	2008-08-19 17:55:15 +10:00
Dan Williams	43dad3d6fb	mdadm: add device to a container Adding a device updates the container and then mdmon takes action upon noticing a change in devices. This reuses the container version of add_to_super to create a new record for the device. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2008-08-19 17:19:51 +10:00
NeilBrown	37ea3936a6	Merge branch 'master' into from-stable Conflicts: Create.c Manage.c	2008-08-07 14:12:25 +10:00

206 Commits