mdadm

Author	SHA1	Message	Date
Doug Ledford	cfad27a937	Two Minor bug fixes to incremental support One: a single character typo (of instead of or in an error printout) Two: Audited usage of tfd file descriptor. Make sure that the tfd file is always closed after usage, and that the tfd variable is reset to -1 if we are going to continue in our loop (not necessary if we know we will return from our function without going through the dv loop again). Signed-off-by: Doug Ledford <dledford@redhat.com>	2010-07-22 10:16:31 -04:00
Doug Ledford	753cf90512	Fix all the confusion over directories once and for all. We now have 3 directory definitions: mdmon directory for its pid and sock files (compile time define, not changable at run time), mdmonitor directory which is for the mdadm monitor mode pid file (can only be passed in via command line at the time mdadm is invoked in monitor mode), and the directory for the mdadm incremental assembly map file (compile time define, not changable at run time). Only the mdadm map file still hunts multiple locations, and the number of locations has been reduced to /var/run and the compile time specified location. Re-use of similar sounding defines that actually didn't denote their actual usage at compile time made it more difficult for a person to know what affect changing the compile time defines would have on the resulting programs. This patch renames the various defines to clearly identify which item the define affects. It also reduces the number of various directories which will be searched for these files as this has lead to confusion in mdadm and mdmon in terms of which files should take precedence when files exist in multiple locations, etc. It's best if the person compiling the program intentionally and with planning selects the right directories to be used for the various purposes. Which directory is right depends on which items you are talking about and what boot loader your system uses and what initramfs generation program your system uses. Because of the inter-dependency of all these items it would typically be up to the distribution that mdadm is being integrated into to select the correct values for these defines. Signed-off-by: Doug Ledford <dledford@redhat.com>	2010-07-22 10:16:30 -04:00
NeilBrown	8562409dd1	Merge branch 'master' of git://github.com/djbw/mdadm	2010-07-22 17:43:35 +10:00
NeilBrown	c43f7d91cc	Don't report Used Dev Size for RAID0. This number isn't meaningful for RAID0 as a different amount of space might be used from each device. It isn't meaningful for linear either, but already was not reported for linear. Detail doesn't report it either. So make --examine not report it. Signed-off-by: NeilBrown <neilb@suse.de> Reported-by: Mario 'BitKoenig' Holbe <Mario.Holbe@TU-Ilmenau.DE>	2010-07-22 15:45:18 +10:00
NeilBrown	3e4165619b	Cast to long long before left-shifting too much. When left-shifting we must be sure that the value being shifted is large enough to not lose bits. The 'chunkssize' in CreateBitmap is only 'long' so it can overflow. So cast to 'long long' first. Also fix a similar issue in Detail even though it isn't currently being compiled. Signed-off-by: NeilBrown <neilb@suse.de> Reported-by: Tomasz Chmielewski <mangoo@wpkg.org>	2010-07-22 15:35:54 +10:00
Dan Williams	1dccfff910	Incremental: restore assembly for inactive containers, block active GET_ARRAY_INFO always succeeds on an inactive container, so we need to be a bit more diligent about adding a disk to an active container. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-07-19 14:59:25 -07:00
NeilBrown	50526e9090	super-0.90: don't write bitmap larger than 60K The 4K superblock can be as close as 64K from the end of the device. As the bitmap (with header) lives after the superblock (with 0.90 metadata) there could be as little as 60K of space. So limit the bitmaps to 59.5K, and only write 60K including the header. The bug fixed here means that bitmaps cannot be created on devices which are exact multiples of 64K in size Signed-off-by: NeilBrown <neilb@suse.de>	2010-07-07 21:31:33 +10:00
NeilBrown	3d5279b053	Improve --re-add documentation and add the fact that --test can now be used with --manage operations. Signed-off-by: NeilBrown <neilb@suse.de>	2010-07-07 13:35:07 +10:00
Dan Williams	569cc43ffb	imsm: fix a -O2 build warning super-intel.c: In function ‘imsm_add_spare’: super-intel.c:4833: error: ‘array_start’ may be used uninitialized in this function super-intel.c:4834: error: ‘array_end’ may be used uninitialized in this function This is valid, if we don't find a spare candidate then array_{start,end} will be uninitialized. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-07-06 12:48:59 -07:00
Dan Williams	f4190c2f12	mdmon: satisfy glibc tls abi requirements with pthreads Setting up a proper tls descriptor is required to conform to the abi [1]. Until it can be implemented in mdmon use pthreads instead of clone(2) to let glibc handle the details. The old behaviour can be had by un-defining USE_PTHREADS. Note, the "O2" builds need LDFLAGS now to pick up the '-pthread' option. [1]: http://people.redhat.com/drepper/tls.pdf Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-07-06 12:48:56 -07:00
Marcin Labun	0ad6835c98	Fix the count of member devices in mdstat_read function. Correction of the number of container or volume member devices (devcnt in struct mdstat_ent). The number after the last devices was counted towards member of devices. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-07-06 18:02:22 +10:00
Przemyslaw Czarnowski	aae3cdc35a	fix: IncrementalRemove leaves open handle Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com<mailto:przemyslaw.hawrylewicz.czarnowski@intel.com>>	2010-07-06 16:47:02 +10:00
NeilBrown	1538aca5cb	Merge branch 'master' of git://github.com/djbw/mdadm	2010-07-06 14:46:47 +10:00
NeilBrown	7d2e6486e3	Add --test option to --re-add and similar --test can be given in Manage mode. This can be used when there is an attempt to fail or remove 'faulty', 'failed' or 'detached' devices, or to re-add 'missing' devices. If no devices were failed, removed, or re-added, then mdadm will exit with status '2'. Signed-off-by: NeilBrown <neilb@suse.de>	2010-07-06 12:07:07 +10:00
NeilBrown	a4e13010df	Add support for "--re-add missing" If the device name "missing" is given for --re-add, then mdadm will attempt to find any device which should be a member of the array but currently isn't and will --re-add it to the array. This can be useful if a device disappeared due to a cabling problem, and was then re-connected. The appropriate sequence would be mdadm /dev/mdX --fail detached mdadm /dev/mdX --remove detached mdadm /dev/mdX --re-add missing Signed-off-by: NeilBrown <neilb@suse.de>	2010-07-06 12:06:11 +10:00
NeilBrown	3a6ec29ad5	Don't let incremental add devices to active arrays. Adding devices to active arrays in --incremental is a bit dubious. Normally the array won't be activated until all expected devices are present, so this situation would mean that the given device is not expected, so is probably failed. In that case it should only be added by explicit sysadmin request. However if --run was given, then quite possibly the array was assembled earlier when not complete, so it is less clear whether it is wrong to add this device or not. In that case add it as that is generally safest. It would be nice to allow policy for this to be explicitly given by sysadmin. Signed-off-by: NeilBrown <neilb@suse.de>	2010-07-06 12:04:40 +10:00
NeilBrown	e5c99c0811	Assemble: Fix honouring of 'auto' config line commit `1ff9833928` broke the checking of metadata types via the 'auto' line. Be moving 'load_super" before "conf_test_metadata" we left tst->sb set even if conf_test_metadata fails, so the device will actually be accepted and used. So if we decide to reject the device, free the superblock so it is clear that it is rejected. Signed-off-by: NeilBrown <neilb@suse.de>	2010-07-06 11:57:09 +10:00
Dan Williams	d19e3cfb66	Merge branch 'fixes' into for-neil	2010-07-01 17:36:11 -07:00
Dan Williams	8cfc801c72	Merge branch 'subarray' into for-neil Conflicts: mdadm.h super-intel.c	2010-07-01 17:36:05 -07:00
Dan Williams	23eb475a96	mdmon: prevent allocations due to late binding Current versions of glibc do not provide a useable interface to clone(2) as it inflicts hidden dependencies on setting up a glibc specific tls descriptor. The dynamic linker trips this dependency and causes mdmon to intermittently fail to load. Resolving all dynamic linking prior to starting the monitor thread appears to mitigate the issue but there is no guarantee that another tls dependency will bite us later. However, while the debate continues with the glibc maintainers it seems prudent to keep this change. It ensures that we do not get into a situation where the monitor thread needs to make a late allocation to resolve a symbol. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-07-01 17:28:14 -07:00
NeilBrown	b3b4e8a7a2	Avoid skipping devices where removing all faulty/detached devices. When using 0.90 metadata, devices can be renumbered when earlier devices are removed. So when iterating all devices looking for 'failed' or 'detached' devices, we need to re-check the same slot we checked last time to see if maybe it has a different device now. Reported-by: Jim Paris <jim@jtan.com> Resolves-Debian-Bug: 587550 Signed-off-by: NeilBrown <neilb@suse.de>	2010-06-30 17:20:38 +10:00
NeilBrown	7efa6bc34f	Update udev rules for hotplug support. - split the rules for handling components of array to be clearly separate from rules for handling the arrays themselves. - add call to "-If" when removing a device - uncomment the --incremental call when adding a device. Signed-off-by: NeilBrown <neilb@suse.de>	2010-06-30 16:55:17 +10:00
NeilBrown	29ba480497	Add -fail support to --incremental This can be used for hot-unplug. When a device has been remove, udev can call mdadm --incremental --fail sda and mdadm will find the array holding sda and remove sda from the array. Based on code from Doug Ledford <dledford@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-06-30 16:55:17 +10:00
NeilBrown	98d27e3964	Support fail/remove using kernel name Allow kernel names like "sda" and "hdb1" to be used to fail/remove devices from an array. This is useful as after a device has been removed it can be difficult to get the major/minor number. Signed-off-by: NeilBrown <neilb@suse.de>	2010-06-30 16:55:17 +10:00
NeilBrown	3b57c4661a	Add mdstat_by_component This allows finding the array which contains a given component. Components are named using the kernel-internal string name such as "sda1" or "hdb". Don't return member arrays, only the contain that contains them. Also tidy up the parsing of 'inactive' arrays in /proc/mdstat. If we see 'inactive' we need to set 'in_devs' immediately as there is no level coming. Signed-off-by: NeilBrown <neilb@suse.de>	2010-06-30 16:55:17 +10:00
NeilBrown	7e6140e6c6	Correct documentation for --rebuild-map In some places it is referred to as "--rebuild", and while that works due to getopt allowing prefixes, it could appear confusing (rebuild means other things too) and being explicit is some safeguard if we want to add e.g. --rebuild-foo later. Reported-by: Doug Ledford <dledford@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-06-30 16:52:54 +10:00
Jeff DeFouw	b6d7a7fbaa	Fix parsing of inactive arrays in /proc/mdstat They don't have a level, so we should not expect one, and should expect devices instead. Signed-off-by: NeilBrown <neilb@suse.de>	2010-06-29 16:42:48 +10:00
Dan Williams	aa534678ba	Rename subarray v2 Allow the name of the array stored in the metadata to be updated. In some cases the metadata format may not be able to support this rename without modifying the UUID. In these cases the request will be blocked. Otherwise we allow the rename to take place, even for active arrays. This assumes that the user understands the difference between the kernel node name, the device node symlink name, and the metadata specific name. Anticipating further need to modify subarrays in-place, introduce the ->update_subarray() superswitch method. A future potential use case is setting storage pool (spare-group) identifiers. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-22 16:30:59 -07:00
NeilBrown	ccaeea03a9	Create: fix typo in RAID10 default layout message. It reports "layout defaults to n1" but it means "... to n2". Reported-by: Adrian Sandor <aditsu@yahoo.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-06-17 20:17:10 +10:00
Dan Williams	b526e52dc7	Always assume SKIP_GONE_DEVS behaviour and kill the flag ...i.e. GET_DEVS == (GET_DEVS\|SKIP_GONE_DEVS) A null pointer dereference in Incremental.c can be triggered by replugging a disk while the old name is in use. When mdadm -I is called on the new disk we fail the call to sysfs_read(). I audited all the locations that use GET_DEVS and it appears they can tolerate missing a drive. So just make SKIP_GONE_DEVS the default behaviour. Also fix up remaining unchecked usages of the sysfs_read() return value. Reported-by: Dave Jiang <dave.jiang@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-16 17:26:04 -07:00
Dan Williams	6a0ee6a077	Remove 'checkpointing' side effect of --wait-clean Now that mdmon records periodic checkpoints, and checkpoints every ->set_array_state() event we no longer need to 'idle' sync_action from --wait-clean. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-15 18:41:57 -07:00
Dan Williams	4f0a7acc9a	mdmon: record sync_completed directly to the metadata When sync_action is idle mdmon takes the latest value of md/resync_start or md/<dev>/recovery_start to record the resync/rebuild checkpoint in the metadata. However, now that mdmon is reading sync_completed there is no longer a need to wait for, or force an idle event to take a checkpoint. Simply update the forward progress of ->last_checkpoint at every wakeup event and force it to be recorded at least every 1/16th array-size interval. It may be recorded more frequently if a ->set_array_state() event occurs. This also cleans up some confusion in handling the dual-rebuild case. If more than one spare has been activated the kernel starts the rebuild at the lowest recovery offset, so we do not need to worry about min_recovery_start(). Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-15 18:41:57 -07:00
Dan Williams	0d80bb2f97	imsm: dump each disk's view of the slot state Allow --examine to determine which disk might have a stale view of the per-disk out-of-sync state. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-15 18:41:57 -07:00
Dave Jiang	0bd16cf217	create: Check with OROM limit before setting default chunk size Make create check with the appropriate meta data handler and see what the largest chunk size is supported. The current 512K default is not supported by existing imsm OROM. [dan.j.williams@intel.com: trim the upper limit to 512k for future oroms] Signed-off-by: Dave Jiang <dave.jiang@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-15 18:41:53 -07:00
Dan Williams	33414a0182	Kill subarray v2 Support for deleting a subarray out of a container. When all subarrays are deleted the component devices are converted back into spares, a --zero-superblock is still needed to kill the remaining metadata at this point. This operation is blocked when the subarray is active and may also be blocked by the metadata handler when deleting the subarray might change the uuid of other active subarrays. For example, with imsm, deleting subarray 'n' may change the uuid of subarrays with indexes > n. Deleting a subarray needs to be a container wide event to ensure disks that record the modified subarray list perceive other disks that did not receive this change as out of date. Notes: The st->subarray parsing in super-intel.c and super-ddf.c is updated to be more strict now that we are reading user supplied subarray values. Offline container modification shares actions that mdmon typically handles so promote is_container_member() and version_to_superswitch() (formerly find_metadata_methods()) to generic utility functions for the cases where mdadm performs the operation. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-15 17:55:41 -07:00
NeilBrown	0017a23753	mdadm.conf: fix AUTO typo Reported-by: Marc Schiffbauer <marc@schiffbauer.net> Signed-off-by: Mike Frysinger <vapier@gentoo.org>	2010-06-05 08:02:11 +10:00
NeilBrown	fd547b508c	Fix man page reference to --level changes with --grow. --level can be used with --grow now, so correct man page. Reported-by: "Leslie Rhorer" <lrhorer@satx.rr.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-05-31 12:52:37 +10:00
martin f. krafft	26f467a954	Compile-time switch to enable 0.9 metadata as default This commit introduces DEFAULT_OLD_METADATA as a preprocessor definition. If defined, it causes mdadm to assume metadata version 0.9 as default. If not defined, version 1.x (currently 1.2) is used as default. The man page mdadm.8 is also modified to reflect the chosen default. The selftests will not work if the old default is chosen. This patch was requested by Debian so they could distribute a current mdadm together with boot loaders that only understand 0.90 metadata for md-raid. Preferred usage is simply make DEFAULT_OLD_METADATA=yes Signed-off-by: martin f. krafft <madduck@debian.org> Signed-off-by: NeilBrown <neilb@suse.de>	2010-05-31 12:52:37 +10:00
NeilBrown	5082750467	Revert change to handling of -empty-string- metadata. If the metadata is an empty string, it means the array in question does not use metadata. This comes from sysfs_read finding "none" in "metadata_version", then super_by_fd noticing the vers == -1, and so just using the ->text_version (which is empty). In this case we want to use the super0 metadata handler routines because that is what we always used to do before commit `7d5c3964cc` And that commit was wrong because "" doesn't mean "default" and so should not have been changed at the same time. Reported-by: martin f. krafft <madduck@debian.org> Signed-off-by: NeilBrown <neilb@suse.de>	2010-05-31 12:08:02 +10:00
NeilBrown	d492df0307	Merge commit '3288b419b988b20a53a2b12eb8e5f9f536228db4'; commit '4363fd80bcc9f85ed824228dee5e6350a8d73e18'; commit '63b4aae33ebf00d443378daf313622630f2336c0' * commit '3288b419b988b20a53a2b12eb8e5f9f536228db4': Revert "Incremental: honor --no-degraded to delay assembly" Incremental: honor an 'enough' flag from external handlers * commit '4363fd80bcc9f85ed824228dee5e6350a8d73e18': imsm: robustify recovery-start detection fix: memory leak in mdmon_pid() * commit '63b4aae33ebf00d443378daf313622630f2336c0': mdmon: fix missing open of md/<dev>/recovery_start	2010-05-31 11:34:14 +10:00
Dan Williams	4363fd80bc	imsm: robustify recovery-start detection update_recovery_start() assumed that the out-of-sync disk would always be marked as IMSM_ORD_REBUILD in the disk_ord_tbl, but the segmentation fault reported by Andy proves otherwise. This might also be explained by an interrupted rebuild and the disk has not yet been marked missing. https://bugzilla.redhat.com/show_bug.cgi?id=592030 Reported-by: Andy Lutomirski <luto@mit.edu> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-05-26 13:33:43 -07:00
Dan Williams	3288b419b9	Revert "Incremental: honor --no-degraded to delay assembly" This reverts commit `fdb482f99b`. Now that containers can report state for ->container_enough we can automatically determine when the array can be started, and no longer need the --no-degraded hammer. Conflicts: Incremental.c Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-05-26 13:25:47 -07:00
Dan Williams	97b4d0e971	Incremental: honor an 'enough' flag from external handlers This is needed for imsm where: 1/ we want to report raid_disks as zero to allow mdadm -As to incorporate all spares 2/ we can't determine stale disks by looking at the event counts. 3/ we can't see per-subarray expectations with the info returned from the container level ->getinfo_super() Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-05-26 13:22:36 -07:00
NeilBrown	4460f8f7c3	Monitor: don't report the disappearance of a faulty device as SpareActive. Normally Monitor doesn't see faulty devices in active slots - they get moved away too quickly. But if it does, it reports the "faulty device disappeared" event (when it finally does get moved away) as SpareActive due to insufficient checking. So add a better check. Reported-by: Pierre Vignéras <pierre@vigneras.name>	2010-05-18 12:31:29 +10:00
NeilBrown	c03ef02d92	Grow: move error message closer to error cause. A recent change move the sysfs_read call away from the check that it succeeded. This patch moves the check back next to the sysfs_read call. Signed-off-by: NeilBrown <neilb@suse.de>	2010-05-18 12:29:28 +10:00
Przemyslaw Hawrylewicz Czarnowski	10013317ce	fix: memory leak in mdmon_pid() devnum2devname() returns pointer to memory allocated with strdup. It must be released to prevent memory leak. Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-05-17 15:38:34 -07:00
Dan Williams	484240d8a3	mdmon: periodically checkpoint recovery The kernel updates and notifies md/sync_completed when it is time to take a checkpoint. When this occurs (at 1/16 array size intervals) write 'idle' to md/sync_action to have the current recovery position updated in recovery_start and resync_start. Requires the metadata handler to reset ->last_checkpoint when it has determined that recovery has ended. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-05-14 17:42:49 -07:00
Dan Williams	63b4aae33e	mdmon: fix missing open of md/<dev>/recovery_start When activating a spare we neglect to open recovery_start and as such do not see checkpoint events. Move disk initialization to common routine to mitigate recurrence. Reported-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-04-29 10:50:29 -07:00
NeilBrown	200871adf9	Grow: avoid overflow of chunk sizes. Chunks aren't particularly big, but when you could them in bytes and multiply them together (as we do for calculating the backup size for 'grow') they can overflow a 32bit int. So group the division by 512 more closely with the chunk size so were would need 30Meg chunks to come close to overflowing 32bits. Signed-off-by: NeilBrown <neilb@suse.de>	2010-04-29 16:14:30 +10:00
NeilBrown	691c6ee1b6	IMSM/DDF: don't recognised these metadata on partitions. These metadata are not expected on partitions, and they have no way of differentiation whether which is correct if they are found both on the device and on the last partition. So if the device is a partition, refuse to read the metadata. Signed-off-by: NeilBrown <neilb@suse.de>	2010-04-29 16:09:59 +10:00

... 4 5 6 7 8 ...

1559 Commits