mdadm

Author	SHA1	Message	Date
NeilBrown	4c8214543f	Create: report failure if array cannot be started. We weren't checking the result of writing 'active' to array_state Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-01 11:03:28 +11:00
NeilBrown	97c9c10014	ddf: fail creation of new subarray with same name as old. Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-01 09:55:35 +11:00
NeilBrown	f49208ec69	ddf: don't print warning on assemble Now that we check the error return of 'update_super' better, we much make sure that ddf doesn't incorrectly report that the superblocks are wrong during assemble. Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-01 09:47:21 +11:00
NeilBrown	ab2bb0b621	mdmon: don't copy an invalid chunk_size As chunk_size in mdstat_ent is never set, we shouldn't copy it into a->info.array. In fact, it is safest to get rid of the field altogether. Reported-by: "Kwolek, Adam" <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-30 18:35:36 +11:00
NeilBrown	484ae54d16	Assemble: call remove_partitions later. We shouldn't call remove_partitions until we have made a really firm decision to include the device into the array. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-30 16:56:01 +11:00
NeilBrown	5a31170d95	Assemble: add --update=no-bitmap This allows an array with a corrupt internal bitmap to be assembled without the bitmap. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-30 16:46:01 +11:00
NeilBrown	ff63406404	Grow: give useful message when adding bitmap gives EBUSY. If adding a bitmap fails with EBUSY, then it is because the array is currently resyncing/recovering/reshaping. As this is non-obvious, give a message explaining the fact. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-30 16:34:25 +11:00
NeilBrown	b3bd581b1d	Fix warning about host-endian bitmaps. Hostendian bitmaps should be warned about on all arch's. And fix a speeling mistake. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-30 16:25:26 +11:00
NeilBrown	36fad8ecb9	Allow K,M,G suffix on chunk sizes as well as device/array sizes. We already allow K,M,G suffixes for --size and --array-size. Allow it for --chunk and --bitmap-chunk as well. Also add this info to man page, and remove the duplication of info about --array-size. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-30 16:23:02 +11:00
Adam Kwolek	1c009fc218	Compute backup blocks in function. number of backup blocks evaluation is put in to function for code reuse. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-30 13:30:22 +11:00
Adam Kwolek	130994cb83	Prepare and free fdlist in functions fd handles table creation is put in to function for code reuse. In manage_reshape(), child_grow() function from Grow.c will be reused. To prepare parameters for this function, code from Grow.c can be reused also. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-30 13:27:08 +11:00
Adam Kwolek	8ba77d3281	imsm: Allow multiple spares to be collected. Assumption for spares searching was that after picking new device, it has to be added to array before next search. This causes returning different disk on each call. When creating a spare list during Online Capacity Expansion, we will first collect the devices list and then all devices are added to md. Picked device from spares pool has to be checked against picked devices so far. If not, the same disk will be returned all the time. Already picked devices are stored in the list and this list is used for new devices verification also. So add an extra arg to imsm_add_spare to hold a list of known spares to ignore. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-30 13:14:24 +11:00
Adam Kwolek	36988a3dda	imsm: FIX: core dump during imsm metadata writing Wrong number of disks during metadata update causes core dump. New disks number based on internal mdmon information has to used for calculation (not previously read from metadata). Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-29 12:53:16 +11:00
Adam Kwolek	28bce06f17	imsm: Add support for general migration Internal IMSM procedures need to support the General Migration. It is used during operations like: - Online Capacity Expansion, - migration initialization, - finishing migration, - apply changes to raid disks etc. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-29 12:28:01 +11:00
Adam Kwolek	6d11ec6fc2	Treat feature as experimental Due to fact that IMSM Windows compatibility was not tested yet, feature has to be treated as experimental until compatibility verification will be performed. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-29 12:11:09 +11:00
Adam Kwolek	62a48395f6	Disk removal support for Raid10->Raid0 takeover Until now Raid10->Raid0 takeover was possible only if all the mirrors where removed before md starts the takeover. Now mdadm, when performing Raid10->raid0 takeover, will remove all unwanted mirrors from the array before actual md takeover is called. Signed-off-by: Maciej Trela <maciej.trela@intel.com> Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-29 11:57:51 +11:00
NeilBrown	746a6567d3	Improve comments for block_monitor. Also not that the leading '-' on the metadata names now simply means that mdmon must not reconfiure the array. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-29 10:32:15 +11:00
Anna Czarnowska	ef15641fb5	Monitor: array that has disappeared doesn't need spares If a degraded array disappears we still have it in statelist with active<raid but it is pointless to look for spares for it. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-29 09:58:22 +11:00
Anna Czarnowska	a1bb206520	Monitor: fix writing autorebuild.pid If /var/run/mdadm doesn't exist we can never succeed writing so we should try to create it first. When we make sure it is there we write pid file as before. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-29 09:57:41 +11:00
Anna Czarnowska	24baa548c4	Monitor: reset dev when size too small Cc: linux-raid@vger.kernel.org, Williams, Dan J <dan.j.williams@intel.com>, Ciechanowski, Ed <ed.ciechanowski@intel.com> Otherwise spare will be considered good anyway. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-29 09:56:48 +11:00
Anna Czarnowska	0f0749ad93	Monitor: devid should be dev_t For consistency with makedev(). int is not sufficient. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-29 09:56:28 +11:00
Anna Czarnowska	ff044d6ba7	Monitor: few bug fixes for spare migration 1. If array not changed we should still report any degraded - another array may have a new spare that we can move. 2. Array with err=1 can't give a spare. 3. We look for spares in "from" not "st" which is supertype and has devname=NULL. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-29 09:51:27 +11:00
Anna Czarnowska	976915080e	Spare migration tests This is a series of tests checking if mdadm Monitor migrates spares according to rules in /etc/mdadm.conf defined by POLICY lines. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-29 09:43:29 +11:00
NeilBrown	de6ae75015	Incremental - avoid including wayward devices. If a devices - typically in a mirrored set - is assembled independently of the other devices, and then attempted to be brought back into the set, it could contain inconsistent data. It should not be included. So detect this situation by ensuring that the 'most recent' device is believed to be active by every other device. If a device is wayward, it will only consider fellow wayward devices to be active and will think all others are failed or missing. This patches fixes --incremental, --assemble was done in an earlier patch. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-29 09:40:15 +11:00
NeilBrown	1c7a808c4d	Improve opt parsing, and distinguish long from short. In several cases, two different long options map to the same short option. So e.g. you could give '--brief' and it would be interpreted as '--bitmap'. That isn't really good. So for every shared short option, define an option number and return that for the long option instead. Then always check for both the short and long options. Also give some bugs like " mode == 'G'" which should be '== GROW'. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-25 18:58:45 +11:00
NeilBrown	2bf740b394	Incr: reduce the number of times we load data from sysfs. Rather than calling sysfs_read whenever we want data from sysfs, call it once at the start will all the requests of interest, then just use that, Make sure we free it properly too. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-25 18:58:44 +11:00
NeilBrown	5739e0d007	Monitor: choose spare correctly for external metadata. When metadata is managed externally - probably as a container - we need to examine that metadata to see which devices are spares. So use the getinfo_super_disk message and use the info returned. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-25 18:58:27 +11:00
NeilBrown	0fa21e8522	Monitor: separate 'choose_spare' out from 'move_spare' choosing a spare from a container is more complicated that from a native array. So separate out choose_spare to make it easier to use an alternate implementation Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-25 18:37:23 +11:00
Dan Williams	7f2ba464e4	External reshape (step 2): Freeze container When growing the number of raid disks the reshape process will promote container-spares to subarray-spares (later the kernel promotes them to subarray-members in raid5_start_reshape()). The automatic spare promotion that mdmon performs upon seeing a degraded array must be disabled until the reshape process has been initiated. Otherwise, mdmon may start a rebuild before the reshape parameters can be specified. In the external case we arrange for the monitor to be blocked, and turn off the safemode delay. Mdmon is updated to check sync_action is not frozen before initiating recovery. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 16:39:58 +11:00
Dan Williams	7bc7119671	External reshape (step 1): container reshape and ->reshape_super() In the native metadata case Grow_reshape() and the kernel validate what reshapes are possible / supported and the kernel handles all the metadata updates. In the external case the metadata format may have specific constraints above this baseline. External formats also introduce the constraint of only permitting some reshapes at container scope versus subarray scope. For exmaple imsm changes to 'raiddisks' must be applied to all arrays in the container. This operation assumes that its 'st' parameter has been obtained from super_by_fd() (such that st->subarray is up to date), and that a snapshot of the metadata has been loaded from the container. Why a new method, versus extending an existing one? ->validate_geometry: this routine assumes it is being called from Create(), adding reshape complicates the cases that this routine needs to handle. Where we find that checks can be shared between the two cases those routines refactored into common code internal to the metadata handler, i.e. no need to provide a unified external interface. ->validate_geometry() also does not expect to update the metadata. ->update_super: this is meant to update single fields at Assembly() and only at the container scope. Reshape potentially wants to update multiple fields at either container or subarray scope. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 16:09:27 +11:00
Dan Williams	d54d79bdc4	Document the external reshape implementation Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 15:53:00 +11:00
Dan Williams	5f7e44b29f	Initialize st->devnum and st->container_dev in super_by_fd Precludes needing to deduce this information later, like in Detail.c and soon in Grow.c. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 15:31:18 +11:00
Dan Williams	30f58b2208	Create: cleanup/unify default geometry handling Support metadata specific level, layout and chunksize defaults. Kill an uneeded superswitch methods ahead of adding more for the reshape case. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 15:20:50 +11:00
Dan Williams	3e82d76d2e	fix a get_linux_version() comparison typo Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 15:17:48 +11:00
Dan Williams	72e4a37822	Grow: fix check for raid6 layout normalization If the user does not specify a layout, don't skip asking about retaining the non-standard raid6 layout which may be implicitly changed. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 15:11:37 +11:00
Dan Williams	dcc4210f58	Assemble: fix assembly in the delta_disks > max_degraded case Incremental assembly works on such an array because the kernel sees the disk as in-sync and that the array is reshaping. Teach Assemble() the same assumptions. This is only needed on kernels that do not initialize ->recovery_offset when activating spares for reshape. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 15:10:01 +11:00
Dan Williams	4411fb1749	Grow: mark some functions static Going through the Grow api found some local routines that could be marked static. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 15:08:42 +11:00
Dan Williams	9ea5a25217	Manage: allow manual control of external raid0 readonly flag mdadm --readwrite <subarray> will clear the external readonly flag ('-' to '/'), but only for redudant arrays. Allow raid0 arrays as well so the user has a simple helper to control this flag. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 15:08:19 +11:00
Dan Williams	bc77ed535d	block monitor: freeze spare assignment for external arrays In order to support reshape and atomic removal of spares from containers we need to prevent mdmon from activating spares. In the reshape case we additionally need to freeze sync_action while the reshape transaction is initiated with the kernel and recorded in the metadata. When reshaping a raid0 array we need to freeze the array before it is transitioned to a redundant raid level. Since sync_action does not exist at this point we extend the '-' prefix of a subarray string to flag mdmon not to activate spares. Mdadm needs to be reasonably certain that the version of mdmon in the system honors this 'freeze' indication. If mdmon is not already active then we assume the version that gets started is the same as the mdadm version. Otherwise, we check the version of mdmon as returned by the extended ping_monitor() operation. This is to catch cases where mdadm is upgraded in the filesystem, but mdmon started in the initramfs is from a previous release. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 15:00:54 +11:00
Dan Williams	e5408a3202	Provide a mdstat_ent to subarray helper ...before introducing another open coded instace of this conversion. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 14:44:23 +11:00
NeilBrown	87477e6d5e	Assemble: get content before testing it. When checking that a container matches the required uuid, we need to call 'getinfo_super' before we have a 'content' to test. Reported-by: "Czarnowska, Anna" <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 11:34:36 +11:00
NeilBrown	062dc4817d	Monitor: check spare group is non-NULL before adding to domain list ... otherwise we crash. Reported-by: "Labun, Marcin" <Marcin.Labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 11:11:45 +11:00
Marcin Labun	2cda7640f9	Policy is aware of metadata disk's controller domains. Platform (metadata) domain let the metadata handlers differentiate disk domains based on controllers that the disk belongs to. Platform domain is sub-domain inside user specified domain in mdadm.conf configuration files inheriting all parameters from it. The metadata domain name is used disk domain matching functions. The disk with the same metadata domain name belong to the same metadata domain. New metadata handler is added that retrieves platform domain string based on disk path: const char (get_disk_controller_domain)(const char *path); Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:07 +11:00
Anna Czarnowska	80e7f8c31a	Monitor: Allow metadata to set minimum size for spare to migrate in. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:07 +11:00
NeilBrown	66f5c4b665	Monitor: teach spare migration about containers When trying to move a spare, move to the container of a degraded array, not to the array itself. And don't try to move from a subarray, only from a native or container array. And don't move from a container which contains degraded subarrays. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:07 +11:00
NeilBrown	e78dda3bf5	Monitor: policy based spare migration. Rather than only migrating between arrays with the same spare_group, we now migrate based on domains set in the policy. In order for spare_group to continue to work, we treat it as a domain of the destination array, and a domain of any device we might remove from a source array. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:07 +11:00
NeilBrown	2feb22efbc	Monitor: split out check_donor Checking compatibility between arrays for spare migration is going to become a little more complicated, so split it out into a separate function. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:07 +11:00
NeilBrown	6d3d44d98c	Monitor: split out move_spare in spare migration. This is a simple refactoring with no functionality change. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:07 +11:00
NeilBrown	e0bd6a9637	Monior: create struct for holding alert info. Rather than passing mailaddr, mailfrom, cmd, dosyslog around in argument lists, create a structure to hold them all. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:07 +11:00
NeilBrown	9bfc6a7d1a	Monitor: use calloc rather than malloc calloc zeros the memory allocated, which is safer, particularly as we add more things to struct state. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:07 +11:00

... 4 5 6 7 8 ...

1717 Commits