mdadm

Author	SHA1	Message	Date
Adam Kwolek	70bdf0dcc3	imsm: move common code for array size calculation to function Array size calculation is made in the same way in few places in code. Make function imsm_set_device_size() for this common code. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-03 17:57:25 +11:00
Adam Kwolek	ed7333bd68	imsm: FIX: Debug strings cleanup Some debug strings remain as they were introduced, before code was moved to a separate function. The information displayed by debug output was not correct in all cases. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-03 17:57:10 +11:00
Adam Kwolek	d55adef98e	imsm: fix: imsm_num_data_members() can return error imsm_num_data_members() can indicate error by returning 0 value In such case size cannot be set based on 0 value. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-03 17:47:52 +11:00
Adam Kwolek	e154ced310	imsm: FIX: put expansion finalization in to one place When a->last_checkpoint variable can reach array end, reshape finalization can be put in to single place. There is no need to reset migration variables. imsm_set_disk() will call end_migration() and this sets all migration variables to required values. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-03 17:46:17 +11:00
Adam Kwolek	5e7b033066	imsm: FIX: crash during getting map When get_imsm_map() is called with second_map parameter == '-1' and array is not in migration state NULL pointer is returned. This is wrong. '-1' means return map as migration record points. '-1' can be passed to get_imsm_map() from imsm_num_data_members(). imsm_num_data_members() is called to get current map members based on migr_state information Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-03 17:02:39 +11:00
NeilBrown	71204a5029	Various compile fixes. Make "make everything" succeed. This fixed some real bugs. Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 15:48:03 +11:00
Adam Kwolek	1dfaa38015	imsm: FIX: map copying causes mdmon crash Too big a map was copied (outside allocated memory) and this causes an mdmon crash for 2 raid0 arrays in a container. A map of the correct (smaller) size should be copied, so as not to overwrite any internal data. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 10:40:56 +11:00
Adam Kwolek	401d313b7f	imsm: FIX: mdmon crash during 2 raid0 arrays expansion When expansion is run on 2 raid0 arrays in a container, no update is sent to mdmon because mdmon is off (mdadm performs the update). Memory for the first reshaped array is allocated to satisfy the memory requirements of the expanded maps. Memory for the second device is allocated using the old number of disks, as the metadata holds no information about this array's reshape. When mdmon initiates the second array reshape it overwrites internal structures and crashes. There is no place to keep the expanded maps. To avoid this situation, memory allocated while loading metadata should be sized using the maximum number of disks used in the particular container. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 10:31:27 +11:00
Adam Kwolek	820eb8dba7	imsm: Update metadata for second array When second array reshape is about to start external metadata should be updated by mdmon in imsm_set_array_state(). For this purposes imsm_progress_container_reshape() is reused. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 10:31:06 +11:00
Adam Kwolek	d098291aec	imsm:FIX: change arrays reshape order Reshape is started from second array, so it causes imsm incompatibility and problems during second array start. Reshape should be started in arrays metadata order. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-02-01 10:17:06 +11:00
Adam Kwolek	cb82edca14	imsm: FIX: not all disks are released in free_imsm_disks() Adding spare disks to imsm container fails due to problem with writing new_dev to sysfs. This problem was caused by not closed handle (opened exclusively) in Manage.c:803. Disk handle was not closed by free_imsm(). This is due to not released disk_mgmt_list in free_imsm_disks(). Proper release of imsm metadata allows for spare adding without problems. Memory leak was fixed also. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 11:06:42 +11:00
Adam Kwolek	d7d205bd25	imsm: FIX: do not allow for container operation for the same disks number imsm_reshape_super() currently allows for expansion when requested raid_disks number is the same as current. This is wrong. Existing in code condition is too weak. We should allow for expansion when new disks_number is greater than current only. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 10:25:26 +10:00
Krzysztof Wojcik	dfe77a9ed2	Add raid1->raid0 takeover support Add support for raid1 to raid0 takeover operation in user space. This patch includes support for native and imsm metadata. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 12:47:15 +10:00
Labun Marcin	96234762a6	imsm: support for Intel SAS controller in get_disk_controller_domain handler get_disk_controller_domain recognizes Intel (R) SAS controller (isci). The function returns three different strings that differentiate disk attached to AHCI, ISCI or unknown controller types to create separate domains for each case. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 11:22:34 +10:00
Labun Marcin	155cbb4c2c	imsm: detail_platform_imsm supports Intel SAS controller (isci driver) Added support in detail_platform_imsm for Intel (R) SAS controller. Function supports AHCI and ISCI controllers. RAID properties are derived from common OROM for both types. Based on code From: Artur Wojcik <artur.wojcik@intel.com> Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 11:22:07 +10:00
Labun Marcin	120dc88745	imsm: prepare detail_platform_imsm to support different types of controllers Pull out the AHCI specific parts of detail_platform_imsm to separate functions. Introduce support new types of controllers. Based on code From: Artur Wojcik <artur.wojcik@intel.com> Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 11:21:29 +10:00
Labun Marcin	88654014d4	imsm: support for Intel(R) SAS controller in imsm handler add_to_super_imsm handler is able to recognize new type of controller. It stores the controller information in its structures and blocks mixing of different controller type in the same container. In this way it maintains compatibility between Linux and Windows IMSM RAID stacks. IMSM metadata does not allow arrays to span on devices attached to different storage controllers. Based on code From: Artur Wojcik <artur.wojcik@intel.com> Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 11:09:57 +10:00
Krzysztof Wojcik	43d5ec1844	Check number of failed disks during raid10->raid0 takeover Number of failed disks MUST be half of initial number of disks. If number of failed disks is different we should not update metadata; data corruption may occur after array reassembly. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 10:44:59 +10:00
Krzysztof Wojcik	8ca6df95a2	raid0->raid10 takeover- process metadata update Implementation of raid0->raid10 takeover metadata update at process_update level. - We are using memory previously allocated in prepare_update to create two dummy disks will be inserted in the metadata and new imsm_dev structure with expanded disk order table. - Update indexes in disk list - Update metadata map - Update disk order table Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 10:41:27 +10:00
Krzysztof Wojcik	abedf5fc46	raid0->raid10 takeover- allocate memory for added disks Allocate memory will be used in process_update. For raid0->raid10 takeover operation number of disks doubles so we should allocate memory for additional disks and one imsm_dev structure with extended order table. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 10:38:10 +10:00
Krzysztof Wojcik	0529c688e8	raid0->raid10 takeover- create metadata update Create metadata update for raid0 -> raid10 takeover. Because we have no mdmon running for raid0 we have to update metadata using local update mechanism Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 10:38:00 +10:00
Krzysztof Wojcik	bb025c2f22	Add raid10 -> raid0 takeover support The patch introduces takeover from level 10 to level 0 for imsm metadata. This patch contains procedures connected with preparing and applying metadata update during 10 -> 0 takeover. When performing takeover 10->0 mdmon should update the external metadata (due to disk slot and level changes). To achieve that mdadm calls reshape_super() and prepare the "update_takeover" metadata update type. Prepared update is processed by mdmon in process_update(). Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 08:50:37 +10:00
NeilBrown	1cc7f4feb9	Don't close fds in write_init_super We previously closed all 'fds' associated with an array in write_init_super .. sometimes, and sometimes at bad times. This isn't neat and free_super is a better place to close them. So make sure free_super always closes the fds that the metadata manager kept hold of, and stop closing them in write_init_super. Also add a few more calls to free_super to make sure they really do get closed. Reported-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-25 07:56:53 +11:00
Krzysztof Wojcik	471bceb681	Define imsm_analyze_change function Function intended to use for single volume migration. Function analyze transition and validate if it is supported. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-17 12:52:36 +11:00
Krzysztof Wojcik	694575e786	reshape_super reorganization Function has been divided into two clear parts: 1. Container operations 2. Volume operations Prototype of imsm_analyze_change function has been added. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-13 13:02:44 +11:00
Adam Kwolek	04c3c51413	imsm: FIX: spares are not counted Field info->array.spare_disks is used on begin of reshape_array() to check if there is enough number of spares to process reshape. During container_content_imsm() call spare disks are not counted. This causes that reshape_array() reports that there is not enough spares to execute reshape. Patch adds spares counting for reshape process. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-13 10:16:07 +11:00
Adam Kwolek	819bc6345e	imsm: FIX: old devices memory has to be released When process_update() replaces memory for bigger devices, old memory areas are collected in a list and has to be assigned in to pointer in update for later release. List created from old devices is created and attached to space_list for later releasing. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-13 10:06:29 +11:00
Adam Kwolek	f557efbb3f	imsm: FIX: local mdadm update shouldn't be done in update creation function. Local update is performed based on the created update, so this code can break the local update, and it is not necessary as the prepare and process update functions are used. Code removed. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-13 10:05:31 +11:00
Adam Kwolek	8dd70bce5b	imsm: FIX: mdadm should process local data When an update is created by mdadm, local information should be updated also. This makes us prepare one update for mdmon and a second "update" to maintain local changes. We can use the prepared update for "local/mdadm" metadata update purposes. We have 2 cases: 1. when metadata is updated by mdmon, we avoid metadata reloading in mdadm. We process the same update 2 times: - one time in mdadm for "local update" - second time in mdmon for real metadata update 2. when metadata is updated by mdadm (no mdmon running) updates are processed in the same way. - one time in mdadm for "local update" - there is no "second time" update but mdadm just flushes metadata to array This lets us avoid code duplication by using the prepare and process update functions as for update via mdmon. This makes update preparation independent of mdmon and there is no need to maintain the same thing in 2 places in code. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-13 10:04:33 +11:00
Adam Kwolek	bbd24d8616	imsm: FIX: only one spare is passed in update Only one spare is passed in update. When more than one disk is added first spare is passed multiple times. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-12 16:46:38 +11:00
Adam Kwolek	ee4beede22	imsm: FIX: set correct slot information in metadata (raid0) Slot was set based on anchor information. Disks information was copied outside disk list area. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-12 16:40:42 +11:00
NeilBrown	4a011f1009	load_super should not try to load_container Now that load_container is a separate operation, load_super should not try it first. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-12 16:18:04 +11:00
Adam Kwolek	86e3692b06	imsm: FIX: update disks status in container_contents() Based on status information disks are added to array during grow (in reshape_array()). This information currently is not present and all disks (old and new) were added to md. To avoid adding already present disks, disk.state has to be set. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-12 15:12:44 +11:00
NeilBrown	999b497251	Make child_monitor a candidate for ->manage_reshape Child_monitor was designed to perform 'manage_reshape' for native arrays. So change the signature for ->manage_reshape to match child_monitor and move the call to the same place that child_monitor is called from. Also give super-intel a manage_reshape handler which simply calls child_monitor. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-12 14:46:17 +11:00
Adam Kwolek	89c6788213	imsm: FIX: do not repair raid4 arrays As raid4 is not supported by imsm (it is a raid0 after takeover), do not fix degraded raid4 arrays. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-06 19:20:25 +11:00
Adam Kwolek	ed08d51c1a	imsm: Update raid0 metadata for reshape When raid0 reshape is performed metadata has to be applied by mdadm. (without mdmon) Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-06 19:14:24 +11:00
NeilBrown	2e5dc01050	imsm: Move reshape update processing to function For code reuse in raid0 reshape case when monitor is not loaded. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-06 19:10:04 +11:00
Adam Kwolek	0e2d1a4e68	imsm: Update metadata for second array When second array reshape is about to start metadata should be updated by mdmon in imsm_set_array_state(). Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-06 18:29:23 +11:00
Adam Kwolek	ef83fa1a7c	imsm: update array size information in metadata When disks are added size has to increase in metadata. This size should be used by common code to set size in md when reshape will be finished. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-06 18:29:15 +11:00
Adam Kwolek	6345120e38	imsm: FIX: Division by 0 For general migration function blocks_per_migr_unit() has to return valid value. If there is no valid return, 0 is returned instead and causes division by 0 error. Additionally guard in function was added for such case. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-06 18:29:11 +11:00
Adam Kwolek	a4546b6189	imsm: Finalize reshape in metadata When reshape is finished monitor calls set_array_state() and finishes migration in metadata. This change allows for finishing metadata migration on reshape end. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-06 18:29:07 +11:00
Adam Kwolek	b335e59305	imsm: FIX: support general migration by getinfo_super_imsm_volume Add support for reading volume information during migration process. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-06 18:28:56 +11:00
Adam Kwolek	834acc0bd2	imsm: FIX: update first array in container only During first metadata update imsm for compatibility reason should update only one array. Buffers in prepare_update() are prepared for second update as well. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-06 18:26:59 +11:00
Adam Kwolek	d677e0b8ec	imsm: FIX: Perform first metadata update for container operation Meta data was not updated due to the following problems: 1.disk index < 0 was treated as invalid, but this is spare device 2. disk index greater than currently used disks is correct also 3. newmap pointer has to be refreshed for second map copy operation 4. size calculation has to be guarded for shrinking operation Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-06 16:24:07 +11:00
Adam Kwolek	690aae1ae5	imsm: FIX: display error message When container operation is not allowed user has to get proper information on console about it Currently this information was displayed as debug info only. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-06 16:22:01 +11:00
Adam Kwolek	dd8bcb3b69	imsm: FIX: display correct information for '-E' option Correct information displayed by '-E' option. 1. FIX: Slot information during raid0 migration is displayed incorrectly (missing disk position is taken from wrong map) 2. Improvement: information about (level, members, chunk size) migration is displayed. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-06 16:11:35 +11:00
Adam Kwolek	98130f4013	mdadm: second_map enhancement for imsm_get_map() Allow map related operations for the given map: first or second. For reshape specific functionality it is required to have such access. Until now, the active map was chosen according to the current volume status. Signed-off-by: Maciej Trela <maciej.trela@intel.com> Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-06 16:08:04 +11:00
Anna Czarnowska	326727d9c9	Use one function choosing spares from container container_chose_spares in Monitor.c and get_spares_for_grow in super-intel.c do the same thing: search for spares in a container. Another version will also be needed for Incremental so a more general solution is presented here and applied in two previous contexts. Normally domlist==NULL would lead to an empty list but this is typically checked earlier so here it is interpreted as "do not test domains". Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-05 14:34:14 +11:00
Anna Czarnowska	22e263f64a	imsm: set imsm spare uuid to 0 uuid_match_any is replaced by uuid_zero for imsm spares. Function fixup_container_spare_uuid not needed as it gives unwanted uuid to spares. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-26 21:59:31 +11:00
Krzysztof Wojcik	a06d022db4	FIX: Bad block verification during assembling array We need to refuse to assemble arrays with bad blocks. Initially there was a condition in the container_content function that returned an error value when the metadata stores information about bad blocks. When the container_content function is called from functions NOT connected with assemble (Kill_subarray, Detail) we get a faulty error return value. Patch introduces new flag in array.status - MD_SB_BBM_ERRORS. It is set in container_content when bad blocks are detected and can be checked by container_content caller. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-26 21:41:57 +11:00
Adam Kwolek	81ac8b4d56	imsm: Fill delta_disks field in getinfo_super() delta_disks field is not always filled during getinfo_super() call. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-16 15:55:40 +11:00
Adam Kwolek	4c9bc37b97	imsm: Do not indicate resync during reshape If reshape is started resync is not allowed in parallel. This would break reshape. If array is in General Migration state do not indicate resync and allow for reshape continuation. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-16 15:48:27 +11:00
NeilBrown	aad6f216a1	Handle checkpointing during reshape We need to allow metadata to handle progress of reshape, completion, and abort-before-start. Include all those in ->set_array_state() Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-16 15:48:05 +11:00
Adam Kwolek	1af97990a6	imsm: Block array state change during reshape Array state change is blocked while a reshape action is in progress and metadata changes are being applied. '1' is returned to indicate that the array is clean. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-16 13:17:47 +11:00
Adam Kwolek	d195167d9c	imsm: Process reshape_update in mdmon For this update prepare_update() allocates memory to relink imsm (bigger) device imsm structures. It calculates new /bigger/ anchor size. Process update applies update in to imsm structures. This includes - converting selected spares into configured devices - marking the arrays as migrating - making a new 'map' for each array with the changed details. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-16 13:17:45 +11:00
NeilBrown	cb23f1f4c3	Allow a metadata update to have a linked list of allocated spaces. Sometimes one metadata update will require allocating several larger data structures. As 'monitor' cannot allocate, 'manager' must, so it must be able to attach a list of allocates to the update, and importantly it must be able to easily free them. So add a 'space_list' element to metadata updates where each element on the list starts with a pointer to the next. Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-16 12:10:01 +11:00
NeilBrown	78b10e663c	imsm: Prepare reshape_update in mdadm During Online Capacity Expansion metadata has to be updated to show array changes and allow for future assembly of array. To do this mdadm prepares and sends reshape_update metadata update to mdmon. The update contains the old and new number of raid disks, and the indices of the spare disks that will be used to fill the spaces. This works as follows: 1. reshape_super() prepares metadata update. 2. mdadm discovers the spares and adds them to the array 3. mdadm sends the update to mdmon 4. managemon in prepare_update() allocates required memory for bigger device object 5. monitor in process_update() updates the metadata to record the new sizes and the newly assigned devices. 6. mdadm initiates the reshape Based on code From: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-16 11:45:21 +11:00
NeilBrown	94827db3b3	imsm: add spares to --examine output. When we examine a container, list the spare devices as well as the active devices. Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-16 11:33:23 +11:00
Adam Kwolek	6c93202898	imsm: FIX: imsm_add_spare() wrongly tests spares list For more than one disk tested additional_test_list was searched from last point, not from begin. This bug causes that more than 2 disks cannot be added to imsm array, when imsm_add_spare() is used for this. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-16 09:03:03 +11:00
Labun, Marcin	95d07a2cdd	IMSM: do not rebuild the array if a non-redundant sub-array with failed disks is present Before looking for a spare to rebuild a degraded array, check if there are any failed disks in container. Block rebuild if another sub-array is failed until failed disks are removed from container. Currently, Intel metadata handler rebuilds all sub-arrays even if one of them is non-redundant. In case of failed sub-array, failed disks are just replaced with new ones in the metadata mapping. The data for failed disk is not restored even if the disk is present in the system. With this fix, we require the removal of the failed disk from container to start the process of rebuilding the array with failed member. If the disk is physically pulled out of the system, the disk is removed from container automatically by existing udev rules. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-15 15:51:53 +11:00
Labun, Marcin	1a64be565b	IMSM: Fix problem in mdmon monitor of using removed disk in imsm container. Manager thread shall pass the information to monitor thread (mdmon) that some devices are removed from container. Otherwise, monitor (mdmon) might use such devices (spares) to rebuild the array that has gone degraded. This problem happens for imsm containers, since a list of the container disks is maintained in intel_super structure. When array goes degraded, the list is searched to find a spare disk to start rebuild. Without this fix the rebuild could be started on the spare device that was a member of the container, but has been removed from it. New super type function handler has been introduced to prepare metadata format specific information about removed devices. int (*remove_from_super)(struct supertype *st, mdu_disk_info_t *dinfo) The message prepared in remove_from_super is later processed by process_update handler in monitor thread. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-15 15:51:51 +11:00
NeilBrown	1d54f2867b	Merge branch 'master' into devel-3.2 Conflicts: super-intel.c	2010-12-13 14:00:05 +11:00
Luca Berra	a2973b6af2	segfault in imsm create with wrong arguments When calling mdadm -C --metadata=imsm -l 1 /dev/sd.. mdadm segfaults in default_chunk_imsm() above syntax is incorrect, but mdadm should error instead of segfaulting Signed-off-by: Luca Berra <bluca@comedia.it> Signed-off-by: NeilBrown <neilb@suse.de>	2010-12-13 13:51:07 +11:00
Adam Kwolek	8ba77d3281	imsm: Allow multiple spares to be collected. Assumption for spares searching was that after picking new device, it has to be added to array before next search. This causes returning different disk on each call. When creating a spare list during Online Capacity Expansion, we will first collect the devices list and then all devices are added to md. Picked device from spares pool has to be checked against picked devices so far. If not, the same disk will be returned all the time. Already picked devices are stored in the list and this list is used for new devices verification also. So add an extra arg to imsm_add_spare to hold a list of known spares to ignore. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-30 13:14:24 +11:00
Adam Kwolek	36988a3dda	imsm: FIX: core dump during imsm metadata writing Wrong number of disks during metadata update causes core dump. New disks number based on internal mdmon information has to be used for calculation (not the one previously read from metadata). Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-29 12:53:16 +11:00
Adam Kwolek	28bce06f17	imsm: Add support for general migration Internal IMSM procedures need to support the General Migration. It is used during operations like: - Online Capacity Expansion, - migration initialization, - finishing migration, - apply changes to raid disks etc. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-29 12:28:01 +11:00
Dan Williams	30f58b2208	Create: cleanup/unify default geometry handling Support metadata specific level, layout and chunksize defaults. Kill an unneeded superswitch method ahead of adding more for the reshape case. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-23 15:20:50 +11:00
Marcin Labun	2cda7640f9	Policy is aware of metadata disk's controller domains. Platform (metadata) domain lets the metadata handlers differentiate disk domains based on controllers that the disk belongs to. Platform domain is a sub-domain inside the user specified domain in mdadm.conf configuration files, inheriting all parameters from it. The metadata domain name is used by disk domain matching functions. Disks with the same metadata domain name belong to the same metadata domain. New metadata handler is added that retrieves platform domain string based on disk path: const char *(*get_disk_controller_domain)(const char *path); Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:07 +11:00
Anna Czarnowska	80e7f8c31a	Monitor: Allow metadata to set minimum size for spare to migrate in. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:07 +11:00
Anna Czarnowska	5c4cd5da70	imsm: create mdinfo list of disks in a container from supertype If getinfo_super is called on a container supertype we only get information on first disk. As a parameter it uses reference to preallocated mdinfo structure. Amending getinfo_super to return full list of disks would require amending all previous calls and subsequently freeing memory allocated for mdinfo list. Function container_content that returns a mdinfo list is written specifically for assembly, performing actions not needed to just fill mdinfo. It also does not include spares so is unsuitable. As an alternative a new function getinfo_super_disks is created to obtain information about all disks states in array. Existing function sysfs_free is used to free memory allocated by getinfo_super_disks. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:07 +11:00
NeilBrown	157e6e24b9	Remove loaded_container This field is now only set, never used. So remove it. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:06 +11:00
NeilBrown	ab3cb6b3b7	imsm: always calculate container_enough in getinfo_super_imsm We are about to lose the loaded_container field, and we don't really need to use it to protect the calculation of container_enough. So drop the test. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:06 +11:00
NeilBrown	fa56eddbd1	Improve mddev_ident type definitions. Remove the _t typedef and remove the _s suffix from the struct name. These things do not help readability. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:58:05 +11:00
NeilBrown	2b959fbf66	New method: load_container This handles the 'container' part of 'load_super', so we can soon make them completely separate - it is just confusing to overload these two. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:24:50 +11:00
NeilBrown	e1902a7b6c	Remove keep_fd arg from load_super_XXX_all It is always set to 1, so we don't need it. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:24:50 +11:00
NeilBrown	69b2fcc5bb	Remove subarray field in supertype. This is now only ever set, never used. So remove it. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:24:50 +11:00
NeilBrown	d1d599ea0d	Create: user container_dev rather than subarray for some tests. It makes more sense to test for container_dev than for subarray for several places in Create where it then uses container_dev. This allows us to subsequently remove subarray. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:24:50 +11:00
NeilBrown	e32bd33f44	Remove subarray detection from load_super. Nothing relies on this any more, so remove it. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:24:50 +11:00
NeilBrown	a951a4f78f	Pass subarray arg explicitly to ->update_subarray. This is better than hiding it in the supertype structure where we are never quite sure who needs it. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 20:24:50 +11:00
NeilBrown	00bbdbdac6	Add subarray arg to container_content. This allows the info for a single array to be extracted, so we don't have to write it into st->subarray. For consistency, implement container_content for super0 and super1, to just return the mdinfo for the single array. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 19:35:26 +11:00
NeilBrown	a5d85af748	get_info_super: report which other devices are thought to be working/failed. To accurately detect when an array has been split and is now being recombined, we need to track which other devices each thinks is working. We should never include a device in an array if it thinks that the primary device has failed. This patch just allows get_info_super to return a list of devices and whether they are thought to be working or not. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 19:35:25 +11:00
NeilBrown	1e2b276535	Report error if --update string is not recognised. If an --update is requested but the relevant metadata doesn't understand it, print a useful message rather than silently ignoring the issue. Signed-off-by: NeilBrown <neilb@suse.de>	2010-11-22 19:35:24 +11:00
NeilBrown	64436f0628	intel: Don't try to read from tiny devices. If a device is less than 1K, avoid even trying to seek to 1K before the end. The seek will fail anyway so this is a fairly cosmetic fix. Signed-off-by: NeilBrown <neilb@suse.de>	2010-09-06 11:26:28 +10:00
NeilBrown	cdbe98cd54	Fix compiler warning concerning bad use of snprintf. Signed-off-by: NeilBrown <neilb@suse.de> Reported-by: Mikael Abrahamsson <swmike@swm.pp.se>	2010-08-06 20:10:48 +10:00
NeilBrown	f21e18ca89	Compile with -Wextra by default This produced lots of warning, some of which pointed to actual bugs. Signed-off-by: NeilBrown <neilb@suse.de>	2010-08-05 13:13:02 +10:00
Dan Williams	569cc43ffb	imsm: fix a -O2 build warning super-intel.c: In function ‘imsm_add_spare’: super-intel.c:4833: error: ‘array_start’ may be used uninitialized in this function super-intel.c:4834: error: ‘array_end’ may be used uninitialized in this function This is valid, if we don't find a spare candidate then array_{start,end} will be uninitialized. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-07-06 12:48:59 -07:00
Dan Williams	d19e3cfb66	Merge branch 'fixes' into for-neil	2010-07-01 17:36:11 -07:00
Dan Williams	8cfc801c72	Merge branch 'subarray' into for-neil Conflicts: mdadm.h super-intel.c	2010-07-01 17:36:05 -07:00
Dan Williams	aa534678ba	Rename subarray v2 Allow the name of the array stored in the metadata to be updated. In some cases the metadata format may not be able to support this rename without modifying the UUID. In these cases the request will be blocked. Otherwise we allow the rename to take place, even for active arrays. This assumes that the user understands the difference between the kernel node name, the device node symlink name, and the metadata specific name. Anticipating further need to modify subarrays in-place, introduce the ->update_subarray() superswitch method. A future potential use case is setting storage pool (spare-group) identifiers. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-22 16:30:59 -07:00
Dan Williams	b526e52dc7	Always assume SKIP_GONE_DEVS behaviour and kill the flag ...i.e. GET_DEVS == (GET_DEVS|SKIP_GONE_DEVS) A null pointer dereference in Incremental.c can be triggered by replugging a disk while the old name is in use. When mdadm -I is called on the new disk we fail the call to sysfs_read(). I audited all the locations that use GET_DEVS and it appears they can tolerate missing a drive. So just make SKIP_GONE_DEVS the default behaviour. Also fix up remaining unchecked usages of the sysfs_read() return value. Reported-by: Dave Jiang <dave.jiang@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-16 17:26:04 -07:00
Dan Williams	4f0a7acc9a	mdmon: record sync_completed directly to the metadata When sync_action is idle mdmon takes the latest value of md/resync_start or md/<dev>/recovery_start to record the resync/rebuild checkpoint in the metadata. However, now that mdmon is reading sync_completed there is no longer a need to wait for, or force an idle event to take a checkpoint. Simply update the forward progress of ->last_checkpoint at every wakeup event and force it to be recorded at least every 1/16th array-size interval. It may be recorded more frequently if a ->set_array_state() event occurs. This also cleans up some confusion in handling the dual-rebuild case. If more than one spare has been activated the kernel starts the rebuild at the lowest recovery offset, so we do not need to worry about min_recovery_start(). Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-15 18:41:57 -07:00
Dan Williams	0d80bb2f97	imsm: dump each disk's view of the slot state Allow --examine to determine which disk might have a stale view of the per-disk out-of-sync state. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-15 18:41:57 -07:00
Dave Jiang	0bd16cf217	create: Check with OROM limit before setting default chunk size Make create check with the appropriate meta data handler and see what the largest chunk size is supported. The current 512K default is not supported by existing imsm OROM. [dan.j.williams@intel.com: trim the upper limit to 512k for future oroms] Signed-off-by: Dave Jiang <dave.jiang@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-15 18:41:53 -07:00
Dan Williams	33414a0182	Kill subarray v2 Support for deleting a subarray out of a container. When all subarrays are deleted the component devices are converted back into spares, a --zero-superblock is still needed to kill the remaining metadata at this point. This operation is blocked when the subarray is active and may also be blocked by the metadata handler when deleting the subarray might change the uuid of other active subarrays. For example, with imsm, deleting subarray 'n' may change the uuid of subarrays with indexes > n. Deleting a subarray needs to be a container wide event to ensure disks that record the modified subarray list perceive other disks that did not receive this change as out of date. Notes: The st->subarray parsing in super-intel.c and super-ddf.c is updated to be more strict now that we are reading user supplied subarray values. Offline container modification shares actions that mdmon typically handles so promote is_container_member() and version_to_superswitch() (formerly find_metadata_methods()) to generic utility functions for the cases where mdadm performs the operation. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-06-15 17:55:41 -07:00
NeilBrown	d492df0307	Merge commit '3288b419b988b20a53a2b12eb8e5f9f536228db4'; commit '4363fd80bcc9f85ed824228dee5e6350a8d73e18'; commit '63b4aae33ebf00d443378daf313622630f2336c0' * commit '3288b419b988b20a53a2b12eb8e5f9f536228db4': Revert "Incremental: honor --no-degraded to delay assembly" Incremental: honor an 'enough' flag from external handlers * commit '4363fd80bcc9f85ed824228dee5e6350a8d73e18': imsm: robustify recovery-start detection fix: memory leak in mdmon_pid() * commit '63b4aae33ebf00d443378daf313622630f2336c0': mdmon: fix missing open of md/<dev>/recovery_start	2010-05-31 11:34:14 +10:00
Dan Williams	4363fd80bc	imsm: robustify recovery-start detection update_recovery_start() assumed that the out-of-sync disk would always be marked as IMSM_ORD_REBUILD in the disk_ord_tbl, but the segmentation fault reported by Andy proves otherwise. This might also be explained by an interrupted rebuild and the disk has not yet been marked missing. https://bugzilla.redhat.com/show_bug.cgi?id=592030 Reported-by: Andy Lutomirski <luto@mit.edu> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-05-26 13:33:43 -07:00
Dan Williams	97b4d0e971	Incremental: honor an 'enough' flag from external handlers This is needed for imsm where: 1/ we want to report raid_disks as zero to allow mdadm -As to incorporate all spares 2/ we can't determine stale disks by looking at the event counts. 3/ we can't see per-subarray expectations with the info returned from the container level ->getinfo_super() Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-05-26 13:22:36 -07:00
Dan Williams	484240d8a3	mdmon: periodically checkpoint recovery The kernel updates and notifies md/sync_completed when it is time to take a checkpoint. When this occurs (at 1/16 array size intervals) write 'idle' to md/sync_action to have the current recovery position updated in recovery_start and resync_start. Requires the metadata handler to reset ->last_checkpoint when it has determined that recovery has ended. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-05-14 17:42:49 -07:00
NeilBrown	691c6ee1b6	IMSM/DDF: don't recognise these metadata on partitions. These metadata are not expected on partitions, and they have no way of differentiating which is correct if they are found both on the device and on the last partition. So if the device is a partition, refuse to read the metadata. Signed-off-by: NeilBrown <neilb@suse.de>	2010-04-29 16:09:59 +10:00
Dan Williams	4eb269706f	Create: cleanup after failed create in duplicated array member case mdadm prevents creation when device names are duplicated on the command line, but leaves the partially created array intact. Detect this case in the error code from add_to_super() and cleanup the partially created array. The imsm handler is updated to report this conflict in add_to_super_imsm_volume(). Note that since neither mdmon, nor userspace for that matter, ever saw an active array we only need to perform a subset of the cleanup actions. So call ioctl(STOP_ARRAY) directly and arrange for Create() to cleanup the map file rather than calling Manage_runstop(). Reported-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2010-04-19 15:28:07 +10:00

385 Commits