mdadm

Commit Graph

Author	SHA1	Message	Date
NeilBrown	e809000535	super1: fix regression in write_init_super. Now that a 'supertype' container more information, the simplistic copying of 'st' into 'refst' is incorrect and results in closing some fds when load_super1(refst) calls free_super(). So do it more correctly using dup_super. Reported-by: "Labun, Marcin" <Marcin.Labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 11:33:18 +11:00
Adam Kwolek	cb82edca14	imsm: FIX: not all disks are released in free_imsm_disks() Adding spare disks to imsm container fails due to problem with writing new_dev to sysfs. This problem was caused by not closed handle (opened exclusively) in Manage.c:803. Disk handle was not closed by free_imsm(). This is due to not released disk_mgmt_list in free_imsm_disks(). Proper release of imsm metadata allows for spare adding without problems. Memory leak was fixed also. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-31 11:06:42 +11:00
Czarnowska, Anna	a1e49d6956	Monitor: avoid adding too many spares to container Tests revealed that sometimes there are still more spares taken than needed. The reason for this is that after adding one spare to container with degraded subarray if between ioctl in main loop and load_container in try_spare_migration mdmon activates the spare we see active<raid but find no spares in parent container and so add an extra spare. To prevent such behaviour we count active disks in the list returned by getinfo_super_disks and compare it with subarray->active. If the number has increased it means new spare was added and activated so there is no need for more. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 11:18:57 +10:00
Krzysztof Wojcik	24aebf3add	FIX: Meet SET_ARRAY_INFO ioctl requirements Problem has been observed when raid10<->raid0 takeover operation is executed. In code updating layout, raid_disks and chunk_size for non-restriping operations in reshape array functions SET_ARRAY_INFO ioctl call was not succeeded. Takeover process finish execution with error, mdadm shows message: "mdadm: failed to set disks" Cause is not meeting SET_ARRAY_INFO ioctl requirements: - only one parameter may be changed at one time - level of current array info and new info should be the same Patch introduces solution for this issue. At the beginning of discussed code we read current information about array and then compare them with new values should be set. If particular value is different (and should be set), we are overwrite only this one in array info and then call ioctl. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 11:03:12 +10:00
Krzysztof Wojcik	d5ca4a23fc	FIX: Remove disks in mdmon for external metadata For raid10 -> raid0 takeover operation we should reject disks in mirror by marking them as 'failed' and then remove them from array by writing "remove" to disk state. For external metadata second action is executed by mdmon. According the description in monitor.c:175 when monitor detect "faulty" in device state, it blocks the device, mark it as failed in metadata, unblocks the device and finally writes "remove" to device state. For external case writing "remove" to device state in mdadm is not necessary and harmful. It may cause following issues: 1. "remove" operation for external case in mdadm is not finish with successful result because monitor may block the device or disk has been already removed by monitor. 2. If disk is removed by mdadm earlier than mdmon catch "failed" state, metadata is not properly updated- is not marked as failed. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 11:03:11 +10:00
Adam Kwolek	10d0d365eb	WORKAROUND: mdadm hangs during reshape (PART #2 ) After loop can occurs that due to 0 value reported by kernel we have 0 in completed variable. This is wrong. we are interested in real completed point. 0 value means that we reached sync point set in md, so we can set completed variable to just reached point. this point value is stored in max_progress variable. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 11:03:11 +10:00
Adam Kwolek	fab32c9702	FIX: start_reshape status should be checked mdadm should verify if reshape is started before it goes in to check-pointing machine. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 10:40:42 +10:00
Adam Kwolek	a9c3e78fdd	FIX: Array after takeover has to be frozen Problem occurs when we want to expand single disk raid0 array. This is done via degraded 2 disks raid4 array. When new spare is added to array, md immediately initiates recovery before mdadm can configure and start reshape. This is due fact that 2 disk raid4/5 array is special md case. Mdmon does nothing here because container is blocked. Put array in to frozen state allows mdadm to finish configuration before reshape is executed in md. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 10:26:15 +10:00
Adam Kwolek	d7d205bd25	imsm: FIX: do not allow for container operation for the same disks number imsm_reshape_super() currently allows for expansion when requested raid_disks number is the same as current. This is wrong. Existing in code condition is too weak. We should allow for expansion when new disks_number is greater than current only. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-28 10:25:26 +10:00
Dan Williams	aa4cab513d	fix extended partition detection # mdadm --detail --export /dev/md127p1 Before: MD_LEVEL=raid5 MD_DEVICES=4 MD_METADATA=0.90 After: MD_LEVEL=raid5 MD_DEVICES=4 MD_CONTAINER=/dev/md0 MD_MEMBER=0 MD_UUID=55746a20:925d24a7:4f9bd7e2:9c9a411f We parse the symlink target with a format: ../../block/mdXXX/mdXXXpYY ...and need the second '/' from the end of the string to read detect a 'md' device. Reported-by: Krzysztof Wasilewski <krzysztof.wasilewski@intel.com> Cc: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 12:56:51 +10:00
Labun, Marcin	20b60dcd6c	Dynamic hot-plug udev rules for policies Neil, Please consider this patch that once was discussed and I think agreed with in general direction. It was sent a while ago but somehow did not merged into your devel3-2. This patch enables hot-plug of so called bare devices (as understand by domain policies rules in mdadm.conf). Without this patch we do NOT serve hot-plug of bare devices at all. Thanks, Marcin Labun Subject was: FW: Autorebuild, new dynamic udev rules for hot-plugs >>From c0aecd4dd96691e8bfa6f2dc187261ec8bb2c5a2 Mon Sep 17 00:00:00 2001 From: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Date: Thu, 23 Dec 2010 16:35:01 +0100 Subject: [PATCH] Dynamic hot-plug udev rules for policies Cc: linux-raid@vger.kernel.org, Williams, Dan J <dan.j.williams@intel.com>, Ciechanowski, Ed <ed.ciechanowski@intel.com> When introducing policies, new hot-plug rules were added to support bare disks. Mdadm was started for each hot plugged block device to determine if it could be used as spare or as a replacement member for degraded array. This patch introduces limitation of range of devices that are handled by mdadm. It limits them to the ones specified in domains associated with the actions: spare-same-port, spare and spare-force. In order to enable hot-plug for bare disks one must update udev rules with command mdadm --activate-domains[=filename] Above command writes udev rule configuration to stdout. If 'filename' is given output is written to the file provided as parameter. It is up to system administrator what should be done later. To make such rule permanent (i.e. remain after reboot) rule should be writen to /lib/udev/rules.d directory. Other cases will just need to write it to /dev/.udev/rules.d directory where temporary rules lies. One should be aware of the meaning of names/priorities of the udev rules. After mdadm.conf is changed one is obliged to re-run "mdadm --activate-domains" command in order to bring the system configuration up to date. All hot-plugged disks containing metadata are still handled by existing rules. Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 12:48:04 +10:00
NeilBrown	d6bd632c41	Ignore/don't set data_disks for level=1 When analyse_change sets level=1, data_disks is meaningless as is layout. So don't set them, and make sure we ignore them. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 12:47:58 +10:00
Krzysztof Wojcik	c8b06d8239	Mistake in raid1->raid5 migration 1. Mistake in target level comparison. 2. Initialize reshape->after.data_disks field to proper spares_needed calculation Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 12:47:53 +10:00
Krzysztof Wojcik	dfe77a9ed2	Add raid1->raid0 takeover support Add support for raid1 to raid0 takeover operation in user space. This patch includes support for native and imsm metadata. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 12:47:15 +10:00
Adam Kwolek	26d6e1574a	WORKAROUND: mdadm hangs during reshape During reshape when reshape is finished in md, progress_reshape() hangs on select(). This is because 'sync_completed' is reset to zero before 'sync_action' becomes 'idle', and we don't look for notification on 'sync_action'. So if completed becomes zero after reshape_progress has made some progress, then deduce that reshape has finished. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 07:56:21 +11:00
Adam Kwolek	16d4d84e5d	FIX: monitor doesn't handshake with md when in container are present raid0 and raid5 arrays, and reshape order is: 1. raid0 array 2. raid5 array mdadm cannot set new raid_disks for raid0 array. For this action md has to have handshake with mdmon. We have the following conditions: 1. Raid0 is not monitored 2. raid0 has been just takeovered to raid4/5 (it has to be monitored 3. monitor has to start monitor new raid4/5 array 4. monitor is not started (it is started to second raid5 array) In such situation pig_monitor is required to let know to m monitor about new array (not in the starting monitor case only) Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-27 07:31:25 +11:00
Labun Marcin	96234762a6	imsm: support for Intel SAS controller in get_disk_controller_domain handler get_disk_controller_domain recognizes Intel (R) SAS controller (isci). The function returns three different strings that differentiate disk attached to AHCI, ISCI or unknown controller types to create separate domains for each case. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 11:22:34 +10:00
Labun Marcin	155cbb4c2c	imsm: detail_platform_imsm supports Intel SAS controller (isci driver) Added support in detail_platform_imsm for Intel (R) SAS controller. Function supports AHCI and ISCI controllers. RAID properties are derived from common OROM for both types. Based on code From: Artur Wojcik <artur.wojcik@intel.com> Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 11:22:07 +10:00
Labun Marcin	120dc88745	imsm: prepare detail_platform_imsm to support different types of controllers Pull out the AHCI specific parts of detail_platform_imsm to separate functions. Introduce support new types of controllers. Based on code From: Artur Wojcik <artur.wojcik@intel.com> Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 11:21:29 +10:00
Labun Marcin	88654014d4	imsm: support for Intel(R) SAS controller in imsm handler add_to_super_imsm handler is able to recognize new type of controller. It stores the controller information in its structures and blocks mixing of different controller type in the same container. In this way it maintains compatibility between Linux and Windows IMSM RAID stacks. IMSM metadata does not allow arrays to span on devices attached to different storage controllers. Based on code From: Artur Wojcik <artur.wojcik@intel.com> Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 11:09:57 +10:00
Labun Marcin	a8e5382a54	imsm platform: support for Intel(R) SAS controller. This patch adds platform support for SAS controller(s) built in Intel(R) Patsburg chipset. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: Artur Wojcik <artur.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 11:06:55 +10:00
Krzysztof Wojcik	10ce18083d	FIX: Reset disk state if disk is missing If we can't read actual disk state, it shoud be initiated to 0. Overwise it may be out of date value resulting false action later in code (e.g. set disk to improper state). Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 10:47:52 +10:00
Krzysztof Wojcik	43d5ec1844	Check number of failed disks durig raid10->raid0 takeover Number of failed disks MUST be half of initial number of disks. If number of failed disks is different we should not update metadata- data corruption may occur after array reassemlation. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 10:44:59 +10:00
Krzysztof Wojcik	8ca6df95a2	raid0->raid10 takeover- process metadata update Implementation of raid0->raid10 takeover metadata update at process_update level. - We are using memory previously allocated in prepare_update to create two dummy disks will be inserted in the metadata and new imsm_dev structure with expanded disk order table. - Update indexes in disk list - Update metadata map - Update disk order table Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 10:41:27 +10:00
Krzysztof Wojcik	abedf5fc46	raid0->raid10 takeover- allocate memory for added disks Allocate memory will be used in process_update. For raid0->raid10 takeover operation number of disks doubles so we should allocate memory for additional disks and one imsm_dev structure with extended order table. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 10:38:10 +10:00
Krzysztof Wojcik	0529c688e8	raid0->raid10 takeover- create metadata update Create metadata update for raid0 -> raid10 takeover. Because we have no mdmon running for raid0 we have to update metadata using local update mechanism Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 10:38:00 +10:00
Krzysztof Wojcik	bb025c2f22	Add raid10 -> raid0 takeover support The patch introduces takeover from level 10 to level 0 for imsm metadata. This patch contains procedures connected with preparing and applying metadata update during 10 -> 0 takeover. When performing takeover 10->0 mdmon should update the external metadata (due to disk slot and level changes). To achieve that mdadm calls reshape_super() and prepare the "update_takeover" metadata update type. Prepared update is processed by mdmon in process_update(). Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 08:50:37 +10:00
NeilBrown	f897078e8b	Fix some issues with setting 'new' state of a reshape - when reshaping a container, ->reshape_active is already set even though it isn't really active yet, so we need to set the new geometry even when reshape_active is set. This is safe. - When restarting a reshape, make sure the reshape_position is set appropriately when external metadata is used. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-26 08:50:28 +10:00
NeilBrown	1cc7f4feb9	Don't close fds in write_init_super We previously closed all 'fds' associated with an array in write_init_super .. sometimes, and sometimes at bad times. This isn't neat and free_super is a better place to close them. So make sure free_super always closes the fds that the metadata manager kept hold of, and stop closing them in write_init_super. Also add a few more calls to free_super to make sure they really do get closed. Reported-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-25 07:56:53 +11:00
NeilBrown	446d2a5ad4	Fix up analysis of reshape from RAID1 to RAID5. Need to allow raid-disks to change at the same time. NeilBrown <neilb@suse.de>	2011-01-21 09:13:44 +11:00
NeilBrown	e7a71c6b77	reindent/reformat some code. Indenting was all wrong here, so fix it up. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-21 09:06:31 +11:00
NeilBrown	10af14c4d2	Be more careful about adjusting reshape_progress based on backup. Only adjust reshape_progress based on the backup that was found if the backup covered the current reshape_progress point. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-21 09:03:53 +11:00
NeilBrown	25da62d9ad	Correctly initialise backup_point when reshaping backwards. When reshaping backwards we only backup from backup_blocks to the start, so initialise backup_point appropriately. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-21 09:01:09 +11:00
NeilBrown	2c6ac128c4	error check reading of 'degraded' from sysfs. I'm seen mdadm spinning while failing to read 'degraded'. This doesn't really fix it, but is a reminder that it needs to be fixed. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-21 08:59:00 +11:00
NeilBrown	ddee071d3e	Initialise reshape_progress properly in reshape_array. Previously uninitialised. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-21 08:56:35 +11:00
NeilBrown	38dad34a35	Fix management of backed-up region for hi-to-low reshapes. When reshaping from the end of the array to the start, for times when the number of data devices is decreasing, the handling of the backup area isn't a simple mirror of the handling on low-to-hi reshapes as the backup areas is always low in the array. So re-write that to make it work. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-21 07:59:53 +11:00
Adam Kwolek	55f1472192	FIX: Arrays cannot be opened exclusively When reshaping it is correct to open containers exclusively, but not arrays. The array could very easily be in use, e.g. by a mounted filesystem. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-17 15:06:02 +11:00
Krzysztof Wojcik	e35b189b4e	Unfreeze for non re-striping transitions For non re-striping transitions array must be unfrozen before end of processing. For restriping transitions we normally let the child unfreeze the array but in this case there is no child. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-17 12:56:43 +11:00
Krzysztof Wojcik	031d445c18	Set reshape.after.data_disks for raid0<->raid10 takeover reshape.after.data_disks field must be initiated for raid0<->raid10 transition. Instead calculated spares_needed variable in reshape_array function has random value. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-17 12:53:31 +11:00
Krzysztof Wojcik	471bceb681	Define imsm_analyze_change function Function intended to use for single volume migration. Function analyze transition and validate if it is supported. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-17 12:52:36 +11:00
Czarnowska, Anna	300f503323	fix: Monitor: min_size must be set to 0 Otherwise a random value will be used for comparison later for native and ddf metadata (until min_acceptable_spare_size is defined). Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-17 12:46:14 +11:00
Adam Kwolek	2a9f840972	FIX: sync_completed == 0 causes reshape cancellation in metadata md signals reshape completion (whole area or parts) by setting sync_completed to 0. This causes in set_array_state() to rollback metadata changes (super-intel.c:4977. To avoid this do not allow for set last_checkpoint to 0 if reshape is finished. This was also root cause of my previous fix for finalization reshape that I agreed earlier is not necessary, Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-17 12:44:52 +11:00
Adam Kwolek	cf6ac177b4	FIX: mdadm throws coredump on exit in error case When mdadm falls in "reduce size" error on takeovered array it jumps to release and tries execute backward takeover. This time sra pointer is not initialized and coredump is generated. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-17 12:38:13 +11:00
Czarnowska, Anna	c0dc0ad5f8	fix: segfault if subarray is monitored but container is not In this situation to->parent is null so "to" doesn't change to parent container and to->metadata is still null. This results in segmentation fault when checking to->metadata->ss->external. We should just skip this array as container is needed to move spares to. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-17 12:01:11 +11:00
NeilBrown	a93f87eee6	Add 'restart' arg to various functions used for reshaping. When we restart an array in the middle of a reshape, we reuse a lot of the code for starting the reshape, but it needs to know that circumstances are slightly different. So add a 'restart' arg which is used: - skip checking and adding spares - activate the array (rather than start reshape) - allow the backup file to already exist Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-17 09:53:56 +11:00
NeilBrown	6d5316f66a	Be more careful checking why reshape has stopped. If reshape_position reports 'none', check array_state to see if array is still active. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-17 09:53:25 +11:00
NeilBrown	18eaf9e553	Ignore error when setting sync_min When restarting an array that is in the middle of a reshape, sync_min cannot be set. So just ignore any errors we get when trying to set it. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-17 09:51:33 +11:00
NeilBrown	77a73a17be	Make sure child_monitor reliably reports whether reshape completed or aborted. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-17 09:51:25 +11:00
NeilBrown	631d7405f1	Clean up return path for reshape_array. Particular problem was that we didn't unfreeze if a reshape wasn't needed. But all that 'rv' stuff isn't needed and some of it was wrong, so simplify it all. Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-13 15:20:21 +11:00
Krzysztof Wojcik	5dad078d99	FIX: Mistake in delta_disk comparison. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-01-13 13:37:31 +11:00

1 2 3 4 5 ...

1643 Commits All Branches Search

1643 Commits

All Branches