mdadm

Commit Graph

Author	SHA1	Message	Date
Jes Sorensen	b2916f2514	validate_geometry_imsm_volume(): Avoid NULL pointer dereference Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-11-02 10:48:53 +11:00
NeilBrown	446894ea8d	Grow: fix check_reshape and open_code it. check_reshape should not try to parse the subarray string - only metadata handlers are allowed to do that. The common code and only interpret a subarray string by passing it to "container_content" which will then return only the member for that subarray. So remove check_reshape and place similar logic explicitly at the two call-sites. They are different enough that it is probably clearer to have explicit code. Signed-off-by: NeilBrown <neilb@suse.de>	2011-11-01 15:45:46 +11:00
Jes Sorensen	ea944c8f50	Avoid memory leak In case of second posix_memalign() failing, release memory allocated in first posix_memalign() call. Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-11-01 14:55:59 +11:00
Labun, Marcin	81219e70f2	kill-subarray: fix, IMSM cannot kill-subarray with unsupported metadata container_content retrieves volume information from disks in the container. For unsupported volumes the function was not returning mdinfo. When all volumes were unsupported the function was returning NULL pointer to block actions on the volumes. Therefore, such volumes were not activated in Incremental and Assembly. As side effect they also could not be deleted using kill-subarray since "kill" function requires to obtain a valid mdinfo from container_content. This patch fixes the kill-subarray problem by allowing to obtain mdinfo of all volumes types including unsupported and introducing new array.status flags. There are following changes: 1. Added MD_SB_BLOCK_VOLUME for blocking an array, other arrays in the container can be activated. 2. Added MD_SB_BLOCK_CONTAINER_RESHAPE block container wide reshapes (like changing disk numbers in arrays). 3. IMSM container_content handler is to load mdinfo for all volumes and set both blocking flags in array.state field in mdinfo of unsupported volumes. In case of some errors, all volumes can be affected. Only blocked array is not activated (also reshaped as result). The container wide reshapes are also blocked since by metadata definition they require modifications of both arrays. 4. Incremental_container and Assemble functions check array.state and do not activate volumes with blocking bits set. 5. assemble_container_content is changed to check container wide reshapes before activating reshapes of assembled containers. 6. Grow_reshape and Grow_continue_command checks blocking bits before starting reshapes or continueing (-G --continue) reshapes. 7. kill-subarray ignores array.state info and can remove requested array. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-31 11:29:46 +11:00
Jes Sorensen	e9ef57a816	GCC compile fix: remove calculation of unused variable 'reservation' gcc 4.6.1 doesn't like calculating a variable that then isn't used. Remove it. Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-27 15:27:20 +11:00
root	5961eeec2f	imsm: fix: Fixes metadata after migration from Raid 0 to Raid 10 After migration from Raid 0 to Raid 10, the metadata is incorrect, leaving one mirror disk marked as spare and one missing disk as a member of the array. The reason is that the metadata update code for spare activation procedure takes into account one spare disk only, not checking the following ones. Signed-off-by: Lukasz Orlowski <lukasz.orlowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-22 11:42:16 +11:00
Lukasz Orlowski	061d7da34c	imsm: Moves metadata update code for spare activation to separate function The metadata update code during spare activation is moved to a separate function for clarity of code, as a prework for the next patch fixing the bug. Signed-off-by: Lukasz Orlowski <lukasz.orlowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-22 11:38:56 +11:00
Lukasz Dorau	c4acd1e5c8	imsm: fix: correct debug printing of the volume's name The volume's name is saved in the array of chars. All elements of the array can have nonzero values and the next byte in memory does not have to have the value of 0, so one must be cautious when printing out the volume's name. Signed-off-by: Lukasz Dorau <lukasz.dorau@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-20 12:56:56 +11:00
Lukasz Dorau	7d0c5e24a5	imsm: fix: prevent segfault in mark_failure Using an array of chars without the terminating null byte as a parameter of sprintf() function causes segfault when dealing with SAS drives (with 20-digits serial number). The memcpy() function is used instead. Signed-off-by: Lukasz Dorau <lukasz.dorau@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-20 12:56:56 +11:00
Thomas Jarosch	9cf014ec40	Fix off-by-one in readlink() buffer size handling readlink() returns the number of bytes in the buffer. If we do something like len = readlink(path, buf, sizeof(buf)); buf[len] = '\0'; we might write one byte past the end of the buffer. Signed-off-by: Thomas Jarosch <thomas.jarosch@intra2net.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-17 11:15:04 +11:00
Lukasz Dorau	b601104eb4	imsm: fix: stopped resync does not continue after auto-assemblation Resync stopped with "mdadm -Ss" command does not continue after issuing "mdadm -As" command. Signed-off-by: Lukasz Dorau <lukasz.dorau@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-10 09:16:40 +11:00
Przemyslaw Czarnowski	ea672ee119	imsm: always use set_migr_type to set type of migration For 'resync' besides the update of migration type (imsm_vol.migr_type structure) additionally status (imsm_dev.status) flag is set to DEV_VERIFY_AND_FIX. In order to clean up after migration, status flag must be cleared. For this reason, migration type shouldn't be set directly but via set_migr_type(). Otherwise status does not reflect the state of array. Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-06 14:53:31 +11:00
Lukasz Dorau	b303fe21b5	imsm: fix: correct adding and activation of spare disks During activation of spare disks, only one of all available spare disks can be activated at this moment. It causes that for example during take-over from RAID0 with 2 disks to RAID10, only one of two spare disks is taken for recovery and a degraded RAID10 array with only 3 of 4 working disks is created. It has been fixed by adding more than one of all available spare disks and saving them in additional_test_list which is passed to imsm_add_spare(). Signed-off-by: Lukasz Dorau <lukasz.dorau@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-05 14:17:38 +11:00
Adam Kwolek	3ad2563886	imsm: Fill recovery_blocked field present in mdinfo If any reshape in container is active set recovery_blocked field. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-05 13:32:28 +11:00
Adam Kwolek	b91726651d	imsm: Do not mark resync during reshape During reshape, resync/rebuild in the same container is not possible due to fact that all arrays in container has to share the same disks set. Block new resync/rebuild process initialization and setting resync_start to 0 while any reshape in container is active. This avoids breaking container reshape and doesn't allow for starting multiple processes /resync/rebuild and reshape/ at the same time in md. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-03 10:31:22 +11:00
Adam Kwolek	e2962bfc21	imsm: FIX: Do not allow for spare disk activation during reshape Spare disk activation or starting repair for one array while on second reshape is in progress, will lead to IMSM incompatible situation when 2 arrays in container shares different disks sets. This can cause that 2 processes in container /reshape and rebuild/ are in progress in parallel. This is IMSM incompatible situation also. Block spare disk activation and starting resync if any reshape in container is in progress. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-10-03 10:30:28 +11:00
Czarnowska, Anna	b81221b74e	imsm: Calculate reservation for a spare based on active disks in container New function to calculate minimum reservation to expect from a spare is introduced. The required amount of space at the end of the disk depends on what we plan to do with the spare and what array we want to use it in. For creating new subarray in an empty container the full reservation of MPB_SECTOR_COUNT + IMSM_RESERVED_SECTORS is required. For recovery or OLCE on a volume using new metadata format at least MPB_SECTOR_CNT + NUM_BLOCKS_DIRTY_STRIPE_REGION is required. The additional space for migration optimization included in IMSM_RESERVED_SECTORS is not necessary and is not reserved by some oroms. MPB_SECTOR_CNT alone is not sufficient as it does not include the reservation at the end of subarray. However if the real reservation on active disks is smaller than this (when the array uses old metadata format) we should use the real value. This will allow OLCE and recovery to start on the spare even if the volume doesn't have the reservation we normally use for new volumes. Signed-off-by: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-09-21 14:44:35 +10:00
NeilBrown	ecbd9e8160	Create: improve messages from validate_geometry. When validate_geometry finds that we haven't committed to a metadata yet and that the subdev is a member of 'our' container, it needs to report any errors it finds as Create() cannot report them effectively. So make a slight change to the semantics of the 'verbose' flag and allow validate_geometry to report if it printed any error messages. Signed-off-by: NeilBrown <neilb@suse.de>	2011-09-21 14:39:01 +10:00
Lukasz Orlowski	e7cb06c845	Create: Allow to create two volumes of different sizes within one container Allows to create RAID 5 volume on 3 disks and then RAID 1 volume on 2 disks withing the same container. Signed-off-by: Lukasz Orlowski <lukasz.orlowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-09-21 13:24:34 +10:00
Adam Kwolek	a8619d23b8	imsm: FIX: Spare disk has wrong serial after takeover Takeover marks disk as failed and adds to serial ':0' string and then turns it in to spare. This causes that when new spare is about to be used, it cannot be found due to different disk serial number. Restore disk serial number to avoid this problem. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-09-19 13:13:07 +10:00
Dan Williams	3960e579bf	imsm: support 'missing' devices at Create Specifying missing devices at create is very useful for array recovery. For imsm create dummy disk entries at init_super_imsm time, and then use them to fill in unoccupied slots in the final array (if the container is unpopulated). If the container is already populated (has a subarray) 'missing' disks must be in reference to already recorded missing devices in the metadata. Also add support for --assume-clean for imsm arrays. Cc: Dave Jiang <dave.jiang@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-08-30 13:11:42 +10:00
Dan Williams	b276dd33c7	imsm: fix reserved sectors for spares Different OROMs reserve different amounts of space for the migration area. When activating a spare minimize the reserved space otherwise a valid spare can be prevented from joining an array with a migration area smaller than IMSM_RESERVED_SECTORS. This may result in an array that cannot be reshaped, but that is less surprising than not being able to rebuild a degraded array. imsm_reserved_sectors() already reports the minimal value which adds to the confusion when trying rebuild an array because mdadm -E indicates that the device has enough space. Cc: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-08-30 10:49:42 +10:00
Dan Williams	0ec1f4e8de	imsm: fix display spares Commit `94827db3` "imsm: add spares to --examine output." may try to display failed disks whose imsm_disk info is not uptodate (due to not being able to look itself up by serial). The same effect can be had by just loosening the restriction in print_imsm_disk(). Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-08-30 10:49:42 +10:00
Dan Williams	86c54047e6	imsm: fix, stop metadata updates to newly failed devices We already refrain from updating metadata on disks that are failed at load, need to do the same for new failures. This also reverts `b4add146` as we do want to update other disks' view of the failed device as out of date. Cc: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-08-30 10:49:42 +10:00
Dan Williams	660260d027	imsm: fix max disks per array Validate geometry is incorrectly looking at max disks support which is irrelevant for md/mdadm. ->dpa (disks per array) is how many disks the orom will allow per volume. Also cleanup an unnecessary ->orom check, is_raid_level_supported() already does the right thing in the !orom case. Cc: Marcin Labun <marcin.labun@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-08-30 10:49:41 +10:00
NeilBrown	418f9b368a	IMSM: allow some array attribute bits to be ignored. Some bits are not handled by mdadm, but their presence should not cause failure. In particular MPB_ATTRIB_NEVER_USE appears harmless. Reported-by: Thomas Steinborn <thestonewell@googlemail.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-08-09 08:49:34 +10:00
NeilBrown	656b6b5a55	IMSM: set ->raid_disk correctly in getinfo_super_imsm_volume The 'raid_disk' can be different to the 'number' and must be the position of the device in the array, not in the container. Normally these should not be different, but the test-suite creates a possibility so it should work. Signed-off-by: NeilBrown <neilb@suse.de>	2011-07-27 16:11:48 +10:00
Dan Williams	cd9d1ac715	imsm: fix default chunk in the !orom case Set a valid default in the !orom case, otherwise we segfault, or otherwise fail. Cc: Anna Czarnowska <anna.czarnowska@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-07-19 16:53:08 +10:00
NeilBrown	ca0748fa49	imsm: getinfo_super_imsm_volume() doesn't fill all disk information getinfo_super_imsm_volume doesn't correctly set info.disk fields because it doesn't know which disk to set them from. It should be the last disk passed to add_to_super. So add a field 'current_disk' to record this disk in add_to_super, and use it in getinfo_super. This allows us to remove a hack in Create.c Signed-off-by: NeilBrown <neilb@suse.de>	2011-07-14 15:42:10 +10:00
Milan Broz	19986c721c	mdadm: fix build failures (ppc64) This patch fixes these build issues: super-intel.c: In function 'getinfo_super_imsm_volume': super-intel.c:2327:4: error: format '%llu' expects argument of type 'long long unsigned int', but argument 3 has type '__u64' [-Werror=format] super-intel.c: In function 'imsm_reshape_super': super-intel.c:8665:7: error: 'devnum' may be used uninitialized in this function [-Werror=uninitialized] Signed-off-by: Milan Broz <mbroz@redhat.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-07-14 13:58:36 +10:00
NeilBrown	664d53258d	super-intel: fix buffer overflow in detail-platform. The serial number is not necessarily nul terminated, so we need to be sure to only use the allowed number of chars. Signed-off-by: NeilBrown <neilb@suse.de> Reported-by: Arvin Schnell <aschnell@novell.com>	2011-07-13 12:38:50 +10:00
Luca Berra	e4c72d1dc6	Fix some compiler warnings. Original by Luca, with various changes by Neil Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-17 14:35:06 +10:00
NeilBrown	9e2d750d4c	Various fixes so that "make everything" works. In particular: protect some stuff from MDASSEMBLE and report and error from 'write'. Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-16 17:13:50 +10:00
Albert Pauw	9ec11d1afd	Remove compiler warning about signed/unsigned comparison. Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-15 14:39:30 +10:00
Adam Kwolek	19482bcc40	imsm: Metadata Attributes compatibility support IMSM's meta data contains Attributes field that contains information about supported features. To assembly an array mdadm has to support all features specified by attributes. The patch introduces new attributes support and validation of the attribuses during an array assembly. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-15 09:58:57 +10:00
Adam Kwolek	f8b72ef517	imsm: FIX: Sometimes reshape cannot be finished When array size is not aligned to copy area, number of migration unit is increased in init_migr_record_imsm():7665 to reshape whole array. During calculation of last migration unit, this should be in mind also, otherwise checkpoint (max-1) is always written and reshape is never finished in mdadm. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-15 09:13:49 +10:00
Adam Kwolek	7534230b07	imsm: FIX: klocwork: passed dev pointer to is_gen_migration() can be NULL Pointer dev2 passed in write_super_imsm():4451 can be equal to NULL. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-14 12:48:58 +10:00
Adam Kwolek	7e45b5504c	imsm: Fix: klocwork: targets variable can be used uninitialized When target_offsets allocation fails execution goes to abort label, where elements from targets table are closed. Initialize targets table after allocation. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-14 12:48:53 +10:00
Adam Kwolek	e1c1d4f442	imsm: FIX: Migration Raid0->Raid5 cannot be restarted correctly When array raid0 is migrated to raid5, reshape cannot be continued correctly due to wrong array parameters settings. Raid disks number is set too big. There is no need, during raid0->raid5 migration to increase info->array.raid_disks, it is already set to final value using designation map information. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-14 12:46:53 +10:00
Adam Kwolek	d1877f697d	imsm: FIX: Raid5 data corruption data recovering from backup Sporadicaly when Raid5's data are restored from backup area, corruption occurs. It doesn't happen if reshape process is beyond critical section. Root cause of the problem is passing wrong starting point in restore_stripes(). It was hard coded to 0 so far. This causes that parity disks position in first stripe was always set to the last raid disk. This position should depend on data position in array. Proper start position was set and pointer for restoring data (copy area address) is adjusted to passed start parameter. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-14 12:42:22 +10:00
Adam Kwolek	b66e591b14	imsm: FIX: Disable automatic metadata rollback for broken reshape mdmon cannot rollback metadata changes automatically. It can break reshape process in the way that in case of reshape break user will not be able to deal with broken reshape due to lack of information about reshape geometry. mdadm (process that invokes reshape) doesn't make any rollback to allow for user action. mdmon should not do this either unless it knows for sure it is save. such knowledge is not available for automatic rollback. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-14 12:42:16 +10:00
Adam Kwolek	68eb8bc6ca	imsm: FIX: Use function to obtain array layout Function imsm_level_to_layout() should be use to get array layout. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-14 12:42:08 +10:00
Adam Kwolek	8016a6d42e	imsm: Optimize expansion speed when no backup is required When no reshape backup is required (e.g. OLCE after critical section), check-pointing can use bigger steps than backup space allows for. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:56 +10:00
Adam Kwolek	a47e44fb96	imsm: FIX: Remove timeout from wait_for_reshape_imsm() Timeout should not be used for select function in wait_for_reshape_imsm(). Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	ae9f01f89b	imsm: FIX: wait_for_reshape_imsm() cleanup This function needs to be corrected. It should check sysfs operations status and it should not interpret 0 reshape position special meaning. Unused input parameter is removed also. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	b2c5943816	imsm: FIX: Do not continue reshape when backup exists When backup exists in copy area reshape cannot be continued. In such situation, array is in unstable state. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	a6b6d984e0	imsm: FIX: Remove unused variables and code Unused variables and code can be removed from imsm_manage_reshape() Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	befb629b0b	imsm: FIX: Move reshape_progress forward When array under reshape is assembled, reshape position used in sysfs_set_array() should be set to position after recovered from backup area. This avoids data corruption due to reshape the same array area again. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	6c3560c0f2	imsm: FIX: Detect failed devices during recover_backup_imsm() Detect in recover_backup_imsm() if not opened disks number is smaller than allowed degradation for given raid level. This allows for reshape restart on degraded array. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	ab724b9862	imsm: FIX: Use metadata information for restore_stripes() and save_stripes() For raid0 reshape imsm uses degraded raid4 for this operation. Using real raid level (raid0) for stripe calculation causes no need for parity calculation and can speed up reshape process. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	aea9317117	imsm: FIX: Remove unused parameter from save_backup_imsm() interface new_data parameter is not used in save_backup_imsm(). It is removed from function interface. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	92144abfbe	imsm: FIX: Do not use pba_of_lba0 for copy position calculation imsm_manage_reshape() should not shift start copy position. This offset is passed to manage reshape function /and it is used/ as input parameter in offsets table already. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	1ab242d891	imsm: FIX: Do not verify unused parameters Parameters that are not used by imsm_manage_reshape() should not cause failure of this function. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	75b69ea420	imsm: FIX: Calculate backup location based on metadata information Use metadata information to calculate backup write offset. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	7b1ab482f6	imsm: FIX: Use macros to data access Metadata fields has to be accessed using proper macros. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	e13ce846aa	imsm: FIX: Check layout for level migration When user doesn't specify raid 5 layout for raid0->rai5 migration, layout structure member is uninitialized. Earlier it cannot be determined if it is correct or not. In metadata handle proper verification is placed. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	3ef4403cf6	imsm: FIX: Max position could not be rounded to MB When rounded array size information from metadata is used for number of migration units calculation it can occurs that result of units can be smaller (-1) than required due to used (rounded) array size). Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	0228d92ca3	imsm: FIX: Detect migration end during migration record saving Checkpoint should be saved when migration is in progress only. End of reshape (based on passes status) should be detected and it should not cause abort of reshape/check-pointing/ operation. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:55 +10:00
Adam Kwolek	2e062e8210	imsm: FIX: Verify if migration record is loaded correctly Migration compatibility can be checked when general migration record is present. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:54 +10:00
Adam Kwolek	6b7a407dce	imsm: FIX: Opened handle is not closed Opened file handle should be closed before function exit. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 13:00:54 +10:00
NeilBrown	9894ec0d64	Fix some fall-out from recent memset-zero for getinfo_super container_content_imsm was setting info->next before calling getinfo_super_imsm_container which now zeros everything. So move that assignment to afterwards. So both imsm and ddf were assuming info->disk.raid_disk means something but it doesn't. So fix those. Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-09 12:42:02 +10:00
Adam Kwolek	480be36336	imsm: Remove user warning before reshape start imsm's arrays supports imsm native check-pointing now. User warning is no longer required. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 17:14:33 +10:00
Adam Kwolek	0ec5d470e0	imsm: Apply checkpoint metadata update for general migration mdmon has to update checkpoint information in metadata during general migration according to received metadata update. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 17:13:21 +10:00
Adam Kwolek	c17608eac3	imsm: Prepare checkpoint update for general migration mdadm has to prepare checkpoint information update and send it to mdmon. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 17:12:48 +10:00
Adam Kwolek	2d40f3a132	imsm: Add metadata update type for general migration check-pointing There are 2 places for keeping checkpoint information: - metadata (per volume information used during volume initialization and rebuilding). - migration record (per container information used during migration/reshape) During reshape both checkpoints has to contains the same information. To do this mdadm will send metadta update with checkpoint information. Note: Checkpoint information consistence is not critical. During general migration restart, information from migration record is used only. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 17:12:39 +10:00
Adam Kwolek	5b83bacff6	imsm: Disable checkpoint updating by mdmon for general migration imsm contains 2 check-pointing mechanism. One (per array) is used for initialization and rebuild and second (per container) is used for general migration (reshape). First is controlled by mdmon, second by mdadm. To avoid conflicts disable mdmon checkpoints updating for general migration. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 17:11:49 +10:00
Adam Kwolek	276d77db1f	imsm: Implement recover_backup_imsm() for imsm metadata Add ability to restore data backed up in General Migration Copy Area in case of unexpected reshape interruption. Function restores data during an array assembly and then reshape is continues from next checkpoint. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 17:11:23 +10:00
Adam Kwolek	c47b0ff69a	imsm: update blocks_per_migr_unit() to support migration record blocks_per_migr_unit() has to use information from migration record for general migration case. This causes to pass intel_super pointer to this function and some other interfaces changes. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 17:09:50 +10:00
Adam Kwolek	520e69e25c	imsm: Add information about migration record to mdadm '-E' option Add ability to display information from migration record in examine option. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 17:09:29 +10:00
Adam Kwolek	146c626037	imsm: Clear migration record when no migration in progress When metadata is saved and there is no general migration in progress /in container/ clear migration record in container. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 17:09:16 +10:00
Adam Kwolek	b915c95fd3	imsm: Check if array degradation has been changed Before reshaping every "migration unit", check if array is still usable. In failed disks number is greater than allowed degradation level, reshape has to be aborted. Signed-off-by: Maciej Trela <maciej.trela@intel.com> Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 17:09:09 +10:00
Adam Kwolek	10f228541c	imsm: Implement imsm_manage_reshape(), reshape workhorse Before reshape is started, mdadm should check again if there is only one array (in container) under reshape. Then function "divides" array in to "migration units" that can fits migration copy area and enters main loop. It checks if current "migration unit" requires to be backed up. If necessary mdadm saves it to copy area and updates migration record. Then MD-driver is directed to perform reshape step (by "migration unit" size) and checkpoint is moved forward. In this way reshape is executed until array ends. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 17:09:08 +10:00
Adam Kwolek	eee67a47f2	imsm: Add wait_for_reshape_imsm() implementation After each checkpoint mdadm should set new reshaped area and wait until md finishes reshape. Function wait_for_reshape_imsm() sets new reshape range and waits for job completion. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 17:07:10 +10:00
Adam Kwolek	e2f41b2c6a	imsm: check migration compatibility Under Windows IMSM can reshape arrays in 2 directions (ascending and decsending). Under Linux one (ascending) direction is supported at this moment. Block loading metadata when decsending reshape is detected Windows also uses optimalization area during reshaping array. Linux does not support it. The patch blocks this operation also. Signed-off-by: Maciej Trela <maciej.trela@intel.com> Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 16:46:37 +10:00
Adam Kwolek	687629c2b2	imsm: Add support for copy area and backup operations This patch adds methods of manipulating migration record: init_migr_record_imsm() - initiate migration record at the beginning of the reshape process write_imsm_migr_rec() - saves migration record to array. Migration record is stored on 2 first disks in array only. save_backup_imsm() - saves critical data stripes to Migration Copy Area and updates the current migration unit status. Uses restore_stripes() to format a destination stripe, and to write it to the Migration Copy Area. save_checkpoint_imsm() - Updates the current unit status in the migration record. Migration record is written to 2 first array disks only (similar to reading operation). Signed-off-by: Maciej Trela <maciej.trela@intel.com> Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 16:46:35 +10:00
Adam Kwolek	8e59f3d882	imsm: Add migration record to intel_super IMSM for securing reshape process uses special disk area outside metadata for reshaped area backup purposes. If just reshaped array area requires backup, bunch of array stripes prepared for reshape is stored in to Migration Copy Area. In case of reshape interruption, Option ROM during restart or mdadm during reshape restart (when no reboot occurs) will restore Migration Copy Area to designation array. Reshape can be continued from stable array stable state. This patch adds support for IMSM migration record structure. IMSM migration record is stored on the first two disks of IMSM volume during the migration. Add function for reading migration record, so mdadm can read (if present) migration record. Migration record has to be cleared every time MIGR_GEN_MIGR is started. Signed-off-by: Maciej Trela <maciej.trela@intel.com> Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 16:19:06 +10:00
NeilBrown	95eeceeb32	getinfo_super now clears the 'info' structure before filling it in. Some code currently clears 'info' before calling getinfo_super, some code doesn't. To be consistent, change it so no caller ever clears 'info', but ever getinfo_super function must clear it. Note that ->raid_disk may be meaningful if that 'map' is passed non-NULL. In that case it is copied out before the structure is zeroed. Signed-off-by: NeilBrown <neilb@suse.de>	2011-06-08 15:54:13 +10:00
Przemyslaw Czarnowski	4bba043921	imsm: add new chunk size to metadata update Put information about new chunk size change in to migration metadata update allowing simultaneous level change and re-striping. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-05-09 11:45:53 +10:00
Przemyslaw Czarnowski	a29911dac1	imsm: process update for raid level migrations Received update and prepared memory is processed to update imsm metadata. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-05-09 11:45:53 +10:00
Przemyslaw Czarnowski	bc0b9d3496	imsm: prepare memory for level migration update When level is changed from raid0 to raid5 memory is required for replace device smaller device/array object. This memory is allocated in manager context in prepare_update() Prepare_update() is called in manager context so memory allocation are allowed here. This allows us to look for spare devices for meta update. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-05-09 11:45:53 +10:00
Przemyslaw Czarnowski	c7958710e7	imsm: fix: disable migration from raid5->raid0 it is not supported yet, so start such transition is improper. Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-05-09 11:45:53 +10:00
Przemyslaw Czarnowski	48c5303aff	imsm: prepare update for level migrations reshape Introducing raid0->raid5 level migration metadata update structure is prepared for future use. Adding spare device is required to hold additional raid5 parity. Mdadm just checks for spares, but it is not included in update. If there are no spares available, abort. Otherwise we will create degraded array what should be not allowed. Mdmon will decide what spare device is used for parity. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: Przemyslaw Czarnowski <przemyslaw.hawrylewicz.czarnowski@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-05-09 11:45:53 +10:00
Adam Kwolek	bfd80a5677	imsm: FIX: Do not write check-point '0' When 2 arrays are configured in container and arrays are reassembled during rebuild or initialization, checkpoint for one array can be reset. It depends on arrays assembly order. Scenario: 1. Create 2 arrays (e.g. raid5) 2. Add spare to container 3. Degrade arrays /rebuild starts on array #1 and continues to n%/ 4. Reassembly arrays 5. Rebuild starts on array #2 /because of assembly order/ from 0% 6. On first checkpoint stored for array #2 (non 0 value), checkpoint for array #1 is cleared /it is delayed rebuild in md, so progress is 0/ 7. Rebuild on #1 starts from n% /it was configured before checkpoint was cleared/. Any next reassembly during rebuild of #2 array (after p.6) causes checkpoint information lost for array #1. Solution is not store checkpoint for progress == 0. Checkpoint is set to 0 when rebuild/initialization starts. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-05-02 16:12:03 +10:00
Adam Kwolek	cd0430a17c	FIX: Always report new raid_disks during migration To behave in the similar way as native metadata during migration, new raid disks number has to be reported by metadata handler. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-04-18 10:11:33 +10:00
Adam Kwolek	139dae1137	imsm: fix: report aligned component size value OROM can create array with chunk size not aligned. To resolve this problem in mdadm, metadata handler has to report component size aligned value for mdadm operations while metadata value stays unchanged. Do not correct alignment for raid1 and in error case. Correction allows check in analyse_change() (Grow.c:905) to pass. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-04-06 12:40:31 +10:00
Adam Kwolek	2a4a08e7d3	imsm: FIX: Check array alignment before expansion It can occur that OROM creates array not aligned properly. Expansion cannot be run in such cases. It is detected in analyse_change(). It is too late. This causes that metadata is in migration state already, when expansion cannot be started. This problem has to be detected before metadata is updated, in all arrays in reshaped container. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-04-06 12:40:04 +10:00
Adam Kwolek	6dc0be309d	imsm: Warn user about reboot risk Current check-pointing implementation doesn't allow for interrupting reshape of boot arrays due to checkpoint restore has to be done before system start. There is problem with passing backup file name to array automatically mounted during boot time, especially when scan mode is used. Until IMSM check-pointing implementation will be introduced, warning about reboot risk should be placed in mdadm. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-04-06 12:38:50 +10:00
NeilBrown	7b0bbd0f71	Release 3.2.1 Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-28 13:30:29 +11:00
Krzysztof Wojcik	b4add146a0	FIX: imsm: Do not change serial if disk failed This patch rollback one change connected with mdadm-OROM compatibility: adding ':0' at the end of disk serial number if disk is detected as failed. Current mdadm's implementation does not distinguish two cases when disk is marked as failed: 1. If disk is really failed- disconnected, broken 2. Just marked as failed by mdadm- using "-f" option Second case is not yet fully handled and compatible with IMSM standard. Changing serial number of existing, operational disk causes problems in "thunderdome" and "load_super" functions that use serial numbers to disks comparisons and searching. The change must be recalled until full support will be developed. Signed-off-by: Krzysztof Wojcik <krzysztof.wojcik@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-24 10:15:01 +11:00
Labun, Marcin	ea2bc72b00	super-intel: enable loading metadata from non-IMSM compliant disks Honor ignore_hw_compat to load metadata from disk attached to non-IMSM controller or when there are no IMSM OROM/EFI capabilities. Used only for guessing and examining metadata format. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-23 12:05:53 +11:00
Adam Kwolek	ceaf0ee19e	imsm: FIX: indicate that metadada has to be written During adding spare disks to raid0, spare metadata is not written. This is due to exit form sync_metadata() on empty updates_pending flag. When mdmon is absent indicate sync_metadata() to flush changes to disks. Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-20 15:47:31 +11:00
Adam Kwolek	6289d1e079	imsm: FIX: Store checkpoint in per disk units While last_checkpoint is counter in per disk units, checkpoints should be stored in the same manner. Restoring from checkpoint should should recalculate checkpoint in to array position (reshape_progress). Signed-off-by: Adam Kwolek <adam.kwolek@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-14 18:17:53 +11:00
NeilBrown	d424212ed9	Make find_intel_hba_capability less verbose. mdadm has a convention in some areas of passing a device name if error messages about it are interesting, or NULL if not. Follow this convention with find_intel_hba_capability so that it doesn't complain when not appropriate - and so that it doesn't have to go and find a device name that it wasn't given. Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-10 14:53:30 +11:00
Labun, Marcin	f2f5c343ff	imsm: introduce SAS controller support in imsm metadata handler OROM/EFI capabilities are retrieved based on disk's controller type. 1/ alloc_super no longer retrieves OROM capabilities 2/ find_imsm_capability replaces find_imsm_orom 3/ new function find_intel_hba_capability gets disk's HBA and relevant capability Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-10 11:52:15 +11:00
Labun, Marcin	f0f5a01660	imsm: move code for retrieving HBA to a function Function find_intel_hba_capability attaches HBA information to intel_super structure based on fd of the component disk. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-10 11:50:58 +11:00
Labun, Marcin	8603ea6f22	imsm: verify that component disks are attached to the same type of HBA compare_super_imsm verifies that the component disks use the same type of HBA in platform dependent environment. Otherwise print-out error message and block the action. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-10 11:50:57 +11:00
Labun, Marcin	7340812950	imsm: add maximum number of disk validation in RAID array Arrays exceeding the OROM/EFI maximum number of supported disk are blocked in validate_geometry_imsm_orom function. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-10 11:50:54 +11:00
Labun, Marcin	d54559f08a	imsm: print-out error message when volume validation fails Print-out error message when volume geometry fails to comply with OROM/EFI controller's capabilities. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-10 11:50:52 +11:00
Labun, Marcin	2db863023e	imsm: do not publish OROM/EFI unsupported arrays Container_content_imsm calls validate_goemtry_imsm_orom to verify that the array parameters are supported by controller's OROM/EFI. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-10 11:50:49 +11:00
Labun, Marcin	a891a3c24d	imsm: detail_platform_imsm displays AHCI and SAS controller information The function uses find_intel_device and find_imsm_capability to present AHCI and SAS controller capabilities taken from OROM or EFI. Signed-off-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2011-03-10 11:46:11 +11:00

1 2 3 4 5 ...

508 Commits