mdadm

Commit Graph

Author	SHA1	Message	Date
Artur Paszkiewicz	5308f11727	Generic support for --consistency-policy and PPL Add a new parameter to mdadm: --consistency-policy=. It determines how the array maintains consistency in case of unexpected shutdown. This maps to the md sysfs attribute 'consistency_policy'. It can be used to create a raid5 array using PPL. Add the necessary plumbing to pass this option to metadata handlers. The write journal and bitmap functionalities are treated as different policies, which are implicitly selected when using --write-journal or --bitmap options. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-29 11:32:15 -04:00
NeilBrown	b4decd517d	Detail: handle non-existent arrays better. If you call "mdadm --detail" with a device file for an array which doesn't exist, such as by mknod /dev/md57 b 9 57 mdadm --detail /dev/md57 you get an unhelpful message about and inactive RAID0, and return status is '0'. This is confusing. So catch this possibility and print a more useful message, and return a non-zero status. Signed-off-by: NeilBrown <neilb@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-28 14:33:48 -04:00
NeilBrown	1ab9ed2afb	Add 'force' flag to hot_remove_disk(). In rare circumstances, the short period that hot_remove_disk() waits isn't long enough to IO to complete. This particularly happens when a device is failing and many retries are still happening. We don't want to increase the normal wait time for "mdadm --remove" as that might be use just to test if a device is active or not, and a delay would be problematic. So allow "--force" to mean that mdadm should try extra hard for a --remove to complete, waiting up to 5 seconds. Note that this patch fixes a comment which claim the previous wait time was half a second, where it was really 50msec. Signed-off-by: NeilBrown <neilb@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-28 14:32:35 -04:00
NeilBrown	fdd015696c	Introduce sys_hot_remove_disk() The new hot_remove_disk() will retry HOT_REMOVE_DISK several times in the face of EBUSY. However we sometimes remove a device by writing "remove" to the "state" attributed. This should be retried as well. So introduce sys_hot_remove_disk() to repeat this action a few times. Signed-off-by: NeilBrown <neilb@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-28 14:30:49 -04:00
Zhilong Liu	1b4944f3a1	mdadm/Build:check the level parameter when build new array check if user forgets to specify the --level when build a new array. such as: ./mdadm -B /dev/md0 -n2 /dev/loop[0-1] Signed-off-by: Zhilong Liu <zlliu@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-28 14:26:47 -04:00
NeilBrown	2dd271fe70	Retry HOT_REMOVE_DISK a few times. HOT_REMOVE_DISK can fail with EBUSY if there are outstanding IO request that have not completed yet. It can sometimes be helpful to wait a little while for these to complete. We already do this in impose_level() when reshaping a device, but not in Manage.c in response to an explicit --remove request. So create hot_remove_disk() to central this code, and call it where-ever it makes sense to wait for a HOT_REMOVE_DISK to succeed. Signed-off-by: NeilBrown <neilb@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-28 14:25:23 -04:00
Hannes Reinecke	3a77acd717	udev-md-raid-assembly.rules: Skip non-ready devices If a device isn't fully initialized (e.g if it should be handled by multipathing) it should not be considered for md/RAID auto-assembly. Doing so can cause incorrect results such as causing multipath to fail during startup. There is a convention that the udev environment variable SYSTEMD_READY be set to zero for such devices. So change the mdadm rules to ignore devices with SYSTEMD_READY==0. Signed-off-by: Hannes Reinecke <hare@suse.de> Signed-off-by: NeilBrown <neilb@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-28 14:22:32 -04:00
Zhilong Liu	4c829c2252	mdadm/bitmap:fixed typos in comments of bitmap.h bitmap.h: fixed trivial typos in comments Signed-off-by: Zhilong Liu <zlliu@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-28 14:08:04 -04:00
Gioh Kim	e23c2663da	super1: ignore failfast flag for setting device role There is corner case for setting device role, if new device has failfast flag. The failfast flag should be ignored. Signed-off-by: Gioh Kim <gi-oh.kim@profitbricks.com> Signed-off-by: Jack Wang <jinpu.wang@profitbricks.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-28 13:59:03 -04:00
Xiao Ni	5b97512954	mdadm: Forced type conversion to avoid truncation Gcc reports it needs 19 bytes to right to disk->serial. Because the type of argument i is int. But the meaning of i is failed disk number. So it doesn't need to use 19 bytes. Just add a type conversion to avoid this building error Signed-off-by: Xiao Ni <xni@redhat.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-28 13:56:43 -04:00
Xiao Ni	618f4e6d63	Replace snprintf with strncpy at some places to avoid truncation In gcc7 there are some building errors like: directive output may be truncated writing up to 31 bytes into a region of size 24 snprintf(str, MPB_SIG_LEN, %s, mpb->sig); It just need to copy one string to target. So use strncpy to replace it. For this line code: snprintf(str, MPB_SIG_LEN, %s, mpb->sig); Because mpb->sig has the content of version after magic, so it's better to use strncpy to replace snprintf too. Signed-off-by: Xiao Ni <xni@redhat.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-28 13:54:43 -04:00
Zhilong Liu	75dd32a185	mdadm/Monitor: Fix NULL pointer dereference when stat2devnm return NULL Wait(): stat2devnm() returns NULL for non block devices. Check the pointer is valid derefencing it. This can happen when using --wait, such as the 'f' and 'd' file type, causing a core dump. such as: ./mdadm --wait /dev/md/ Reviewed-by: NeilBrown <neilb@suse.com> Signed-off-by: Zhilong Liu <zlliu@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-27 18:24:19 -04:00
Zhilong Liu	e81070a69b	mdadm/mdmon:deleted the abort_reshape never invoked mdmon.c: abort_reshape() has implemented in Grow.c, this function doesn't make a lot of sense here. Signed-off-by: Zhilong Liu <zlliu@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-27 18:16:19 -04:00
Zhilong Liu	27c48b375d	mdadm:it doesn't make sense to set --bitmap twice mdadm.c: it doesn't make sense to set --bitmap twice. Signed-off-by: Zhilong Liu <zlliu@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-27 18:14:09 -04:00
Zhilong Liu	7054da69c7	mdadm:fixed some trivial typos in comments of mdadm.h mdadm.h: fixed some trivial typos in comments Signed-off-by: Zhilong Liu <zlliu@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-27 18:00:25 -04:00
Xiao Ni	ff9239ee31	mdadm: Specify enough length when write to buffer In Detail.c the buffer path in function Detail is defined as path[200], in fact the max lenth of content which needs to write to the buffer is 287. Because the length of dname of struct dirent is 255. During building it reports error: error: ‘%s’ directive writing up to 255 bytes into a region of size 189 [-Werror=format-overflow=] In function examine_super0 there is a buffer nb with length 5. But it need to show a int type argument. The lenght of max number of int is 10. So the buffer length should be 11. In human_size function the length of buf is 30. During building there is a error: output between 20 and 47 bytes into a destination of size 30. Change the length to 47. Signed-off-by: Xiao Ni <xni@redhat.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-17 15:58:16 -04:00
Xiao Ni	8268821b43	mdadm: Add Wimplicit-fallthrough=0 in Makefile There are many errors like 'error: this statement may fall through'. But the logic is right. So add the flag Wimplicit-fallthrough=0 to disable the error messages. The method I use is from https://gcc.gnu.org/onlinedocs/gcc/Warning-Options.html #index-Wimplicit-fallthrough-375 Signed-off-by: Xiao Ni <xni@redhat.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-17 15:56:50 -04:00
Zhilong Liu	72b616aff2	mdadm:add checking clustered bitmap in assemble mode mdadm:Both clustered and internal array don't need to specify --bitmap when assembling array. Signed-off-by: Zhilong Liu <zlliu@suse.com> Acked-by: Coly Li <colyli@suse.de> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-07 11:56:33 -05:00
Zhilong Liu	d64c228363	mdadm:add man page for --symlinks In build and create mode: --symlinks Auto creation of symlinks in /dev to /dev/md, option --symlinks must be 'no' or 'yes' and work with --create and --build. In assemble mode: --symlinks See this option under Create and Build options. Signed-off-by: Zhilong Liu <zlliu@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-06 16:34:32 -05:00
NeilBrown	36352fc957	examine: tidy up some code. Michael Shigorin reports that the 'lcc' compiler isn't able to deduce that 'st' must be initialized in if (c->SparcAdjust) st->ss->update_super(st, NULL, "sparc2.2", just because the only times it isn't initialised, 'err' is set non-zero. This results in a 'possibly uninitialised' warning. While there is no bug in the code, this does suggest that maybe the code could be made more obviously correct. So this patch: 1/ moves the "err" variable inside the for loop, so an error in one device doesn't stop the other devices from being processed 2/ calls 'continue' early if the device cannot be opened, so that a level of indent can be removed, and so that it is clear that 'st' is always initialised before being used 3/ frees 'st' if an error occured in load_super or load_container. Reported-by: Michael Shigorin <mike@altlinux.org> Signed-off-by: NeilBrown <neilb@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-06 16:33:44 -05:00
Zhilong Liu	9d67f6496c	mdadm:check the nodes when operate clustered array It doesn't make sense to write_bitmap with less than 2 nodes, in order to avoid 'write_bitmap' received invalid nodes number, it would be better to do checking nodes in getopt operations. Signed-off-by: Zhilong Liu <zlliu@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-06 16:28:21 -05:00
Zhilong Liu	e93bb91b12	mdadm:fix typo in comment Signed-off-by: Zhilong Liu <zlliu@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-03-06 16:27:38 -05:00
Wol	bb6f40bf9c	Fix oddity where mdadm did not recognise a relative path mdadm assumed that a pathname started with a "/", while an array name didn't. This alters the logic so that if the first character is not a "/" it tries to open an array, and if that fails it drops through to the pathname code rather than terminating immediately with an error. Signed-off-by: Wol <anthony@youngman.org.uk> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-01-29 12:45:20 -05:00
Pawel Baldysiak	565cc99e34	imsm: fix missing error message during migration If user tries to migrate from raid0 to raid5 and there is no spare drive to perform it - mdadm will exit with errorcode, but no error message is printed. Print error instead of debug message when this condition occurs, so user is informed why requested migration is not started. Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-01-26 20:56:28 -05:00
Jes Sorensen	53835cf500	Makefile: Fix date to be output in ISO format Updated the static version in the release, but forgot to fix the Makefile generated version when extracting from git Reported-by: NeilBrown <neilb@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-01-10 18:51:40 -05:00
Jes Sorensen	25cfca3134	Release mdadm-4.0 My first release! Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-01-09 16:46:34 -05:00
Maksymilian Kunt	ef5c214e4f	imsm: show correct size for arrays with 4k disks Number of blocks used to calculate array size is based on 512 block size so the size displayed is incorrect for arrays with 4k disks. Signed-off-by: Maksymilian Kunt <maksymilian.kunt@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-01-09 09:26:12 -05:00
Xiao Ni	4515fb28a5	Add detail information when can not connect monitor If it can't connect monitor, now the error message is just Error waiting for xxx to be clean. Add detail error message in connect_monitor. Suggested-by: Oleg Samarin <osamarin68@gmail.com> Signed-off-by: Xiao Ni <xni@redhat.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-01-09 07:50:00 -05:00
Alexey Obitotskiy	d3c1141658	imsm: count arrays under VMD HBAs correctly OROM defines maximum number of arrays supported. On array creation mdadm checks if number of arrays doesn't exceed that limit, however it is not calculated correctly for VMD now. The current code performs a lookup of HBA using the id. VMD HBAs have the same id so each lookup returns the same structure (first encountered). Take a different approach for VMD HBAs. As id is not unique and cannot be used for lookups, iterate over all VMD HBAs and compare both id and HBA path. Signed-off-by: Alexey Obitotskiy <aleksey.obitotskiy@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@gmail.com>	2017-01-09 07:48:55 -05:00
Alexey Obitotskiy	30730cb8c4	Don't assume VMD sysfs path ends with a disk entry When VMD is enabled but no drive is attached to the PCIe port, mdadm crashes trying to parse the path. Skip entry if valid path has not been returned. Do it early to avoid unnecessary memory allocation. Signed-off-by: Alexey Obitotskiy <aleksey.obitotskiy@intel.com> Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2017-01-04 08:26:50 -05:00
Pawel Baldysiak	466070adb8	IMSM: Fix signed/unsigned comparisons Prior to this patch there was an error during compiling on 32-bit arch. This patch fixes this issue. Reported-by: Thomas Backlund <tmb@mageia.org> Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2017-01-03 10:34:54 -05:00
Tomasz Majchrzak	5e46202e4a	imsm: enable bad block support for imsm metadata Enable bad block support for imsm metadata as commit e522751d605d ("seq_file: reset iterator to first record for zero offset") has been accepted in upstream kernel. Prior to that patch mdmon had not been able to read bad blocks sysfs file. Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-29 14:28:36 -05:00
Pawel Baldysiak	1e9b2c3fb7	IMSM: Do not update metadata if not able to migrate This patch prevents mdadm from updating metadata if migration is not possible. The same check is done in analyse_change(), but in that place - metadata is already modified. Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-22 12:20:13 -05:00
NeilBrown	c07566f14c	Make get_component_size() work with named array. get_component_size() still assumes that all array are /sys/block/md%d or /sys/block/md_d%d and so doesn't work with e.g. /sys/block/md_foo. This cause "mdadm --detail" to report Used Dev Size : unknown and causes problems when added spares and in other circumstances. So change it to use stat2devnm() which does the right thing with all types of array names. Reported-and-tested-by: Robert LeBlanc <robert@leblancnet.us> Signed-off-by: NeilBrown <neilb@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-22 12:19:10 -05:00
Song Liu	6df5d1833f	mdadm: add test case for raid5 write back cache This test cases checks data integrity of raid5 write back cache under various scenarios: degraded mode, non-overwrite, raid-5/6. Signed-off-by: Song Liu <songliubraving@fb.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-19 10:06:26 -05:00
Mariusz Dabrowski	31208db97e	Always return last partition end address in 512B blocks For 4K disks 'endofpart' is an index of the last 4K sector used by partition. mdadm is using number of 512-byte sectors, so value returned by get_last_partition_end must be multiplied by 8 for devices with 4K sectors. Also, unused 'ret' variable has been removed. Signed-off-by: Mariusz Dabrowski <mariusz.dabrowski@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-13 09:09:25 -05:00
Mariusz Dabrowski	41b06495ba	Use disk sector size value to set offset for reading GPT mdadm is using invalid byte-offset while reading GPT header to get partition info (size, first sector, last sector etc.). Now this offset is hardcoded to 512 bytes and it is not valid for disks with sector size different than 512 bytes because MBR and GPT headers are aligned to LBA, so valid offset for 4k drives is 4096 bytes. Signed-off-by: Mariusz Dabrowski <mariusz.dabrowski@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-12 14:26:22 -05:00
Mariusz Dabrowski	b7d81a38c7	imsm: set generation number when reading superblock IMSM doesn't set 'events' field with generation number, so sometimes mdadm tries to re-assembly container using metadata which isn't most recent (e. g. from spare disk). Signed-off-by: Mariusz Dabrowski <mariusz.dabrowski@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-12 14:25:03 -05:00
Pawel Baldysiak	32716c51d9	IMSM: Add support for Non-Intel NVMe drives under VMD This patch adds checking if platform (preOS) supports non-Intel NVMe drives under VMD domain, and - if so - allow creating IMSM Raid Volume with those drives. Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-12 14:23:39 -05:00
NeilBrown	f71d2b8f4b	mdopen: open md devices O_RDONLY There is no need to request write access when opening the md device, as we never write to it, and none of the ioctls we use require write access. If we do open with write access, then when we close, udev notices that the device was closed after being open for write access, and it generates a CHANGE event. This is generally unwanted, and particularly problematic when mdadm is trying to --stop the array, as the CHANGE event can cause the array to be re-opened before it completely closed, which results in a new mddev being allocated. So just use O_RDONLY instead of O_RDWR. Reported-by: Marc Smith <marc.smith@mcc.edu> Signed-off-by: NeilBrown <neilb@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-05 09:26:22 -05:00
Tomasz Majchrzak	e4467bc730	imsm: 4kn support for bad block log Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-02 11:03:13 -05:00
Tomasz Majchrzak	27156a57dd	imsm: implement "--examine-badblocks" command Implement "--examine-badblocks" command to provide list of bad blocks in metadata for a disk. Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-02 11:02:48 -05:00
Tomasz Majchrzak	928f142438	imsm: provide list of bad blocks for an array Provide list of bad blocks using memory allocated in advance so it's safe to call it from monitor. Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-02 11:02:41 -05:00
Tomasz Majchrzak	4c9e8c1ea6	imsm: clear bad blocks if disk becomes unavailable If a disk fails or goes missing, clear the bad blocks associated with it from metadata. If necessary, update disk ordinals. Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-02 11:02:35 -05:00
Tomasz Majchrzak	c07a5a4f41	imsm: clear bad block from bad block log Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-02 11:02:25 -05:00
Tomasz Majchrzak	6f50473f25	imsm: record new bad block in bad block log Check for a duplicate first or try to merge it with existing bad block. If block range exceeds BBM_LOG_MAX_LBA_ENTRY_VAL (256) blocks, it must be split into multiple ranges. Fail if maximum number of bad blocks has been already reached. Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-02 11:02:24 -05:00
Tomasz Majchrzak	b12796be0c	imsm: give md list of known bad blocks on startup On create set bad block support flag for each drive. On assmble also provide a list of known bad blocks. Bad blocks are stored in metadata per disk so they have to be checked against volume boundaries beforehand. Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-02 11:02:17 -05:00
Tomasz Majchrzak	bbab0940fa	imsm: write bad block log on metadata sync Pre-allocate memory for largest possible bad block section when monitor is being opened to avoid a need for memory allocation on metadata sync. If memory for a structure has been allocated in mpb buffer but it hasn't been used yet, it will be taken by next buffer grow, leading to insufficient memory on metadata flush. Start tracking such memory and take it into calculation when growing a buffer. Also assert has been added to debug mode to warn when more metadata has been written than memory allocated. Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-02 11:02:11 -05:00
Tomasz Majchrzak	8d67477fd1	imsm: parse bad block log in metadata on startup Always allocate memory for all log entries to avoid a need for memory allocation when monitor requests to record a bad block. Also some extra checks added to make static code analyzer happy. Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-12-02 11:02:09 -05:00
NeilBrown	e22fe3ae15	Introduce enum flag_mode for setting and clearing flags. We currently use '1' to indicate that a flag (writemostly or failfast) needs to be set, and '2' to indicate that it needs to be cleared. Using magic number like this is not a best-practice. So replaced them with values from a enum. No functional change. Signed-off-by: NeilBrown <neilb@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-11-29 17:12:13 -05:00

1 2 3 4 5 ...

3360 Commits All Branches Search

3360 Commits

All Branches