mdadm

Commit Graph

Author	SHA1	Message	Date
Tomasz Majchrzak	1ab97c976b	mdmon: bad block support for external metadata - store bad blocks If md has changed the state to 'blocked' and metadata handler supports bad blocks, try process them first. If metadata handler has successfully stored bad block, acknowledge it to md via 'badblocks' sysfs file. If metadata handler has failed to store the new bad block (ie. lack of space), remove bad block support for a disk by writing "-external_bbl" to state sysfs file. If all bad blocks have been acknowledged, request to unblock the array. Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Acked-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-11-28 17:48:59 -05:00
Tomasz Majchrzak	6dc1785fdb	mdmon: bad block support for external metadata - sysfs file open Open 'badblocks' and 'unacknowledged_bad_blocks' sysfs files for each disk in the array. Add them to the list of files observed by monitor. Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Reviewed-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-11-28 17:45:56 -05:00
Tomasz Majchrzak	bb758ccad0	mdadm: bad block support for external metadata - initialization If metadata handler provides support for bad blocks, tell md by writing 'external_bbl' to rdev state file (both on create and assemble), followed by a list of known bad blocks written via sysfs 'bad_blocks' file. Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Reviewed-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-11-28 17:44:45 -05:00
Pawel Baldysiak	06fb291ac1	IMSM: Update num_data_stripes during migration This patch adds updating num_data_stripes during reshape. Previously this field, once set during creation, was never updated. Also, the num_data_stripes value multiplied by chunk_size is used to set the proper component size for RAID5. Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com> Signed-off-by: Maksymilian Kunt <maksymilian.kunt@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-11-28 17:41:54 -05:00
NeilBrown	71574efb07	Add failfast support. Allow per-device "failfast" flag to be set when creating an array or adding devices to an array. When re-adding a device which had the failfast flag, it can be removed using --nofailfast. failfast status is printed in --detail and --examine output. Signed-off-by: NeilBrown <neilb@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-11-28 08:50:36 -05:00
Tomasz Majchrzak	cf52eff58a	Increase buffer for sysfs disk state Bad block support has incremented sysfs disk state reported by kernel ("external_bbl") so it became longer than 20 bytes. It causes reshape to fail as it reads truncated entry from sysfs. Increase buffer so it can accommodate the string including all state values currently implemented in kernel at the same time. Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-11-17 09:46:42 -05:00
Tomasz Majchrzak	bbb52f2b1d	Increase buffer for sysfs path 'unacknowledged_bad_blocks' is a long name for sysfs property and it makes sysfs path over 50 characters long. Increase buffer to the double length of the longest path available in sysfs at the moment. Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-11-17 09:43:44 -05:00
Pawel Baldysiak	de44e46fd4	IMSM: 4Kn drives support - adapt general migration record Convert general migration record for 4Kn drives prior to write and post read. Calculate record location based on sector size, don't just assume it's 512. Assure buffer address is aligned to 4096 so write operation avoids caching. Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com> Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-11-17 09:26:17 -05:00
Pawel Baldysiak	f36a9ecdb5	IMSM: Add support for 4Kn sector size drives This patch adds support for drives with 4Kn sector size for IMSM metadata. Mixing member drives with 4Kn and 512 is not allowed. Some offsets were aligned with sector size. Internal metadata representation and all calculations are still based on 512-byte sector sizes. This implementation converts only sector based values when reading/writing to drive, because they need to be stored in metadata according to the actual member drive sector size. Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com> Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-11-17 09:24:59 -05:00
Pawel Baldysiak	fa7bb6f8fd	IMSM: Read and store device sector size This patch adds retrieving the device sector size at startup and sets it in intel_super, so it can be used in other places. Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-11-17 09:24:35 -05:00
Pawel Baldysiak	329715091c	Add function for getting member drive sector size This patch introduces the function for getting sector size of given device (fd). Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-11-17 09:24:18 -05:00
Artur Paszkiewicz	1b7eb672f7	super1: fix setting bad block log offset in write_init_super1() Commit `f79bbf4f69` ("super1: don't put the bblog at the end of the free space.") changed the location of the bad block log to be after the write-intent bitmap, but a fixed offset was used and it can make bbl overlap with the bitmap, especially when using a small bitmap chunk. This patch changes it to use the actual offset and size of the bitmap. It also joins the cases for v1.1 and v1.2 superblock because the code was very similar. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-11-16 09:58:19 -05:00
Artur Paszkiewicz	561ad5597b	super1: make internal bitmap size calculations more consistent Determining internal bitmap size is performed using two different functions (bitmap_sectors() and calc_bitmap_size()) and in getinfo_super1() it is calculated in yet another way. Each of these methods give slightly different results. The most accurate is calc_bitmap_size() but it also has a rounding issue. So: - fix the rounding issue in calc_bitmap_size() using bitmap_bits() - replace usages of bitmap_sectors() and open-coded calculations with calc_bitmap_size() - remove bitmap_sectors() - move bitmap_bits() to mdadm.h as inline - otherwise mdassemble won't compile (it does not use bitmap.c) Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-11-16 09:56:39 -05:00
Pawel Baldysiak	52a9408561	Lib.c: Fix geting devname for devices with long path In scenario where VMD is enabled, and "x8" type of NVMe drive is plugged into PCIe switch - the path will be longer than 200 chars (additional VMD domain + 2 level of PCIe switches). This patch makes the buffer big enough to handle this kind of configurations. Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-10-26 12:03:25 -04:00
Pawel Baldysiak	07cb1e57e0	IMSM: Enable spanning between VMD domains Each VMD domain adds additional PCI domain. This patch enables RAID creation with NVMe drives from different VMD domains. Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-10-26 12:02:47 -04:00
Pawel Baldysiak	20bee0f8db	IMSM: Add warning message when x8-type device is used This patch adds the warning message when x8-type device is used with IMSM metadata. x8 device is a special NVMe drive - two of them on a single PCIe card. This card could be a single point of failure for RAID levels different than RAID0. x8 devices have serial number ending with "-A/-B" or "-1/-2". Signed-off-by: Pawel Baldysiak <pawel.baldysiak@intel.com> Reviewed-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-10-26 12:01:51 -04:00
Tomasz Majchrzak	12fe93e913	imsm: load migration record from right disk Migration record is only stored on disks in first and second metadata slot. The function to load the record incorrectly passes disk slot as disk index. If rebuilt has taken place for a container, disk slot doesn't match disk index so it causes migration record to be read from a disk it has not been written to. As a result reshape operation fails. Signed-off-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-10-26 12:00:46 -04:00
Yilong Ren	e9eb82adb8	raid6check.c: fix "misleading-indentation" error To fix the following error info: root@vm-lkp-nex04-8G-7 /tmp/mdadm# make test cc -Wall -Werror -Wstrict-prototypes -Wextra -Wno-unused-parameter -ggdb -DSendmail=\""/usr/sbin/sendmail -t"\" -DCONFFILE=\"/etc/mdadm.conf\" -DCONFFILE2=\"/etc/mdadm/mdadm.conf\" -DMAP_DIR=\"/run/mdadm\" -DMAP_FILE=\"map\" -DMDMON_DIR=\"/run/mdadm\" -DFAILED_SLOTS_DIR=\"/run/mdadm/failed-slots\" -DNO_COROSYNC -DNO_DLM -DVERSION=\"3.4-43-g1dcee1c\" -DVERS_DATE="\"06th April 2016\"" -DUSE_PTHREADS -DBINDIR=\"/sbin\" -c -o raid6check.o raid6check.c raid6check.c: In function 'manual_repair': raid6check.c:267:4: error: this 'else' clause does not guard... [-Werror=misleading-indentation] else ^~~~ raid6check.c:269:5: note: ...this statement, but the latter is misleadingly indented as if it is guarded by the 'else' printf("Repairing D(%d) and P\n", failed_data); ^~~~~~ cc1: all warnings being treated as errors <builtin>: recipe for target 'raid6check.o' failed make: *** [raid6check.o] Error 1 root@vm-lkp-nex04-8G-7 /tmp/mdadm# Cc: NeilBrown <neilb@suse.com> Cc: linux-raid <linux-raid@vger.kernel.org> Cc: LKP <lkp@eclists.intel.com> Reviewed-by: Jes Sorensen <Jes.Sorensen@redhat.com> Signed-off-by: Yilong Ren <yilongx.ren@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-10-26 11:59:31 -04:00
James Clarke	8e2bca513e	Fix bus error when accessing MBR partition records Since the MBR layout only has partition records as 2-byte aligned, the 32-bit fields in them are not aligned. Thus, they cannot be accessed on some architectures (such as SPARC) by using a "struct MBR_part_record *" pointer, as the compiler can assume that the pointer is properly aligned. Instead, the records must be accessed by going through the MBR struct itself every time. Signed-off-by: James Clarke <jrtc27@jrtc27.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-10-19 12:38:02 -04:00
Jes Sorensen	089f9d795e	super-intel: Reduce excessive parenthesis abuse Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-10-19 12:31:00 -04:00
Mariusz Dabrowski	ddab63c7de	Allow level migration only for single-array container IMSM doesn't allow to change RAID level of array in container with two arrays but array count check is being done too late (after removing disks) and in some cases (e. g. RAID 0 and RAID 1 migrated to RAID 0) both arrays become degraded. This patch adds array count check before disks are being removed. Signed-off-by: Mariusz Dabrowski <mariusz.dabrowski@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-10-19 11:26:49 -04:00
Mariusz Dabrowski	2d2b0eb7b9	imsm: block chunk size change for RAID 10 Chunk size change of RAID 10 array fails because it is not supported but invalid values still are being written to metadata and array cannot be assembled after stop. Operation should be blocked before metadata update. Signed-off-by: Mariusz Dabrowski <mariusz.dabrowski@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-10-19 11:22:36 -04:00
Guoqing Jiang	119b66a473	super1: make write_bitmap1 compatible with previous mdadm versions For older mdadm version, v1.x metadata has different bitmap_offset, we can't ensure all the bitmaps are on a 4K boundary since writing 4K for bitmap could corrupt the superblock, and Anthony reported the bug about it at below link. https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=837964 So let's check about the alignment for bitmap_offset before set the boundary to 4096 unconditionally. Thanks for Neil's detailed explanation. Reported-by: Anthony DeRobertis <anthony@derobert.net> Fixes: `95a05b37e8` ("Create n bitmaps for clustered mode") Cc: Neil Brown <neilb@suse.com> Signed-off-by: Guoqing Jiang <gqjiang@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-10-19 11:21:15 -04:00
NeilBrown	681b7ae245	Fix some issues found by clang The clang compiler complained about each of these. The mdmon.h error will only affect 'far' RAID10 arrays using intel or DDF metadata, and there is no such thing. The mdopen.c will cause a problem if there are no free md device numbers in the first 512. That is fairly unlikely. The restripe.c error would only affect the 'test_stripe' command, and probably doesn't change its behaviour. The super-intel.c fix is purely cosmetic. Signed-off-by: NeilBrown <neilb@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-10-07 11:47:48 -04:00
Artur Paszkiewicz	21e9380b26	imsm: retrieve nvme serial from sysfs Don't rely on SCSI ioctl for reading NVMe serials - SCSI emulation for NVMe devices can be disabled in the kernel config. Instead, try to get a serial from /sys/block/nvme*/device/serial. If that fails for whatever reason (i.e. no such attribute in old kernels) - fall back to the SCSI method. This also moves some SCSI-specific code from imsm_read_serial() to scsi_get_serial(). Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Reviewed-by: Tomasz Majchrzak <tomasz.majchrzak@intel.com> Reviewed-by: Alexey Obitotskiy <aleksey.obitotskiy@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-10-07 11:18:32 -04:00
Mariusz Dabrowski	fa219dd26a	Fix RAID metadata check mdadm recognizes devices with partition table as part of an RAID array and invalid warning message is displayed. After this fix proper warning messages are being displayed for MBR/GPT disks and devices with RAID metadata. Signed-off-by: Mariusz Dabrowski <mariusz.dabrowski@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-09-22 11:35:02 -04:00
Artur Paszkiewicz	676e87a806	imsm: remove redundant characters from some error messages Fix the cases that produced messages like "mdadm: : The message". Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-09-16 09:50:50 -04:00
Artur Paszkiewicz	83ca7d4527	imsm: do not activate spares for uninitialized member arrays This fixes some issues when a member array is created with "missing" devices in a container that has more devices than used in the member array. Reported-by: Yi Zhang <yizhan@redhat.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-09-15 12:16:07 -04:00
Song Liu	474267015b	mdadm: fix a buffer overflow struct mdp_superblock_1.set_name is 32B long, but struct mdinfo.name is 33B long. So we need strncpy instead strcpy to avoid buffer overflow. Signed-off-by: Song Liu <songliubraving@fb.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-09-12 12:51:12 -04:00
Robert LeBlanc	bd1fd72e13	mdopen: Prevent overrunning the devname buffer when copying devnm into it for long md names. Linux allows for 32 character device names. When using the maximum size device name and also storing "/dev/", devname needs to be 37 character long to store the complete device name. i.e. "/dev/md_abcdefghijklmnopqrstuvwxyz12\0" Signed-off-by: Robert LeBlanc<robert@leblancnet.us> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-25 13:43:31 -04:00
Jes Sorensen	6e88b3b3e5	bitmap: Mark a number of local functions static Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-15 16:35:28 -04:00
Jes Sorensen	34996a5f89	bitmap: Handle errors when reading bitmap info for cluster nodes Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-15 16:21:33 -04:00
Jes Sorensen	9ca0de6241	bitmap: Simplify code for bitmap_file_open() By switching to open+fstat rather than stat+open the code can be simplified and avoid duplicating the open handling. Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-15 16:16:05 -04:00
Jes Sorensen	00fab7459a	super0: Clean up formatting in examine_super0() No functionality change - should be purely cosmetic Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-15 15:56:23 -04:00
Jes Sorensen	a8cb6604b6	super0: Fix spelling of 'version' in comment and fix formatting Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-15 15:49:59 -04:00
Jes Sorensen	055b766b1c	super0: Use random_uuid() in init_super0() This shaves another 80 bytes off the mdadm binary. Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-15 15:48:56 -04:00
Jes Sorensen	c5f71c2417	Introduce random_uuid() helper function This gets rid of 5 nearly identical copies of the same code, and reduces the binary size of mdadm by over 700 bytes on x86_64. Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-15 15:41:34 -04:00
Jes Sorensen	977d12d739	mdadm.h: Fix build problem against newer glibc Newer glibc requires direct include of sys/sysmacros.h in order to access makedev(). Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-15 11:30:39 -04:00
Song Liu	690e46c320	mdadm: put journal device in right place of --detail When there is failed HDDs, journal device showed in wrong place of --detail: Number Major Minor RaidDevice State 4 8 24 - journal /dev/sdb8 1 8 18 1 active sync /dev/sdb2 2 8 19 2 active sync /dev/sdb3 3 8 21 3 active sync /dev/sdb5 0 8 17 - faulty /dev/sdb1 This patch fixed the output as: Number Major Minor RaidDevice State - 0 0 0 removed 1 8 18 1 active sync /dev/sdb2 2 8 19 2 active sync /dev/sdb3 3 8 21 3 active sync /dev/sdb5 0 8 17 - faulty /dev/sdb1 4 8 24 - journal /dev/sdb8 Reported-by: Yi Zhang <yizhan@redhat.com> Signed-off-by: Song Liu <songliubraving@fb.com> Signed-off-by: Shaohua Li <shli@fb.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-12 10:58:58 -04:00
Song Liu	ff3c881f84	mdadm: add man page for --add-journal Add the following to man page: --add-journal Recreate journal for RAID-4/5/6 array that lost a journal device. In the current implementation, this command cannot add a journal to an array that had a failed journal. To avoid interrupting on-going write operation, --add-journal only works for arrays in Read-Only state. Reported-by: Yi Zhang <yizhan@redhat.com> Signed-off-by: Song Liu <songliubraving@fb.com> Signed-off-by: Shaohua Li <shli@fb.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-12 10:57:13 -04:00
Jes Sorensen	ad7ac9ac66	lib: Various coding style cleanups Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-11 16:01:00 -04:00
Jes Sorensen	781f7efbac	lib: Avoid if and return on the same line Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-11 15:53:29 -04:00
Jes Sorensen	36138e4e4b	sysfs: Avoid if and return on the same line Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-11 15:52:48 -04:00
Jes Sorensen	7eef9be219	super1: Avoid if and return on the same line Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-11 15:52:02 -04:00
Jes Sorensen	f1bbb5ff6d	restripe: Avoid if and return on the same line Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-11 15:51:00 -04:00
Jes Sorensen	9f0ad56be0	util: Never have if and return on the same line Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-11 15:48:47 -04:00
Jes Sorensen	421c6c047e	config: Various stylistic cleanups Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-11 15:48:09 -04:00
Jes Sorensen	6a674388f8	config: Use xcalloc() rather than xmalloc()+memset() Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-11 15:32:34 -04:00
Artur Paszkiewicz	c012223056	Incremental: don't try to load_container() for a subarray mdadm -IRs would exit with a non-zero status because of this. Reported-by: Xiao Ni <xni@redhat.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-09 10:57:15 -04:00
Zhilong Liu	e19a149c72	mdadm:add 'clustered' in typo prompt when specify wrong param for bitmap mdadm: 'clustered' bitmap has already supported, thus add the prompt if users specify wrong value for bitmap param. Signed-off-by: Zhilong Liu <zlliu@suse.com> Signed-off-by: Jes Sorensen <Jes.Sorensen@redhat.com>	2016-08-02 10:06:43 -04:00
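
As an illustration of the fix described in commit 474267015b ("mdadm: fix a buffer overflow"), the following is a minimal sketch of copying a 32-byte name field that may lack a terminating NUL into a 33-byte buffer. The struct names `demo_super` and `demo_info` and the helper `copy_set_name()` are hypothetical stand-ins invented for this example; only the 32-byte and 33-byte sizes and the switch from strcpy to strncpy come from the commit message. It is not the actual mdadm code.

```c
#include <stddef.h>
#include <string.h>

#define SET_NAME_LEN  32  /* size of the source name field, per the commit message */
#define INFO_NAME_LEN 33  /* size of the destination buffer, per the commit message */

/* Hypothetical stand-ins for the two structures named in the commit. */
struct demo_super {
	char set_name[SET_NAME_LEN];  /* may use all 32 bytes, with no trailing NUL */
};

struct demo_info {
	char name[INFO_NAME_LEN];
};

/*
 * Copy at most SET_NAME_LEN bytes and terminate explicitly.
 * strcpy() would keep reading past set_name when the field is full,
 * because it only stops at a NUL byte.
 */
static void copy_set_name(struct demo_info *info, const struct demo_super *sb)
{
	strncpy(info->name, sb->set_name, SET_NAME_LEN);
	info->name[SET_NAME_LEN] = '\0';
}
```

Calling `copy_set_name()` with a `set_name` whose 32 bytes are all non-NUL yields a 32-character string in `name`, with the terminator in the 33rd byte.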

3409 Commits
