mdadm

Author	SHA1	Message	Date
Dan Williams	27fd627414	imsm: show uuid in ->examine_super() ...and add "auto=md" to the brief output. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:43:56 -07:00
Dan Williams	9968e376a1	fname_as_uuid: print uuids msb first The sha1 routines store the uuids in little endian byte-order, so always print from msb to lsb. This allows imsm containers to be assembled with -As. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:26:51 -07:00
Dan Williams	695154b2e7	mdmon: periodically retry to create the socket If initial socket creation fails, EROFS, set a periodic alarm to wake up the manager and retry. Include a kernel patch that will wake us up if the mount flags are changed. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:52 -07:00
Dan Williams	1e4bc070a7	sysfs_open leaks devnum2devname() result Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:52 -07:00
Dan Williams	e6b9548dce	non-trivial warn_unused_result fix, prepare_update If an allocation fails in ->prepare_update we need to catch it in ->process_update. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:52 -07:00
Dan Williams	792449393d	non-trivial warn_unused_result fixes, activate_spare Both super-ddf and super-intel ignore memory allocation failures during ->activate_spare. Fix these up by cancelling the activation. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:52 -07:00
Dan Williams	175593bf28	non-trivial warn_unused_result fixes, write_init_super_ddf When a write fails just move on to the next disk. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:52 -07:00
Dan Williams	3d2c4fc7b6	trivial warn_unused_result squashing Made the mistake of recompiling the F9 mdadm rpm which has a patch to remove -Werror and add "-Wp,-D_FORTIFY_SOURCE -O2" which turns on lots of errors: config.c:568: warning: ignoring return value of asprintf Assemble.c:411: warning: ignoring return value of asprintf Assemble.c:413: warning: ignoring return value of asprintf super0.c:549: warning: ignoring return value of posix_memalign super0.c:742: warning: ignoring return value of posix_memalign super0.c:812: warning: ignoring return value of posix_memalign super1.c:692: warning: ignoring return value of posix_memalign super1.c:1039: warning: ignoring return value of posix_memalign super1.c:1155: warning: ignoring return value of posix_memalign super-ddf.c:508: warning: ignoring return value of posix_memalign super-ddf.c:645: warning: ignoring return value of posix_memalign super-ddf.c:696: warning: ignoring return value of posix_memalign super-ddf.c:715: warning: ignoring return value of posix_memalign super-ddf.c:1476: warning: ignoring return value of posix_memalign super-ddf.c:1603: warning: ignoring return value of posix_memalign super-ddf.c:1614: warning: ignoring return value of posix_memalign super-ddf.c:1842: warning: ignoring return value of posix_memalign super-ddf.c:2013: warning: ignoring return value of posix_memalign super-ddf.c:2140: warning: ignoring return value of write super-ddf.c:2143: warning: ignoring return value of write super-ddf.c:2147: warning: ignoring return value of write super-ddf.c:2150: warning: ignoring return value of write super-ddf.c:2162: warning: ignoring return value of write super-ddf.c:2169: warning: ignoring return value of write super-ddf.c:2172: warning: ignoring return value of write super-ddf.c:2176: warning: ignoring return value of write super-ddf.c:2181: warning: ignoring return value of write super-ddf.c:2686: warning: ignoring return value of posix_memalign super-ddf.c:2690: warning: ignoring return value of write super-ddf.c:3070: warning: ignoring return value of posix_memalign super-ddf.c:3254: warning: ignoring return value of posix_memalign bitmap.c:128: warning: ignoring return value of posix_memalign mdmon.c:94: warning: ignoring return value of write mdmon.c:221: warning: ignoring return value of pipe mdmon.c:327: warning: ignoring return value of write mdmon.c:330: warning: ignoring return value of chdir mdmon.c:335: warning: ignoring return value of dup monitor.c:415: warning: rv may be used uninitialized in this function ...some of these like the write() ones are not so trivial so save those fixes for the next patch. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:52 -07:00
Dan Williams	3f6efecc4c	imsm: determine failed indexes from the most up-to-date disk load_imsm_disk() currently notices if spares missed their activation update, but we allow a stale failed disk back in to the array because its serial number is clobbered in the most up-to-date disk. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	47ee5a4566	imsm: manage a list of missing disks If a drive is removed while mdmon is not running we need a way to identify what is missing and mark that disk as failed in the metadata. At ->load_super() time create a list of missing disks defined as a disk that is marked in-sync yet does not appear in super->disks. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	1ee1e9fc62	imsm: fix mpb_size calculation in write_super_imsm Spotted a thinko... raid devices are dynamically sized, disks are not. The space for disks is always mpb->num_disks * sizeof(struct imsm_disk). Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	f8f603f133	imsm: enable checkpointing of migration (resync/rebuild) When the array is shutdown, or when mdadm --wait-clean is called, any active resync process will be idled allowing mdmon to record the current resync position. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	7146ec6a1e	Extend --wait-clean to checkpoint resync Root file systems backed by external metadata arrays need to be explicitly checkpointed near the time the rootfs is marked readonly as userspace will not have an opportunity to react to the final shutdown of the array. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	0dd3ba30aa	--wait-clean: shorten timeout Set the safemode timeout to a small value to get the array marked clean as soon as possible. We don't write 'clean' directly as it may cause mdmon to miss a 'write-pending' event. Include a couple fixes to sysfs_set_safemode(): 1/ 0 pad the milliseconds field 2/ workaround input truncation in the kernel Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	593add1b56	monitor: protect against CONFIG_LBD=n md/resync_start reports different terminal values depending on kernel configuration (~0UL versus ~0ULL). Make detection of the resync-complete state more robust by comparing against array size. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	14e8215b1b	imsm: trust sector reservation from metadata On ich6r the option-rom appears to reserve only 432 sectors rather than the 418+4096 of newer implementations. For compatibility trust the metadata in these cases. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	2a24d7b696	sysfs: dprintf when we fail to write a sysfs file When arrays do not startup correctly it would be nice to know why. Need to move the dprintf definition to mdadm.h Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
Dan Williams	c92a2527e1	imsm: confirm raid10 layout, fix up handling raid10 failures 1/ near-2 indeed matches how the Windows driver lays out the data 2/ update imsm_check_degraded to check for rebuilding disks in the raid10 case Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:47 -07:00
Dan Williams	5c3db629a6	imsm: more serial handling fixups zero-initialize the serial buffer to handle cases where the response is less than MAX_RAID_SERIAL_LEN. Tested-by: Jacek Danecki <jacek.danecki@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 13:12:17 -07:00
NeilBrown	1c6cb603fa	Grow: Fix linear-growth when devices are not all the same size. If we add a device to a linear array which is a difference size to the other devices in the array then, for v1.x metadata, we need to make sure the size is correctly reflected in the superblock.	2008-10-15 14:34:18 +11:00
NeilBrown	e4965ef846	Improve reporting of layout for raid10. Showing e.g. near=1, far=2 for the 'far2' layout of raid10 is confusing even though there is a sense in which is it correct. Make it less confusing by only printing whichever number is not 1. If both are 1, make that clear too (i.e. no redundancy).	2008-10-13 16:15:18 +11:00
NeilBrown	2a528478c7	Manage: allow adding device that is just large enough to v1.x array. When adding a device to an array, we check that it is large enough. Currently the check makes sure there is also room for a reasonably sized bitmap. But if the array doesn't have a bitmap, then this test might be too restrictive. So when adding, only insist there is enough space for the current bitmap. When Creating, still require room for the standard sized bitmap. This resolved Debian Bug 500309	2008-10-13 16:15:16 +11:00
NeilBrown	c04d54461f	Updates version numbers for 3.0-devel1 release.	2008-09-18 17:27:49 +10:00
NeilBrown	04c0634e8f	Don't try to set_array_info when -I find new devices for an array. When -I get a new device for a container and tries to incrementally assemble the container array, it calls sysfs_set_array to create the array without first checking if it already exists. This produces unpleasant error messages. So check first. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 17:05:02 +10:00
NeilBrown	5775279572	Remove .sock file when removing .pid file for mdmon	2008-09-18 16:43:59 +10:00
NeilBrown	dbb44303d7	Add support for assembling specific subarrays. This normally isn't needed as --incremental does all the work. But it is needed to recognise member= and container= in mdadm.conf	2008-09-18 16:21:08 +10:00
NeilBrown	35ddc76dcb	Use common code to report MD_UUID for --detail --export As we need to be able to extract a UUID from any superblock for matching, use that as the MD_UUID as it will probably be used for array matching too.	2008-09-18 16:12:28 +10:00
NeilBrown	ff54de6e47	Report uuid in --detail --brief for ddf and intel The uuid is slightly fictitious but needed for array matching.	2008-09-18 16:11:40 +10:00
NeilBrown	d7288ddc3a	Use uuid as /dev name when assembling array of uncertain origin. If we aren't sure that the array belongs to 'this' host, use the uuid to choose a name to avoid any conflict.	2008-09-18 16:08:10 +10:00
NeilBrown	51006d8586	Add uuid support for super-intel. 'imsm' does not provide any real uuid, so we synthesise one from various stable bits of the superblock.	2008-09-18 16:07:32 +10:00
NeilBrown	9362c1c80c	Allow metadata handler to report that it doesn't record homehost. For now, this means that the lack of a homehost doesn't always prevent assembly. Soon we will allow assembly anyway, but have different messages if homehost isn't supported.	2008-09-18 16:06:41 +10:00
NeilBrown	ffcfc735a5	Don't allow spares when creating 'external' arrays. It is meaningless when creating the container, and for subarrays, the container is responsible for assigning spares. Also, don't do the 'spare' fiddle for raid5 as we cannot set up a spare at this point yet. Later maybe just create the array degraded and let the container sort it out.	2008-09-18 16:03:08 +10:00
NeilBrown	c5afc314e2	Lots of fixes to make incremental assembly of containers work. So: mdadm -I /dev/whatever will (if appropriate) add whatever to a container, then start any arrays inside the container.	2008-09-18 16:03:05 +10:00
NeilBrown	352452c364	Handle incremental assembly of containers. mdadm -I /dev/part-of-container should add that to a container, creating if it needed, and then try to assemble any arrays in the container. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 16:01:57 +10:00
NeilBrown	f35f252592	Move calls to SET_ARRAY_INFO to common helper. When we assemble an array, there are three different approaches depending on whether metadata is internal or external, and on kernel version. Move all this to a common helper instead of duplicating in 3 places. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 16:01:55 +10:00
NeilBrown	7801ac2092	Factor out add-disk code The variety of approaches to 'add_disk' are factored out into a separate function, and Incremental mode benefits by being closer to supporting the assembly of containers. Also remove the adding-to-array-data-structure out of sysfs_add_disk and into add_disk. And add some tests for --incremental mode to make sure we don't break it. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 15:13:32 +10:00
NeilBrown	9b2a22d319	Ignore leading zeros in version number information. --detail sometimes generates leading zero which are just noise.	2008-09-18 15:07:45 +10:00
NeilBrown	7b187ed7e9	Allow --config in --incremental mode.	2008-09-18 15:05:46 +10:00
NeilBrown	c69b251bc7	Teach --detail about containers and members there-of. Make --detail on a container more useful by suppressing irrelevant detail and adding useful detail like a list of member arrays. Ditto for members of a container: report the name of the container array. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 15:05:20 +10:00
NeilBrown	0e60042683	Compile fixes, particularly moving more stuff under MDASSEMBLE Now 'make everything' works again.	2008-09-18 15:04:47 +10:00
NeilBrown	1cccd683f3	Disable compilation with diet-libc We need posix_memalign (or something similar) which diet-libc does not provide.	2008-09-18 14:33:37 +10:00
NeilBrown	a8473e68c7	Fix compile warning/error. gcc said: error: large integer implicitly truncated to unsigned type Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 14:10:42 +10:00
Dan Williams	295646b3d5	mdmon: recreate socket/pid file on SIGHUP Allow mdmon to start while /var/run/mdadm is readonly. Later a SIGHUP can trigger mdmon to drop its pid and socket once /var/run/mdadm is writable. Of course one needs the pid to send a HUP, that can be stored in a distribution specific rw-init directory... For now, rely on a killall -HUP mdmon to get the files dumped. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:43 -07:00
Dan Williams	313a4a82f1	ping_manager() to prevent 'add' before 'remove' completes It is currently possible to remove a device and re-add it without the manager noticing, i.e. without detecting a mdstat->devcnt container->devcnt mismatch. Introduce ping_manager() to arrange for mdmon to run manage_container() prior to mdadm dropping the exclusive open() on the container. Despite these precautions sysfs_read() may still fail. If this happens invalidate container->devcnt to ensure manage_container() runs at the next event. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:43 -07:00
Dan Williams	4795982e68	sysfs: detect disks that are in the process of being removed When removing a disk there is a window where the 'slot' attribute of md/dev-$name will return -EBUSY to read attempts. When this happens look at the the 'block' link, if it is removed then we can be sure the device has been removed, versus some other error. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:43 -07:00
Dan Williams	4065aa816a	monitor: clean up some debug messages Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:43 -07:00
Dan Williams	93f7cacab3	mdmon: resume rebuild If we started a degraded array that was previously rebuilding we may have enough information to resume the rebuild without a trip through the monitor. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:43 -07:00
Dan Williams	e553d2a458	imsm: allow a failed disk to be readded Allow the following sequence to rebuild the array mdadm --fail /dev/md/r1 /dev/disk mdadm --remove /dev/imsm /dev/disk mdadm --add /dev/imsm /dev/disk Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00
Dan Williams	1770662bca	'mdadm --wait-clean' wait for array to be marked clean For use in distro shutdown scripts with a RAID root file system. Returns immediately if the array is 'readonly', or not an externally managed array. It is up to the distro's scripts to make sure no new writes hit the device after this returns 'true'. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00
Dan Williams	c94709e83f	Add ping_monitor() to mdadm --wait The action we are waiting for may not be complete until the monitor has had a chance to take action on the result. The following script can now remove the device on the first attempt, versus a few attempts with the original Wait(): #!/bin/bash #export MDADM_NO_MDMON=1 export IMSM_DEVNAME_AS_SERIAL=1 ./mdadm -Ss ./mdadm --zero-superblock /dev/loop[0-3] echo 2 > /proc/sys/dev/raid/speed_limit_max ./mdadm --create /dev/imsm /dev/loop[0-3] -n 4 -e imsm -a md ./mdadm --create /dev/md/r1 /dev/loop[0-3] -n 4 -l 5 --force -a mdp ./mdadm --fail /dev/md/r1 /dev/loop3 ./mdadm --wait /dev/md/r1 x=0 while ! ./mdadm --remove /dev/imsm /dev/loop3 > /dev/null 2>&1 do x=$((x+1)) done echo "removed after $x attempts" ./mdadm --add /dev/imsm /dev/loop3 Include 2 small cleanups: * remove the almost open coded fd2devnum() in Wait() by introducing a new utility routine stat2devnum() * teach connect_monitor() to parse the container device from a subarray string Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00

... 5 6 7 8 9 ...

1029 Commits