mdadm

Author	SHA1	Message	Date
NeilBrown	d1d3482b56	config: add 'homehost' option to 'AUTO' line. This allows basing auto-assembly decisions on whether the array is recorded as belonging to this host or not. Signed-off-by: NeilBrown <neilb@suse.de>	2010-03-03 14:33:55 +11:00
Luca Berra	c132678b18	allow redefinition of VAR_RUN having mdmon socket under var is painful at shutdown time Signed-off-by: Luca Berra <bluca@comedia.it> Signed-off-by: NeilBrown <neilb@suse.de>	2010-03-03 12:23:30 +11:00
NeilBrown	b179246f4f	Assemble: Handle assembling from config file which is out of order. Currently "mdadm -As" will process the entries in the config file in order. If any array is a component or member of a preceding array, that array will not be assembled. So if there are any failures during assembly, retry those arrays, and look until everything is assembled, or nothing more can be assembled. Signed-off-by: NeilBrown <neilb@suse.de>	2010-02-24 11:16:56 +11:00
NeilBrown	58a4ba2a6b	mdmon: don't monitor /proc/mounts to decide when to create .pid file. Monitoring /proc/mounts and creating a .pid file as soon as /var/run is writable is racy. Most distros clean all non-directories from /var/run early in boot and if mdmon races with this it could lose the files as soon as they are created. Instead require that "mdmon --takeover" be run after /var is writable. Signed-off-by: NeilBrown <neilb@suse.de>	2010-02-08 17:26:18 +11:00
NeilBrown	5d4d1b26d3	mdmon: allow pid to be stored in different directory. /var/run probably doesn't persist from early boot. So if necessary, store in in /lib/init/rw or somewhere else that does persist. Signed-off-by: NeilBrown <neilb@suse.de>	2010-02-04 16:47:28 +11:00
NeilBrown	24f6f99b36	Having single function to read mdmon pid file. We don't need three. One (signal_mdmon) wasn't even being used. Signed-off-by: NeilBrown <neilb@suse.de>	2010-02-04 16:47:21 +11:00
NeilBrown	8409bc51e8	Merge branch 'klockwork' of git://github.com/djbw/mdadm Conflicts: super-intel.c	2009-12-30 13:46:52 +11:00
NeilBrown	c1e3ab8c1e	Merge branch 'master' of git://github.com/djbw/mdadm	2009-12-30 13:42:37 +11:00
Dan Williams	1e5c69836d	imsm: add support for checkpointing via 'curr_migr_unit' Unlike native md checkpointing some data about the geometry and type of the migration process is coded into curr_migr_unit. Provide logic to convert between md/{resync_start\|recovery_start} and imsm/curr_migr_unit. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-12-21 17:54:32 -07:00
Dan Williams	2904b26f05	Support external metadata recovery-resume Minimal changes needed to permit reassembling partially recovered external metadata arrays. The biggest logical change is that ->container_content() can now surface partially rebuilt members rather than omitting them from the disk list. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-12-21 12:51:57 -07:00
Dan Williams	d23534e464	Teach sysfs_add_disk() callers to use ->recovery_start versus 'insync' parameter Also fixup 'in_sync' versus 'insync' typo. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-12-21 11:26:21 -07:00
Dan Williams	b7528a20cc	Introduce MaxSector Replace occurrences of ~0ULL to make it clear we are talking about maximal resync/recovery position. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-12-21 10:23:26 -07:00
Dan Williams	e1516be1db	Add scaffolding for handling md/dev-XXX/recovery_start Prepare the code to handle saving a recovery checkpoint. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-12-21 10:06:14 -07:00
Artur Wojcik	33a6535d00	Fix required to enable RAID arrays on SAS disks. The patch increases the capacity of buffers used to store sysfs path names. Originally the buffers were too small to hold the canonical representation of sysfs path (in case of a SAS device, especially a device installed behind an expander). Signed-off-by: Artur Wojcik <artur.wojcik@intel.com> Reviewed-by: Andre Noll <maan@systemlinux.org> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-12-10 12:03:40 -07:00
Trela, Maciej	034b203a47	Check partition tables when creating array. When creating an array, check if the devices have partition tables and print a warning if the table or the partitions might be destroyed by array creation. Signed-off-by: NeilBrown <neilb@suse.de>	2009-12-08 16:07:47 +11:00
NeilBrown	9277cc7752	Various fixes for --kill - When --kill-superblock is used with --metadata, find every different superblock if there are several and kill them all. - When creating a new array, kill off any old metadata. The code to do this was already present but has become broken over time. Signed-off-by: NeilBrown <neilb@suse.de>	2009-11-24 16:32:01 +11:00
NeilBrown	4a997737a1	Merge branch 'master' into devel-3.1	2009-10-22 11:13:13 +11:00
NeilBrown	ea0ebe9685	Assemble: print more verbose messages about restarting a reshape Signed-off-by: NeilBrown <neilb@suse.de>	2009-10-20 16:23:45 +11:00
Zdenek Behan	9a36a9b713	Monitor: add option to specify rebuild increments ie. the percent increments after which RebuildNN event is generated This is particulary useful when using --program option, rather than (only) syslog for alerts. Signed-off-by: Zdenek Behan <rain@matfyz.cz> Signed-off-by: NeilBrown <neilb@suse.de>	2009-10-19 13:13:58 +11:00
Dan Williams	9f1da82421	mdmon: preserve socket over chroot Connect to the monitor in the old namespace and use that connection for WaitClean requests when stopping the victim mdmon instance. This allows ping_monitor() to work post chroot(). Cc: Hans de Goede <hdegoede@redhat.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-10-13 17:41:58 -07:00
Dan Williams	aae5a11207	Detail: export MD_UUID from mapfile The load_super() from an mdadm --detail call may race against an mdmon update. When this happens the load_super sees an inconsistent metadata block and returns an error. The fallback path to use the map file contents lacks uuid reporting, so provide __fname_from_uuid for generically printing a uuid. Reported-by: Hans de Goede <hdegoede@redhat.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-10-13 17:41:57 -07:00
Dan Williams	6e46bf344b	imsm: add --update=uuid support When disks have conflicting container memberships (same container ids but incompatible member arrays) --update=uuid can be used to move offenders to a new container id by changing 'orig_family_num'. Note that this only supports random updates of the uuid as the actual uuid is synthesized. We also need to communicate the new 'orig_family_num' value to all disks involved in the update. A new field 'update_private' is added to struct mdinfo to allow this information to be transmitted. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-10-13 17:41:53 -07:00
NeilBrown	ca4f89a3b7	Merge branch 'master' into devel-3.1 Conflicts: mdadm.8	2009-10-01 16:58:40 +10:00
NeilBrown	e9e43ec367	Grow: support restart of new migrations.	2009-08-13 11:12:54 +10:00
NeilBrown	7236ee7ad4	Handle extra 'grow' variations. UNFINISHED	2009-08-11 13:02:49 +10:00
NeilBrown	4737ae25de	Exmaine/brief: put member arrays after container arrays. A previous patch moved move the '--examine --brief' reporting of member arrays to before their containers. This breaks "mdadm -As" assembly. So put them back, but still fix the problem addressed by previous patch. Signed-off-by: NeilBrown <neilb@suse.de>	2009-08-07 14:17:40 +10:00
Dan Williams	148acb7baa	imsm: fix family number handling The family_number field can change. The option-rom will change the family number when it starts a rebuild process (flags a container for rebuild). This was not seen previously as mdadm would usually start the rebuild process, preserving the family number. This is the mechanism that helps to prevent a prodigal array member from being returned to its original system and cause a rebuild to go in the wrong direction. With the change we will end up with a container that will fail to assemble unless the device with the incompatible family number is left out of the assembly. So, take several actions: 1/ Convert uuid generation to use orig_family_num, being careful to preserve the existing uuid in the case where orig_family_num is not set (i.e. previous mdadm created imsm arrays) 2/ Set orig_family_num at Create. For arrays created by mdadm prior to this release orig_family_num will be zero, so set it to family_num at the first metadata write. 3/ Add checks for orig_family_num to compare_super_imsm 4/ Update the family number when initiating rebuild 5/ The option-rom mixes some random data into the family number, add this functionality to the mdadm implementation. Reported-by: Marcin Labun <marcin.labun@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-07-31 17:11:41 -07:00
NeilBrown	a628848379	restripe: support saving when not all devices are present.	2009-07-14 15:12:30 +10:00
NeilBrown	19678e536d	Grow: pass layout as a string rather than a number. This allows the layout to be parsed after the current level of the array is know, so that the level doesn't need to be given (otherwise pointlessly) on the command line. Signed-off-by: NeilBrown <neilb@suse.de>	2009-07-14 12:13:29 +10:00
NeilBrown	d823a6c872	Remove Manage_reconfing in favour of Grow_reshape Bother Manage_reconfig and Grow_reshape provide for changing the 'layout' of a faulty array. This is no necessary. So discard Manage_reconfig and just use Grow_reshape Signed-off-by: NeilBrown <neilb@suse.de>	2009-07-14 12:11:31 +10:00
NeilBrown	4a06e2c270	main: factor out code to parse layout for raid10 and faulty. This will soon be called from multiple places. Signed-off-by: NeilBrown <neilb@suse.de>	2009-07-14 11:29:20 +10:00
NeilBrown	84e11361aa	Grow: support --array-size changes With 2.6.30 it is possible to tell the md driver to clip an array to a size smaller than the real size of the array. This option gives access to that feature. The size change does not persist across restarts. Signed-off-by: NeilBrown <neilb@suse.de>	2009-07-13 15:00:02 +10:00
NeilBrown	e736b62389	Update copyright dates and remove references to @cse.unsw.edu.au Also removed 'paper' addresses. Signed-off-by: NeilBrown <neilb@suse.de>	2009-06-02 14:35:45 +10:00
NeilBrown	8320878543	Merge branch 'master' into devel-3.0 Conflicts: Build.c mdadm.c mdadm.h super1.c	2009-05-11 16:05:41 +10:00
NeilBrown	360b463696	mapfile - when rebuilding, choose an appropriate name is none is found. When rebuilding the mapfile (mdadm -Ir), if not appropriate name is found in /dev/md/, try to find an appropriate name, either by looking in mdadm.conf or by using the name in the metadata. Signed-off-by: NeilBrown <neilb@suse.de>	2009-05-11 15:58:42 +10:00
NeilBrown	0ac91628b9	Allow homehost to be largely ignored when assembling arrays. If mdadm.conf contains HOMEHOST <ignore> or commandline contains --homehost=<ignore> then the check that array metadata mentions the given homehost is replace by a check that the name recorded in the metadata is not already used by some other array mentioned in mdadm.conf. This allows more arrays to use their native name rather than having an _NN suffix added. This should only be used during boot time if all arrays required for normal boot are listed in mdadm.conf. If auto-assembly is used to find all array during boot, then the HOMEHOST feature should be used to ensure there is no room for confusion in choosing array names, and so it should not be set to <ignore>. Signed-off-by: NeilBrown <neilb@suse.de>	2009-05-11 15:46:46 +10:00
NeilBrown	061f2c6abd	Make --brief even briefer. Because ---examine --brief, or --detail --brief are often used to create mdadm.conf, and because people don't want to have to update their mdadm.conf unnecessarily, we don't want to include information that might change. And now that level changing is supported, that is almost everything but UUID. So move some more fields into the "Only print with --verbose" class. Signed-off-by: NeilBrown <neilb@suse.de>	2009-05-11 15:18:20 +10:00
NeilBrown	31015d5798	conf/assemble: new config line "auto". The line 'auto' in mdadm.conf can be used to disable assembly of specific metadata types, or of all arrays. This does not affect assembly of arrays listed in mdadm.conf or on command line. auto -all will disable all auto-assembly. auto -ddf will cause mdadm to ignore ddf arrays that are not explicitly mentioned, and auto assemble anything else it finds. Signed-off-by: NeilBrown <neilb@suse.de>	2009-05-11 15:17:33 +10:00
Paul Clements	25affb56b9	mdadm: allow build to use --size This patch enables the --size parameter for build operations. Without this, if you have a raid1, for instance, where the 2 disks are not the exact same size, and you need to build the array but one of the disks is not available right at the moment (maybe it's USB and it's unplugged, or maybe it's a network disk and it's unavailable), then you have to play some weird games to get the array to size correctly (that is, to the size of the smaller of the two components or less). There may be other uses for this too... -- Paul Signed-off-by: NeilBrown <neilb@suse.de>	2009-04-21 15:36:13 +10:00
NeilBrown	c256924e52	Merge branch 'master' of git://github.com/djbw/mdadm into devel-3.0 Conflicts: Grow.c mdadm.h sysfs.c Due to independent fixes for the "mdadm hangs if reshape finishes too quickly" problem.	2009-04-14 11:11:14 +10:00
NeilBrown	462906cdee	incremental_container: preserve 'in_sync' flag when adding to existing array. When building container members with -IR, we need to ensure that devices added to an active array preserve the 'in_sync' status so they don't needlessly get rebuilt. So allow sysfs_add_disk to do this (only works in kernels since 2.6.30) and pass the relevant flag down. Signed-off-by: NeilBrown <neilb@suse.de>	2009-04-14 10:19:02 +10:00
Dan Williams	48924014b0	Grow: fix hang when reshape completes too fast For short reshapes the kernel may be done before mdadm can check that progress has passed the critical section. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-04-12 00:58:28 -07:00
Dan Williams	da9b4a62af	imsm: set array size at Create/Assemble imsm arrays round down the effective array size to the closest 1 megabyte boundary so teach get_info_super_imsm and sysfs_set_array to set 'md/array_size' if available (and make sure ddf uses the default size). Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-04-12 00:58:28 -07:00
NeilBrown	a7c6e3fb24	wait_for improvement. wait not only for the name to appear, but for it to refer to the correct device. Sometimes old symlinks left lying around can be confusing. Signed-off-by: NeilBrown <neilb@suse.de>	2009-04-07 17:34:38 +10:00
NeilBrown	93ecfa01d4	grow: don't wait forever for critical section to pass. If an array reshape completed within 1 second, then --grow will not notice that it has finished and will keep waiting for the critical section to pass. So be more cautious in the test. Signed-off-by: NeilBrown <neilb@suse.de>	2009-04-01 12:26:08 +11:00
NeilBrown	b640a252ee	Support new raid6 layouts needed for DDF DDF raid6 layouts are subtly different from the standard 'md' layouts. From 2.6.30 the kernel knows about these. Teach mdadm about them, and also allow 'ddf' to set an appropriate default. Signed-off-by: NeilBrown <neilb@suse.de>	2009-03-09 11:16:53 +11:00
Dan Williams	dab4a5134e	sysfs: allow sysfs_read to detect and drop removed disks All operations that rely on loading from an existing container (like --add) will fail after a disk has been removed. Provide an option to skip missing / offline disks rather than abort. We attempt to do this in the load_super_{imsm,ddf}_all cases when mdmon is running i.e. we already have a consitent version of the metadata running in the system. Otherwise, we fail as normal and let the administrator fix up the container. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-02-24 18:45:56 -07:00
NeilBrown	6c40598f59	Merge branch 'master' into devel-3.0	2009-02-02 11:09:09 +11:00
Dustin Kirkland	089485cbe4	Typo in earlier patch : asprintf -> vasprintf Signed-off-by: NeilBrown <neilb@suse.de>	2009-02-02 10:54:23 +11:00
Bernhard Reutner-Fischer	2df1f26911	mdadm fix compilation for uClibc 2008-12-08 Bernhard Reutner-Fischer <rep.dot.nop@gmail.com> * Makefile (dadm.uclibc): Remove misspelled and unneeded rule. * md5.h: Include stdint.h for uClibc. * mdadm.h: uClibc defines __UCLIBC__. If uClibc has LFS off then use lseek instead of lseek64. Signed-off-by: Bernhard Reutner-Fischer <rep.dot.nop@gmail.com>	2009-02-02 09:53:51 +11:00
Dan Williams	5615172f1d	Create: warn when a metadata format's platform components are missing If the metadata handler can not find its platform support components then there is no way for it to verify that the raid configuration will be supported by the option-rom. Provide a generic method for metadata handlers to warn the user that the array they are about to create may not work as intended with a given platform. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-01-20 01:36:51 -07:00
Dan Williams	a18a888ea7	Create: allow per-metadata default layouts Let handlers specifiy their own defaults, specifically needed for the imsm-raid5 case where mdadm defaults to 'ls' and imsm to 'la'. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2009-01-20 01:36:50 -07:00
NeilBrown	78fbcc1031	Merge branch 'master' into scratch-3.0 Conflicts: Assemble.c config.c	2009-01-08 09:31:28 +11:00
Dustin Kirkland	1a0ee0baf0	Fail overtly when asprintf fails to allocate memory .. rather that causing a less-obvious violation of segments. Signed-off-by: NeilBrown <neilb@suse.de>	2009-01-08 09:25:33 +11:00
NeilBrown	45b662b611	Merge branch 'devel' of git://git.kernel.org/pub/scm/linux/kernel/git/djbw/mdadm into devel-3.0	2008-12-18 16:58:25 +11:00
Dan Williams	4cce406959	introduce --detail-platform to display platform raid capabilities Metadata formats like imsm work in concert with platform firmware and hardware, so provide a way for mdadm to display this info to the user. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-12-08 16:59:18 -07:00
NeilBrown	8592f29d64	Create: support autolayout when creating in a DDF If, when creating an array, a signal target device is given which is a container, then allow the metadata handler to choose which devices to use. This is currently only supported for DDF. Signed-off-by: NeilBrown <neilb@suse.de>	2008-12-04 16:08:33 +11:00
NeilBrown	e46273ebe4	Change 'size' argument to validate_geometry to be sectors, not K That way it is the same a *freesize, and generally less confusing. Signed-off-by: NeilBrown <neilb@suse.de>	2008-12-04 15:47:57 +11:00
Dan Williams	f20c396836	allow add_to_super to return errors Prepare add_to_super to validate disks against the platform capabilities Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-11-27 15:30:39 +11:00
NeilBrown	e8a70c8958	mdmon: pass symbolic name to mdmon instead of device name. Now that names in /dev are usually created (eventually) by udev, it isn't really safe to rely in finding a name in /dev to pass to mdmon to identify which array to monitor. And it isn't really necessary to have a name in /dev. So just pass the symbolic name, e.g. md127 or md123. Change util.c to pass that name, and change mdmon to process the name sensibly. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-20 14:51:42 +11:00
NeilBrown	a714580e02	Wait for name to appear after create/assemble etc. We don't really want mdadm to exit until udev has created the names in /dev. So wait. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 21:56:42 +11:00
NeilBrown	195254b87a	mapfile: validate entries before they are returned. It is possible for the mapfile to become wrong, and that gets very confusing. So validate entries before returning them. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 21:56:42 +11:00
NeilBrown	f2e55eccfb	mdopen: use small sequence number for uniquifying array names. Rather than appending the md minor number, we now append a small sequence number to make sure name in /dev/md/ that aren't LOCAL are unique. As the map file is locked while we do this, we are sure of no losing any races. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 20:51:12 +11:00
NeilBrown	9008ed1c96	Assemble: allow members of containers to be assembled and auto-assembled. Try to treat members of containers much like other arrays for assembly. We still look through the list of devices for a match (it will be the container), then find the relevant 'info' and try to assemble the array. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 20:51:12 +11:00
Dan Williams	6234c63ccc	Assemble: factor out assemble_container_content Factor out, from Incremental_container, the code for assembling an array based on information extracted from a container. We will shortly use this from Assemble too. Signed-off-by: Dan Williams <dan.j.williams@intel.com> Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 20:51:11 +11:00
Dan Williams	ce744c97bc	Assemble: revert preliminary -As support I have seen the light. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-11-04 20:51:11 +11:00
NeilBrown	ad5bc697ad	Incremental: lock against multiple concurrent additions to an array. In two devices are added via -I to one array at the same time, mdadm can get badly confused. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 20:50:39 +11:00
NeilBrown	4ccad7b163	Manage: when stopping an array, delete all names from /dev. This only applies if udev isn't installed or is disabled by MDADM_NO_UDEV We try to remove partitions too. We find names to remove by looking in /var/run/mdadm/map Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 20:50:39 +11:00
NeilBrown	9759037678	Generate 'change' uevents when arrays change in non-obvious ways. When a 'container' gets started, we need udev to notice, but the kernel has no way of knowing that a KOBJ_CHANGE event is needed. So send one directly via the 'uevent' sysfs attribute. Also, uevents don't get generated when md arrays are stopped (prior to 2.6.28) so send 'change' events then too. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 20:50:39 +11:00
NeilBrown	1771a6e214	config: Support container=uuid as alternative to container=/dev/name in mdadm.conf When mdadm.conf is automatically generated, we might not know a suitable /dev/name. But we do know the uuid of the container. So allow that as an option. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 20:50:38 +11:00
NeilBrown	215bb3f776	Incremental: adjust to the new naming scheme. --incremental now uses exactly the same create_mddev that other code uses.	2008-11-04 20:50:38 +11:00
NeilBrown	69207ff6ac	mdopen: Introduce new rules for creating device name. MORE CONTENT HERE	2008-11-04 20:50:21 +11:00
NeilBrown	40ebbb9cfe	util: make env checking more generic Change the "env_check_mdmon" function to be more generic, accepting and environment variable name, as soon we will have a new use for it. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 10:35:43 +11:00
NeilBrown	7f91af49ad	Delay creation of array devices for assemble/build/create We will shortly be feeding more information into the process of creating array devices, so delay the creation. Still open them early if the device already exists. This involves making sure the autof flag is in the right place so that it can be found at creation time. Also, Assemble, Build, and Create now always close 'mdfd'. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 10:35:37 +11:00
NeilBrown	6be1d39d1d	Introduce new open_mddev which just does an open. Some cases we aren't interested in creating the mddev, just opening it. Make those more explicit. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 10:35:31 +11:00
NeilBrown	2399204ddd	Rename open_mddev to create_mddev This reflect that fact that more often than not it is creating things in /dev, and allows for a new open_mddev which does just that. Signed-off-by: NeilBrown <neilb@suse.de>	2008-11-04 10:35:10 +11:00
Dan Williams	71d60c480a	Preliminary -As support for container member arrays Given an mdadm.conf like the following allow /dev/imsm and /dev/md/r1 to be created by "mdadm -As". DEVICES partitions ARRAY /dev/imsm metadata=imsm auto=md UUID=b98f5dbe-aa859e7b-0e369b89-a80986d4 ARRAY /dev/md/r1 container=/dev/imsm member=0 auto=mdp UUID=3538e39c-b397c2e9-1aa031f9-2bc0eca4 spares=1 Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-28 10:55:31 -07:00
NeilBrown	b01b06bda8	Merge branch 'master' into devel-3.0 Conflicts: Create.c Manage.c	2008-10-27 10:10:08 +11:00
NeilBrown	b3d3195538	Allow WRITEMOSTLY to be cleared on --readd using --readwrite. Previously it was possible to set the WRITEMOSTLY flag when adding a device to an array, but not to clear the flag when re-adding. This is now possible with --readwrite. Signed-off-by: NeilBrown <neilb@suse.de>	2008-10-25 18:20:49 +11:00
NeilBrown	492350045c	Merge branch 'master' into devel-3.0 Conflicts: Manage.c	2008-10-17 12:46:23 +11:00
Dan Williams	27dec8fae3	quiet WaitClean() Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:43:57 -07:00
Dan Williams	36ba7d4849	Allow a uuid of all f's to always match The uuid returned for an imsm spare device will never match the uuid of an active disk. So make mdadm interpret a uuid of all f's as "match any". Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:43:57 -07:00
Dan Williams	2a24d7b696	sysfs: dprintf when we fail to write a sysfs file When arrays do not startup correctly it would be nice to know why. Need to move the dprintf definition to mdadm.h Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-10-15 14:15:51 -07:00
NeilBrown	e4965ef846	Improve reporting of layout for raid10. Showing e.g. near=1, far=2 for the 'far2' layout of raid10 is confusing even though there is a sense in which is it correct. Make it less confusing by only printing whichever number is not 1. If both are 1, make that clear too (i.e. no redundancy).	2008-10-13 16:15:18 +11:00
NeilBrown	2a528478c7	Manage: allow adding device that is just large enough to v1.x array. When adding a device to an array, we check that it is large enough. Currently the check makes sure there is also room for a reasonably sized bitmap. But if the array doesn't have a bitmap, then this test might be too restrictive. So when adding, only insist there is enough space for the current bitmap. When Creating, still require room for the standard sized bitmap. This resolved Debian Bug 500309	2008-10-13 16:15:16 +11:00
NeilBrown	dbb44303d7	Add support for assembling specific subarrays. This normally isn't needed as --incremental does all the work. But it is needed to recognise member= and container= in mdadm.conf	2008-09-18 16:21:08 +10:00
NeilBrown	ff54de6e47	Report uuid in --detail --brief for ddf and intel The uuid is slightly fictitious but needed for array matching.	2008-09-18 16:11:40 +10:00
NeilBrown	d7288ddc3a	Use uuid as /dev name when assembling array of uncertain origin. If we aren't sure that the array belongs to 'this' host, use the uuid to choose a name to avoid any conflict.	2008-09-18 16:08:10 +10:00
NeilBrown	9362c1c80c	Allow metadata handler to report that it doesn't record homehost. For now, this means that the lack of a homehost doesn't always prevent assembly. Soon we will allow assembly anyway, but have different messages if homehost isn't supported.	2008-09-18 16:06:41 +10:00
NeilBrown	352452c364	Handle incremental assembly of containers. mdadm -I /dev/part-of-container should add that to a container, creating if it needed, and then try to assemble any arrays in the container. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 16:01:57 +10:00
NeilBrown	f35f252592	Move calls to SET_ARRAY_INFO to common helper. When we assemble an array, there are three different approaches depending on whether metadata is internal or external, and on kernel version. Move all this to a common helper instead of duplicating in 3 places. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 16:01:55 +10:00
NeilBrown	7801ac2092	Factor out add-disk code The variety of approaches to 'add_disk' are factored out into a separate function, and Incremental mode benefits by being closer to supporting the assembly of containers. Also remove the adding-to-array-data-structure out of sysfs_add_disk and into add_disk. And add some tests for --incremental mode to make sure we don't break it. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 15:13:32 +10:00
NeilBrown	c69b251bc7	Teach --detail about containers and members there-of. Make --detail on a container more useful by suppressing irrelevant detail and adding useful detail like a list of member arrays. Ditto for members of a container: report the name of the container array. Signed-off-by: NeilBrown <neilb@suse.de>	2008-09-18 15:05:20 +10:00
Dan Williams	1770662bca	'mdadm --wait-clean' wait for array to be marked clean For use in distro shutdown scripts with a RAID root file system. Returns immediately if the array is 'readonly', or not an externally managed array. It is up to the distro's scripts to make sure no new writes hit the device after this returns 'true'. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00
Dan Williams	c94709e83f	Add ping_monitor() to mdadm --wait The action we are waiting for may not be complete until the monitor has had a chance to take action on the result. The following script can now remove the device on the first attempt, versus a few attempts with the original Wait(): #!/bin/bash #export MDADM_NO_MDMON=1 export IMSM_DEVNAME_AS_SERIAL=1 ./mdadm -Ss ./mdadm --zero-superblock /dev/loop[0-3] echo 2 > /proc/sys/dev/raid/speed_limit_max ./mdadm --create /dev/imsm /dev/loop[0-3] -n 4 -e imsm -a md ./mdadm --create /dev/md/r1 /dev/loop[0-3] -n 4 -l 5 --force -a mdp ./mdadm --fail /dev/md/r1 /dev/loop3 ./mdadm --wait /dev/md/r1 x=0 while ! ./mdadm --remove /dev/imsm /dev/loop3 > /dev/null 2>&1 do x=$((x+1)) done echo "removed after $x attempts" ./mdadm --add /dev/imsm /dev/loop3 Include 2 small cleanups: * remove the almost open coded fd2devnum() in Wait() by introducing a new utility routine stat2devnum() * teach connect_monitor() to parse the container device from a subarray string Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00
Dan Williams	8ed3e5e1bf	Honor safemode_delay at Create() and Incremental() time Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00
Dan Williams	a67dd8cc58	Allow metadata handlers to communicate desired safemode delay via mdinfo Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-09-15 20:58:42 -07:00
NeilBrown	e9dd159873	Allow an externally managed array to be marked readonly If the metadata_version is -mdXXX/whatever rather than /mdXXX/whatever then the array is readonly and should be left alone by mdmon. Signed-off-by: NeilBrown <neilb@suse.de>	2008-08-19 17:55:15 +10:00
NeilBrown	3c558363a1	Factor out test for subarray version string. We are about to change the syntax of the version string for 'subarray's. So factor out the test into a single function. Signed-off-by: NeilBrown <neilb@suse.de>	2008-08-19 17:55:15 +10:00
NeilBrown	01f157d74a	Extra option for set_array_state: you choose dirty or clean. When we first start an array, it might be good to start recovery straight away. That requires setting the array to 'dirty', but only the metadata handler can know if that is required or not. So have a third possible 'consistent' option to set_array_state. Either 'no' or 'yes' or 'you choose'. Return value indicates what was chosen. '1' (no) should be chosen unless there is a good reason. Signed-off-by: NeilBrown <neilb@suse.de>	2008-08-19 14:54:55 +10:00
Dan Williams	9296754385	mdmon: handle failures versus readauto arrays Transition readauto arrays to active before failing drives. Hmm... why do we keep reblocking / renotifying in the readonly case? Need to bottom out on this, but not right now. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-15 10:58:43 -07:00
Dan Williams	f1d267661d	mdmon: allow degraded arrays to be monitored manage_new is too strict in the face of failed devices. Teach it to monitor degraded arrays. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-08-15 10:58:43 -07:00
Dan Williams	755c99faf2	sysfs: deprecate sysfs_disk_to_sg The cmd_filter patch merged for 2.6.27 broke retrieving the serial number via an ioctl to /dev/sgN. In debugging this I found that other utilities like sdparm simply run the ioctl on /dev/sdX. So just convert to that for protection in numbers, but scream on the mailing list for the inconvenience grr... Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-24 17:26:24 -07:00
NeilBrown	8850ee3e1e	Factor common code into new "start_mdmon". Signed-off-by: Neil Brown <neilb@suse.de>	2008-07-18 16:37:11 +10:00
Dan Williams	5dcfcb715d	mdadm: add an environment variable to prevent auto-launching mdmon Useful for attaching gdb to mdmon before any action is taken on the array. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-07-14 14:59:32 -07:00
Neil Brown	77472ff8d0	Introduce devname2devnum and use it instead of opencoding.	2008-07-12 20:28:38 +10:00
Neil Brown	2c514b7120	Pass 'verbose' flag to validate_geometry That way it can be silent when we are just trying to figure out which metadata to use, and noisy when detecting a real problem.	2008-07-12 20:28:38 +10:00
Neil Brown	6416d5275d	Use O_DIRECT for all IO to devices. Using buffered IO risks non-atomic updates to parts of the device that we don't actually want to write to. This isn't in general safe. So switch to O_DIRECT for all that IO and make sure we have properly aligned buffers.	2008-07-12 20:28:33 +10:00
Neil Brown	edd8d13c02	Create arrays via metadata-update Support creating arrays inside an active ddf container by sending a metadata update over a pipe to mdmon.	2008-07-12 20:27:40 +10:00
Neil Brown	4d43913ce0	Remove mgr_pipe for communicating from manage to monitor. Data is being passed in shared memory, so the pipe is only being use as a wakeup. This can more easily be done with a thread-signal.	2008-07-12 20:27:40 +10:00
Neil Brown	2f64e61a50	Remove mon_pipe for communicating from monitor to manager The returned value was never used, and we don't really want this return path anyway as writing to a pipe could conceivably block, and the monitor must not block.	2008-07-12 20:27:40 +10:00
Neil Brown	f94d52f43e	Handle device removal from container This really should be done in mdadm, not mdmon. We ensure the device won't be suddenly commited as a hot-spare using O_EXCL, then check the 'holders' sysfs directory to make sure it is only in use once.	2008-07-12 20:27:40 +10:00
Neil Brown	78e449282e	Remove the multiple super_switchs for ddf. It is simpler if there is just one, and the methods make decisions as appropriate.	2008-07-12 20:27:39 +10:00
Neil Brown	d2ca644994	Remove getinfo_super_n and do some other cleaning up. Getting close to a sensible description of what some of the superswitch methods are supposed to do!	2008-07-12 20:27:39 +10:00
Neil Brown	f7e7067b47	Add subarray field to supertype. When loading the metadata for a subarray (super_by_fd), we set ->subarray to be the name read from md/metadata_version so that getinfo_super can return info about the correct array. With this we can differentiate between a container and an array within the container by looking at ->subarray[0].	2008-07-12 20:27:38 +10:00
Neil Brown	6adfd3affd	Add some comments to explain some of the bits of superswitch.	2008-07-12 20:27:38 +10:00
Neil Brown	0063ecba3d	Hide subordinate superswitch structures. Only one superswitch should be externally visible for each general type. Others which handle different flavours (e.g. container/data-array) should be internal only.	2008-07-12 20:27:38 +10:00
Neil Brown	b8ac196795	Remove 'major' from superswitch. It isn't generally meaningful.	2008-07-12 20:27:37 +10:00
Neil Brown	1522c538b1	Use text_version in map_file rather than major.minor.	2008-07-12 20:27:37 +10:00
Dan Williams	8b35327854	imsm: 'volume' is the proper name for imsm container members Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-13 17:42:09 -07:00
Dan Williams	f1665f7200	sysfs: helper routine to retrieve the scsi id imsm records this information in its metadata Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-13 17:27:30 -07:00
Dan Williams	90c8b70714	sysfs: provide a helper function for locating scsi_generic interfaces imsm records and validates this data in its metadata Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-06-13 17:27:30 -07:00
Neil Brown	6c3fb95c44	Support adding a spare to a degraded array. When signalled by the monitor, the manager will find spares and add them to the array and initiate a recovery.	2008-06-12 10:13:29 +10:00
Neil Brown	2e735d1982	Allow passing metadata update to the monitor. Code in manager can now just call queue_metadata_update with a (freeable) buf holding the update, and it will get passed to the monitor and written out.	2008-06-12 10:13:23 +10:00
Neil Brown	cba0191bad	Parse the 'instance' part of external:/mdXX/INST in metadata handler. This give more flexability.	2008-05-27 09:18:57 +10:00
Neil Brown	dd15dc4a4d	Discard st->container_member 'container_member' isn't really a well defined concept. Each metadata might enumerate members differently, so just let each format /mdX/YYYY as appropriate.	2008-05-27 09:18:56 +10:00
Neil Brown	159c3a1a77	Remove st->text_version in favour of info->text_version I want the metadata handler to have more control over the 'version', particularly for arrays which are members of containers. So discard st->text_version and instead use info->text_version which getinfo_super can initialise.	2008-05-27 09:18:55 +10:00
Neil Brown	ed9d66aade	Change mark_clean to set_array_state. DDF needs more fine grained understanding of the array state.	2008-05-27 09:18:54 +10:00
Neil Brown	a931db9ed7	auto-start mdmon on --create FIXME uses sill hardcoded path. Need --assemble too.	2008-05-27 09:18:42 +10:00
Neil Brown	e0d6609fe6	Exit when there are no more arrays to manage.	2008-05-27 09:18:41 +10:00
Neil Brown	5869a76c90	Remove supertype->devfd It is never used.	2008-05-27 09:18:40 +10:00
Neil Brown	1ed3f38758	Remove stopped arrays. When an array becomes inactive, clean up and forget it. This involves signalling the manager.	2008-05-27 09:18:39 +10:00
Neil Brown	7a7cc50430	Set status of devices in ddf. Might work a little bit....	2008-05-27 09:18:38 +10:00
Neil Brown	4e5528c6f7	Implement mark_clean for ddf and remove mark_dirty and mark_sync mark_dirty is just a special case of mark_clean - with sync_pos == 0. mark_sync is not required. We don't modify the metadata when sync finishes. Only when the array becomes non-writeable at which point we use mark_clean to record how far the resync progressed.	2008-05-27 09:18:38 +10:00
Neil Brown	2318b9f0dc	Remove 'fd' arg from sysfs_add_disk It it never used, and removing means there are several 'open's that can go.	2008-05-27 09:18:32 +10:00
Dan Williams	3e70c845e2	add infrastructure to receive higher order commands, like remove_device From: Dan Williams <dan.j.williams@intel.com> Each md_message encapsulates a single command. A command includes an 'action' member which describes what if any data comes after the action. Communication with the monitor involves updating the active_cmd pointer and then writing to mgr_pipe. Pass/fail status is returned via mon_pipe. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:54 +10:00
Dan Williams	8d45d1969b	handle disk failures From: Dan Williams <dan.j.williams@intel.com> Added curr_state as a parameter to set_disk. Handlers look at this to record components failures, and set global 'degraded' or 'failed' status. When reading the state as faulty: 1/ mark the disk failed in the metadata 2/ write '-blocked' to the rdev state to allow the kernel's failure mechanism to advance 3/ the kernel will take away the drive's role in remove_and_add_spares() 4/ once the disk no longer has a role writing 'remove' to the rdev state will get the disk out of array. There is a window after writing '-blocked' where the kernel will return -EBUSY to remove requests. We rely on the fact that the disk will continue to show faulty so we lazily wait until the kernel is ready to remove the disk. If the manager thread needs to get the disk out of the way it can ping the monitor and wait, just like the replace_array() case. [buglet fix: swap the parameters of attr_match in read_dev_state] Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:49 +10:00
Dan Williams	fd7cde1bf0	handle resync completion From: Dan Williams <dan.j.williams@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:42 +10:00
Neil Brown	549e9569c6	Merge mdmon	2008-05-15 16:48:37 +10:00
Dan Williams	f7dd881f90	handle Manage_subdevs() for 'external' arrays From: Dan Williams <dan.j.williams@intel.com> 1/ Block attempts to add/remove devices from container members 2/ Forward add/remove requests to containers Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:35 +10:00
Dan Williams	0fd5c350e5	set resync_start in Incremental_container From: Dan Williams <dan.j.williams@intel.com> Metadata handlers set mdinfo.resync_start depending on the state of the array. By default mdadm assumes the array is dirty and needs a full resync. Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:33 +10:00
Dan Williams	5f2aace8eb	Set 'metadata_version' for container_member in Incremental_container From: Dan Williams <dan.j.williams@intel.com> Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:25 +10:00
Dan Williams	cdddbdbca0	imsm: initial Intel(R) Matrix Storage Manager support From: Dan Williams <dan.j.williams@intel.com> The following now work: --examine --examine --brief Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:22 +10:00
Neil Brown	2f6079dc96	Create a container member From: Neil Brown <neilb@suse.de>	2008-05-15 16:48:21 +10:00
Neil Brown	598f0d58ac	Can now mostly assemble DDF arrays	2008-05-15 16:48:19 +10:00
Neil Brown	2503d23b5a	More ddf stuff	2008-05-15 16:48:17 +10:00
Neil Brown	5f8097beb9	more ddf stuff Create a BVD in a DDF Do not actually assemble it yet...	2008-05-15 16:48:15 +10:00
Dan Williams	a322f70c41	Initial DDF support code. Create a ddf array by naming the device /dev/ddf* or specifying metadata 'ddf'. If ddf is specified with no level, assume a container (indeed, anything else would be wrong). **Need to use text_Version to set external metadata... More ddf support Load a ddf container. Now --examine /dev/ddf works. super-ddf: fix compile warning From: Dan Williams <dan.j.williams@intel.com> super-ddf.c:723: format %lu expects type long unsigned int, but argument 3 has type unsigned int Signed-off-by: Dan Williams <dan.j.williams@intel.com>	2008-05-15 16:48:14 +10:00
Neil Brown	d03373f1de	Some support for external metadata. Allow specifying metadata type when creating arrays etc.	2008-05-15 16:48:13 +10:00
Neil Brown	111d01fcc7	Change write_init_super to be called only once. The current model for creating arrays involves writing a superblock to each device in the array. With containers (as with DDF), that model doesn't work. Every device in the container may need to be updated for an array made from just some the devices in a container. So instead of calling write_init_super for each device, we call it once for the array and have it iterate over all the devices in the array. To help with this, ->add_to_super now passes in an 'fd' and name for the device. These get saved for use by write_init_super. So add_to_super takes ownership of the fd, and write_init_super will close it. This information is stored in the new 'info' field of supertype. As part of this, write_init_super now removes any old traces of raid metadata rather than doing this in common code.	2008-05-15 16:48:12 +10:00

1 2 3 4 5 ...

361 Commits