All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH] DM RAID:  Add message/status support for changing sync action
@ 2013-03-15 20:48 Jonathan Brassow
  2013-03-17 22:35 ` NeilBrown
  2013-03-17 22:52 ` [dm-devel] " Alasdair G Kergon
  0 siblings, 2 replies; 4+ messages in thread
From: Jonathan Brassow @ 2013-03-15 20:48 UTC (permalink / raw)
  To: linux-raid; +Cc: neilb, agk, jbrassow, dm-devel

DM RAID:  Add message/status support for changing sync action

This patch adds a message interface to dm-raid to allow the user to more
finely control the sync actions being performed by the MD driver.  This
gives the user the ability to initiate "check" and "repair" (i.e. scrubbing).
Two additional fields have been added to the status output to provide more
information about the type of sync action occurring and the results of those
actions, specifically: <sync_action> and <mismatch_cnt>.  This is essentially
the device-mapper way of doing what MD controls through the 'sync_action'
sysfs file and shows through the 'mismatch_cnt' sysfs file.

This is a first patch and I still have two concerns:
1) in 'raid_messages', reap_sync_thread() is not called when "frozen" or
   "idle" is issued like it would be in  md.c:action_store() because it is
   not available to dm-raid.c.  This hasn't prevented the thread from
   shutting down properly in my tests, but there may be something I'm
   not considering.
2) (and this is more of an LVM problem)  The "in-sync ratio" field of the
   status output has generally been used to determine the progress of the
   initial synchronization or of rebuilding a device.  "check" and "repair"
   have not been available options until now.  So, if this ratio also
   shows the progress of a "check", older versions of LVM would seem to
   indicate (through the 'sync%' field) that the array is not in-sync, when
   in fact it is.  Thus, this change might be perceived as a break to
   backwards compatability.  (However, the old LVM tools would not be able
   to /issue/ a "check" command and should never run into this.)  I don't
   see this as a problem for this patch, but I thought it should be
   brought up and considered.   Obviously, code in LVM will need to be
   changed to adjust to this - perhaps to show the different types of sync
   actions that might be occurring.

Signed-off-by: Jonathan Brassow <jbrassow@redhat.com>

Index: linux-upstream/drivers/md/dm-raid.c
===================================================================
--- linux-upstream.orig/drivers/md/dm-raid.c
+++ linux-upstream/drivers/md/dm-raid.c
@@ -1287,6 +1287,7 @@ static int raid_status(struct dm_target
 	unsigned sz = 0;
 	int i, array_in_sync = 0;
 	sector_t sync;
+	char *sync_action_type = "idle";
 
 	switch (type) {
 	case STATUSTYPE_INFO:
@@ -1298,8 +1299,28 @@ static int raid_status(struct dm_target
 			sync = rs->md.recovery_cp;
 
 		if (sync >= rs->md.resync_max_sectors) {
+			/*
+			 * Sync complete.
+			 */
 			array_in_sync = 1;
 			sync = rs->md.resync_max_sectors;
+		} else if (test_bit(MD_RECOVERY_CHECK, &rs->md.recovery)) {
+			/*
+			 * If "check" is occurring, the array has undergone
+			 * and initial sync and the health characters should
+			 * not be 'a' anymore.
+			 */
+			array_in_sync = 1;
+
+			/*
+			 * FIXME:  We could set this if we /don't/ want to
+			 *         display the pace of a 'check'.  Right now,
+			 *         the 'sync_action' field and the ratio field
+			 *         work together to show what is being done.
+			 *         THIS MIGHT BE PERCEIVED AS BREAKING
+			 *         BACKWARDS COMPATIBILITY.
+			 */
+//			sync = rs->md.resync_max_sectors;
 		} else {
 			/*
 			 * The array may be doing an initial sync, or it may
@@ -1311,6 +1332,7 @@ static int raid_status(struct dm_target
 				if (!test_bit(In_sync, &rs->dev[i].rdev.flags))
 					array_in_sync = 1;
 		}
+
 		/*
 		 * Status characters:
 		 *  'D' = Dead/Failed device
@@ -1339,6 +1361,41 @@ static int raid_status(struct dm_target
 		       (unsigned long long) sync,
 		       (unsigned long long) rs->md.resync_max_sectors);
 
+		/*
+		 * Sync action:
+		 *   See Documentation/device-mapper/dm-raid.c for
+		 *   information on each of these states.
+		 */
+		if (test_bit(MD_RECOVERY_FROZEN, &rs->md.recovery))
+			sync_action_type = "frozen";
+		else if (test_bit(MD_RECOVERY_RUNNING, &rs->md.recovery) ||
+			 (!rs->md.ro && test_bit(MD_RECOVERY_NEEDED,
+						 &rs->md.recovery))) {
+			if (test_bit(MD_RECOVERY_RESHAPE, &rs->md.recovery))
+				sync_action_type = "reshape";
+			else if (test_bit(MD_RECOVERY_SYNC, &rs->md.recovery)) {
+				if (!test_bit(MD_RECOVERY_REQUESTED,
+					      &rs->md.recovery))
+					sync_action_type = "resync";
+				else if (test_bit(MD_RECOVERY_CHECK,
+						  &rs->md.recovery))
+					sync_action_type = "check";
+				else
+					sync_action_type = "repair";
+			} else if (test_bit(MD_RECOVERY_RECOVER,
+					    &rs->md.recovery))
+				sync_action_type = "recover";
+		}
+		DMEMIT(" %s", sync_action_type);
+
+		/*
+		 * resync_mismatches/mismatch_cnt
+		 *   This field shows the number of discrepancies found when
+		 *   performing a "check" of the array.
+		 */
+		DMEMIT(" %llu",
+		       (unsigned long long)
+		       atomic64_read(&rs->md.resync_mismatches));
 		break;
 	case STATUSTYPE_TABLE:
 		/* The string you would use to construct this array */
@@ -1427,7 +1484,62 @@ static int raid_status(struct dm_target
 	return 0;
 }
 
-static int raid_iterate_devices(struct dm_target *ti, iterate_devices_callout_fn fn, void *data)
+static int raid_message(struct dm_target *ti, unsigned argc, char **argv)
+{
+	struct raid_set *rs = ti->private;
+	struct mddev *mddev = &rs->md;
+
+	if (!strcasecmp(argv[0], "reshape")) {
+		DMERR("Reshape not supported.");
+		return -EINVAL;
+	}
+
+	if (!mddev->pers || !mddev->pers->sync_request)
+		return -EINVAL;
+
+	if (!strcasecmp(argv[0], "frozen"))
+		set_bit(MD_RECOVERY_FROZEN, &mddev->recovery);
+	else
+		clear_bit(MD_RECOVERY_FROZEN, &mddev->recovery);
+
+	if (!strcasecmp(argv[0], "idle") || !strcasecmp(argv[0], "frozen")) {
+		if (mddev->sync_thread) {
+			set_bit(MD_RECOVERY_INTR, &mddev->recovery);
+			/* FIXME: add reap_sync_thread(mddev); ? */
+		}
+	} else if (test_bit(MD_RECOVERY_RUNNING, &mddev->recovery) ||
+		   test_bit(MD_RECOVERY_NEEDED, &mddev->recovery))
+		return -EBUSY;
+	else if (!strcasecmp(argv[0], "resync"))
+		set_bit(MD_RECOVERY_NEEDED, &mddev->recovery);
+	else if (!strcasecmp(argv[0], "recover")) {
+		set_bit(MD_RECOVERY_RECOVER, &mddev->recovery);
+		set_bit(MD_RECOVERY_NEEDED, &mddev->recovery);
+	} else {
+		if (!strcasecmp(argv[0], "check"))
+			set_bit(MD_RECOVERY_CHECK, &mddev->recovery);
+		else if (!!strcasecmp(argv[0], "repair"))
+			return -EINVAL;
+		set_bit(MD_RECOVERY_REQUESTED, &mddev->recovery);
+		set_bit(MD_RECOVERY_SYNC, &mddev->recovery);
+	}
+	if (mddev->ro == 2) {
+		/* A write to sync_action is enough to justify
+		 * canceling read-auto mode
+		 */
+		mddev->ro = 0;
+		if (!mddev->suspended)
+			md_wakeup_thread(mddev->sync_thread);
+	}
+	set_bit(MD_RECOVERY_NEEDED, &mddev->recovery);
+	if (!mddev->suspended)
+		md_wakeup_thread(mddev->thread);
+
+	return 0;
+}
+
+static int raid_iterate_devices(struct dm_target *ti,
+				iterate_devices_callout_fn fn, void *data)
 {
 	struct raid_set *rs = ti->private;
 	unsigned i;
@@ -1484,12 +1596,13 @@ static void raid_resume(struct dm_target
 
 static struct target_type raid_target = {
 	.name = "raid",
-	.version = {1, 4, 1},
+	.version = {1, 4, 2},
 	.module = THIS_MODULE,
 	.ctr = raid_ctr,
 	.dtr = raid_dtr,
 	.map = raid_map,
 	.status = raid_status,
+	.message = raid_message,
 	.iterate_devices = raid_iterate_devices,
 	.io_hints = raid_io_hints,
 	.presuspend = raid_presuspend,
Index: linux-upstream/Documentation/device-mapper/dm-raid.txt
===================================================================
--- linux-upstream.orig/Documentation/device-mapper/dm-raid.txt
+++ linux-upstream/Documentation/device-mapper/dm-raid.txt
@@ -1,10 +1,13 @@
 dm-raid
--------
+=======
 
 The device-mapper RAID (dm-raid) target provides a bridge from DM to MD.
 It allows the MD RAID drivers to be accessed using a device-mapper
 interface.
 
+
+Mapping Table Interface
+-----------------------
 The target is named "raid" and it accepts the following parameters:
 
   <raid_type> <#raid_params> <raid_params> \
@@ -47,7 +50,7 @@ The target is named "raid" and it accept
     followed by optional parameters (in any order):
 	[sync|nosync]   Force or prevent RAID initialization.
 
-	[rebuild <idx>]	Rebuild drive number idx (first drive is 0).
+	[rebuild <idx>]	Rebuild drive number 'idx' (first drive is 0).
 
 	[daemon_sleep <ms>]
 		Interval between runs of the bitmap daemon that
@@ -56,9 +59,9 @@ The target is named "raid" and it accept
 
 	[min_recovery_rate <kB/sec/disk>]  Throttle RAID initialization
 	[max_recovery_rate <kB/sec/disk>]  Throttle RAID initialization
-	[write_mostly <idx>]		   Drive index is write-mostly
-	[max_write_behind <sectors>]       See '-write-behind=' (man mdadm)
-	[stripe_cache <sectors>]           Stripe cache size (higher RAIDs only)
+	[write_mostly <idx>]		   Mark drive index 'idx' write-mostly.
+	[max_write_behind <sectors>]       See '--write-behind=' (man mdadm)
+	[stripe_cache <sectors>]           Stripe cache size (RAID 4/5/6 only)
 	[region_size <sectors>]
 		The region_size multiplied by the number of regions is the
 		logical size of the array.  The bitmap records the device
@@ -122,7 +125,7 @@ The target is named "raid" and it accept
 	given for both the metadata and data drives for a given position.
 
 
-Example tables
+Example Tables
 --------------
 # RAID4 - 4 data drives, 1 parity (no metadata devices)
 # No metadata devices specified to hold superblock/bitmap info
@@ -141,26 +144,69 @@ Example tables
         raid4 4 2048 sync min_recovery_rate 20 \
         5 8:17 8:18 8:33 8:34 8:49 8:50 8:65 8:66 8:81 8:82
 
+
+Status Output
+-------------
 'dmsetup table' displays the table used to construct the mapping.
 The optional parameters are always printed in the order listed
 above with "sync" or "nosync" always output ahead of the other
 arguments, regardless of the order used when originally loading the table.
 Arguments that can be repeated are ordered by value.
 
-'dmsetup status' yields information on the state and health of the
-array.
-The output is as follows:
+
+'dmsetup status' yields information on the state and health of the array.
+The output is as follows (normally a single line, but expanded here for
+clarity):
 1: <s> <l> raid \
-2:      <raid_type> <#devices> <1 health char for each dev> <resync_ratio>
+2:      <raid_type> <#devices> <health_chars> \
+3:      <resync_ratio> <sync_action> <mismatch_cnt>
 
 Line 1 is the standard output produced by device-mapper.
-Line 2 is produced by the raid target, and best explained by example:
-        0 1960893648 raid raid4 5 AAAAA 2/490221568
+Line 2 & 3 are produced by the raid target and are best explained by example:
+        0 1960893648 raid raid4 5 AAAAA 2/490221568 init 0
 Here we can see the RAID type is raid4, there are 5 devices - all of
-which are 'A'live, and the array is 2/490221568 complete with recovery.
-Faulty or missing devices are marked 'D'.  Devices that are out-of-sync
-are marked 'a'.
-
+which are 'A'live, and the array is 2/490221568 complete with its initial
+recovery.  Here is a fuller description of the individual fields:
+	<raid_type>     Same as the <raid_type> used to create the array.
+	<health_chars>  One char for each device, indicating: 'A' = alive and
+			in-sync, 'a' = alive but not in-sync, 'D' = dead/failed.
+	<resync_ratio>  The ratio indicating how much of the array is know to be
+			in-sync.  This ratio only applies to an array undergoing
+			its initialization or recovery.
+	<sync_action>   One of the following possible states:
+			idle    - No synchronization action is being performed.
+			frozen  - The current action has been halted.
+			resync  - Array is undergoing its initial synchronization
+				  or is resynchronizing after an unclean shutdown
+				  (possibly aided by a bitmap).
+			recover - A device in the array is being rebuilt or
+				  replaced.
+			check   - A user-initiated full check of the array is
+				  being performed.  All blocks are read and
+				  checked for consistency.  The number of
+				  discrepancies found are recorded in
+				  <mismatch_cnt>.  No changes are made to the
+				  array by this action.
+			repair  - The same as "check", but discrepancies are
+				  corrected.
+			reshape - The array is undergoing a reshape.
+	<mismatch_cnt>  The number of discrepancies found between mirror copies
+			in RAID1/10 or wrong parity values found in RAID4/5/6.
+			This value is valid only after a "check" of the array
+			is performed.  A healthy array has a 'mismatch_cnt' of 0.
+
+Message Interface
+-----------------
+The dm-raid target will accept certain actions through the 'message' interface.
+('man dmsetup' for more information on the message interface.)  These actions
+include:
+	"idle"   - Halt the current sync action.
+	"frozen" - Freeze the current sync action.
+	"resync" - Initiate/continue a resync.
+	"recover"- Initiate/continue a recover process.
+	"check"  - Initiate a check (i.e. a "scrub") of the array.
+	"repair" - Initiate a repair of the array.
+	"reshape"- Currently unsupported (-EINVAL).
 
 Version History
 ---------------
@@ -172,3 +218,5 @@ Version History
 1.3.2   Fix/improve redundancy checking for RAID10
 1.4.0	Non-functional change.  Removes arg from mapping function.
 1.4.1   Add RAID10 "far" and "offset" algorithm support.
+1.4.2   Add message interface to allow manipulation of the sync_action.
+	New status (STATUSTYPE_INFO) fields: sync_action and mismatch_cnt.



^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH] DM RAID:  Add message/status support for changing sync action
  2013-03-15 20:48 [PATCH] DM RAID: Add message/status support for changing sync action Jonathan Brassow
@ 2013-03-17 22:35 ` NeilBrown
  2013-03-19 16:54   ` Brassow Jonathan
  2013-03-17 22:52 ` [dm-devel] " Alasdair G Kergon
  1 sibling, 1 reply; 4+ messages in thread
From: NeilBrown @ 2013-03-17 22:35 UTC (permalink / raw)
  To: Jonathan Brassow; +Cc: linux-raid, agk, dm-devel

[-- Attachment #1: Type: text/plain, Size: 2986 bytes --]

On Fri, 15 Mar 2013 15:48:23 -0500 Jonathan Brassow <jbrassow@redhat.com>
wrote:

> DM RAID:  Add message/status support for changing sync action
> 
> This patch adds a message interface to dm-raid to allow the user to more
> finely control the sync actions being performed by the MD driver.  This
> gives the user the ability to initiate "check" and "repair" (i.e. scrubbing).
> Two additional fields have been added to the status output to provide more
> information about the type of sync action occurring and the results of those
> actions, specifically: <sync_action> and <mismatch_cnt>.  This is essentially
> the device-mapper way of doing what MD controls through the 'sync_action'
> sysfs file and shows through the 'mismatch_cnt' sysfs file.
> 
> This is a first patch and I still have two concerns:
> 1) in 'raid_messages', reap_sync_thread() is not called when "frozen" or
>    "idle" is issued like it would be in  md.c:action_store() because it is
>    not available to dm-raid.c.  This hasn't prevented the thread from
>    shutting down properly in my tests, but there may be something I'm
>    not considering.

I think the only real need for the reap_sync_thread() call is to make the
change synchronous.  i.e. after "echo frozen > sync_action" completes, you
know that the array actually is frozen, rather that it should be frozen very
soon.  This can be important for code which wants to set 'frozen', then
manipulate the array in a way that would be prevented if it was still
undergoing a resync/recovery/check/etc.

I'm don't know how much I actually depend on that property, but it sounds
like a good one to have.
I would probably be OK to export reap_sync_thread() so that dm-raid can use
it.


> 2) (and this is more of an LVM problem)  The "in-sync ratio" field of the
>    status output has generally been used to determine the progress of the
>    initial synchronization or of rebuilding a device.  "check" and "repair"
>    have not been available options until now.  So, if this ratio also
>    shows the progress of a "check", older versions of LVM would seem to
>    indicate (through the 'sync%' field) that the array is not in-sync, when
>    in fact it is.  Thus, this change might be perceived as a break to
>    backwards compatability.  (However, the old LVM tools would not be able
>    to /issue/ a "check" command and should never run into this.)  I don't
>    see this as a problem for this patch, but I thought it should be
>    brought up and considered.   Obviously, code in LVM will need to be
>    changed to adjust to this - perhaps to show the different types of sync
>    actions that might be occurring.

I see your point.  I don't have any opinion on whether it is a real problem
or just a theoretical one though.

In general, the patch looks good - particular as it includes documentation
changes!  When you get a very you are happy with I'll apply it.

Thanks,
NeilBrown



[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 828 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [dm-devel] [PATCH] DM RAID: Add message/status support for changing sync action
  2013-03-15 20:48 [PATCH] DM RAID: Add message/status support for changing sync action Jonathan Brassow
  2013-03-17 22:35 ` NeilBrown
@ 2013-03-17 22:52 ` Alasdair G Kergon
  1 sibling, 0 replies; 4+ messages in thread
From: Alasdair G Kergon @ 2013-03-17 22:52 UTC (permalink / raw)
  To: Jonathan Brassow; +Cc: linux-raid, dm-devel, agk

On Fri, Mar 15, 2013 at 03:48:23PM -0500, Jon Brassow wrote:
> DM RAID:  Add message/status support for changing sync action
 
Quick points:

> Two additional fields 

(which are *always* populated)

> have been added 

appended

> to the status output to provide more
> information about the type of sync action occurring and the results of those
> actions, specifically: <sync_action> and <mismatch_cnt>.  This is essentially
> the device-mapper way of doing what MD controls through the 'sync_action'
> sysfs file and shows through the 'mismatch_cnt' sysfs file.
 
> +	char *sync_action_type = "idle";

(const?)  

> +		/*
> +		 * Sync action:
> +		 *   See Documentation/device-mapper/dm-raid.c for
> +		 *   information on each of these states.
> +		 */
> +		if (test_bit(MD_RECOVERY_FROZEN, &rs->md.recovery))
> +			sync_action_type = "frozen";
> +		else if (test_bit(MD_RECOVERY_RUNNING, &rs->md.recovery) ||
> +			 (!rs->md.ro && test_bit(MD_RECOVERY_NEEDED,
> +						 &rs->md.recovery))) {
> +			if (test_bit(MD_RECOVERY_RESHAPE, &rs->md.recovery))
> +				sync_action_type = "reshape";
> +			else if (test_bit(MD_RECOVERY_SYNC, &rs->md.recovery)) {
> +				if (!test_bit(MD_RECOVERY_REQUESTED,
> +					      &rs->md.recovery))
> +					sync_action_type = "resync";
> +				else if (test_bit(MD_RECOVERY_CHECK,
> +						  &rs->md.recovery))
> +					sync_action_type = "check";
> +				else
> +					sync_action_type = "repair";
> +			} else if (test_bit(MD_RECOVERY_RECOVER,
> +					    &rs->md.recovery))
> +				sync_action_type = "recover";
> +		}

Move this out into a fn that can simply return once it's selected
the most appropriate string?

> -	.version = {1, 4, 1},
> +	.version = {1, 4, 2},

  1.5.0 - inc minor when the change affects interface or behaviour in a non-neglible way

Alasdair


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH] DM RAID:  Add message/status support for changing sync action
  2013-03-17 22:35 ` NeilBrown
@ 2013-03-19 16:54   ` Brassow Jonathan
  0 siblings, 0 replies; 4+ messages in thread
From: Brassow Jonathan @ 2013-03-19 16:54 UTC (permalink / raw)
  To: NeilBrown; +Cc: linux-raid, agk, dm-devel


On Mar 17, 2013, at 5:35 PM, NeilBrown wrote:

> 
> In general, the patch looks good - particular as it includes documentation
> changes!  When you get a very you are happy with I'll apply it.
> 
> Thanks,
> NeilBrown
> 

I'll be sending out a version 2, which is split into two patches and addresses the comments made.

 brassow


^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2013-03-19 16:54 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-03-15 20:48 [PATCH] DM RAID: Add message/status support for changing sync action Jonathan Brassow
2013-03-17 22:35 ` NeilBrown
2013-03-19 16:54   ` Brassow Jonathan
2013-03-17 22:52 ` [dm-devel] " Alasdair G Kergon

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.