All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH] scsi: ses: don't get power status of SES device slot on probe
@ 2017-03-30  1:02 Mauricio Faria de Oliveira
  2017-04-04 21:07 ` Dan Williams
  0 siblings, 1 reply; 5+ messages in thread
From: Mauricio Faria de Oliveira @ 2017-03-30  1:02 UTC (permalink / raw)
  To: linux-scsi, jejb, martin.petersen, songliubraving
  Cc: dan.j.williams, axboe, hare, hch, gpiccoli

The commit 08024885a2a3 ("ses: Add power_status to SES device slot")
introduced the 'power_status' attribute to enclosure components and
the associated callbacks.

There are 2 callbacks available to get the power status of a device:
1) ses_get_power_status() for 'struct enclosure_component_callbacks'
2) get_component_power_status() for the sysfs device attribute
(these are available for kernel-space and user-space, respectively.)

However, despite both methods being available to get power status
on demand, that commit also introduced a call to get power status
in ses_enclosure_data_process().

This dramatically increased the total probe time for SCSI devices
on larger configurations, because ses_enclosure_data_process() is
called several times during the SCSI devices probe and loops over
the component devices (but that is another problem, another patch).

That results in a tremendous continuous hammering of SCSI Receive
Diagnostics commands to the enclosure-services device, which does
delay the total probe time for the SCSI devices __significantly__:

  Originally, ~34 minutes on a system attached to ~170 disks:

    [ 9214.490703] mpt3sas version 13.100.00.00 loaded
    ...
    [11256.580231] scsi 17:0:177:0: qdepth(16), tagged(1), simple(0),
                   ordered(0), scsi_level(6), cmd_que(1)

  With this patch, it decreased to ~2.5 minutes -- a 13.6x faster

    [ 1002.992533] mpt3sas version 13.100.00.00 loaded
    ...
    [ 1151.978831] scsi 11:0:177:0: qdepth(16), tagged(1), simple(0),
                   ordered(0), scsi_level(6), cmd_que(1)

Another downside is that the SCSI device class lock is held during
the executions of ses_enclosure_data_process() since it's a callee
of ses_intf_add(), which is executed under that lock in device_add().
This ends up blocking the parallel SCSI scan functionality because
device_add() depends on that lock, and also the removal of devices
via device_del() (e.g., during an enclosure reboot).

Back to the commit discussion.. on the ses_get_power_status() call
introduced in ses_enclosure_data_process(): impact of removing it.

That may possibly be in place to initialize the power status value
on device probe.  However, those 2 functions available to retrieve
that value _do_ automatically refresh/update it.  So the potential
benefit would be a direct access of the 'power_status' field which
does not use the callbacks...

But the only reader of 'struct enclosure_component::power_status'
is the get_component_power_status() callback for sysfs attribute,
and it _does_ check for and call the .get_power_status callback,
(which indeed is defined and implemented by that commit), so the
power status value is, again, automatically updated.

So, the remaining potential for a direct/non-callback access to
the power_status attribute would be out-of-tree modules -- well,
for those, if they are for whatever reason interested in values
that are set during device probe and not up-to-date by the time
they need it.. well, that would be curious.

So, for out-of-tree modules and people still interested in this
behavior (in case I might have missed something about it, other
than the total probe time impact): let's add a module parameter
to keep that check turned on (disabled by default).

Also, to handle that more properly, set the initial power state
value to '-1' (i.e., uninitialized) instead of '1' (power 'on'),
and check for it in that callback which may do an direct access
to the field value _if_ a callback function is not defined.

Signed-off-by: Mauricio Faria de Oliveira <mauricfo@linux.vnet.ibm.com>
Fixes: 08024885a2a3 ("ses: Add power_status to SES device slot")
---
 drivers/misc/enclosure.c | 6 +++++-
 drivers/scsi/ses.c       | 9 ++++++++-
 2 files changed, 13 insertions(+), 2 deletions(-)

diff --git a/drivers/misc/enclosure.c b/drivers/misc/enclosure.c
index 65fed7146e9b..d3a8a13c4247 100644
--- a/drivers/misc/enclosure.c
+++ b/drivers/misc/enclosure.c
@@ -148,7 +148,7 @@ struct enclosure_device *
 	for (i = 0; i < components; i++) {
 		edev->component[i].number = -1;
 		edev->component[i].slot = -1;
-		edev->component[i].power_status = 1;
+		edev->component[i].power_status = -1;
 	}
 
 	mutex_lock(&container_list_lock);
@@ -594,6 +594,10 @@ static ssize_t get_component_power_status(struct device *cdev,
 
 	if (edev->cb->get_power_status)
 		edev->cb->get_power_status(edev, ecomp);
+
+	if (ecomp->power_status == -1)
+		return -EINVAL;
+
 	return snprintf(buf, 40, "%s\n", ecomp->power_status ? "on" : "off");
 }
 
diff --git a/drivers/scsi/ses.c b/drivers/scsi/ses.c
index 50adabbb5808..2fafefd419c1 100644
--- a/drivers/scsi/ses.c
+++ b/drivers/scsi/ses.c
@@ -36,6 +36,12 @@
 
 #include <scsi/scsi_transport_sas.h>
 
+static bool power_status_on_probe;	/* default: false, 0. */
+module_param(power_status_on_probe, bool, 0644);
+MODULE_PARM_DESC(power_status_on_probe, "get power status of SES device slots "
+					"(enclosure components) at probe time "
+					"(warning: may delay total probe time)");
+
 struct ses_device {
 	unsigned char *page1;
 	unsigned char *page1_types;
@@ -548,7 +554,8 @@ static void ses_enclosure_data_process(struct enclosure_device *edev,
 					ecomp = &edev->component[components++];
 
 				if (!IS_ERR(ecomp)) {
-					ses_get_power_status(edev, ecomp);
+					if (power_status_on_probe)
+						ses_get_power_status(edev, ecomp);
 					if (addl_desc_ptr)
 						ses_process_descriptor(
 							ecomp,
-- 
1.8.3.1

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] scsi: ses: don't get power status of SES device slot on probe
  2017-03-30  1:02 [PATCH] scsi: ses: don't get power status of SES device slot on probe Mauricio Faria de Oliveira
@ 2017-04-04 21:07 ` Dan Williams
  2017-04-05 13:13   ` Mauricio Faria de Oliveira
  0 siblings, 1 reply; 5+ messages in thread
From: Dan Williams @ 2017-04-04 21:07 UTC (permalink / raw)
  To: Mauricio Faria de Oliveira
  Cc: linux-scsi, James E.J. Bottomley, Martin K. Petersen, Song Liu,
	Jens Axboe, Hannes Reinecke, Christoph Hellwig, gpiccoli

On Wed, Mar 29, 2017 at 6:02 PM, Mauricio Faria de Oliveira
<mauricfo@linux.vnet.ibm.com> wrote:
> The commit 08024885a2a3 ("ses: Add power_status to SES device slot")
> introduced the 'power_status' attribute to enclosure components and
> the associated callbacks.
>
> There are 2 callbacks available to get the power status of a device:
> 1) ses_get_power_status() for 'struct enclosure_component_callbacks'
> 2) get_component_power_status() for the sysfs device attribute
> (these are available for kernel-space and user-space, respectively.)
>
> However, despite both methods being available to get power status
> on demand, that commit also introduced a call to get power status
> in ses_enclosure_data_process().
>
> This dramatically increased the total probe time for SCSI devices
> on larger configurations, because ses_enclosure_data_process() is
> called several times during the SCSI devices probe and loops over
> the component devices (but that is another problem, another patch).
>
> That results in a tremendous continuous hammering of SCSI Receive
> Diagnostics commands to the enclosure-services device, which does
> delay the total probe time for the SCSI devices __significantly__:
>
>   Originally, ~34 minutes on a system attached to ~170 disks:
>
>     [ 9214.490703] mpt3sas version 13.100.00.00 loaded
>     ...
>     [11256.580231] scsi 17:0:177:0: qdepth(16), tagged(1), simple(0),
>                    ordered(0), scsi_level(6), cmd_que(1)
>
>   With this patch, it decreased to ~2.5 minutes -- a 13.6x faster
>
>     [ 1002.992533] mpt3sas version 13.100.00.00 loaded
>     ...
>     [ 1151.978831] scsi 11:0:177:0: qdepth(16), tagged(1), simple(0),
>                    ordered(0), scsi_level(6), cmd_que(1)
>
> Another downside is that the SCSI device class lock is held during
> the executions of ses_enclosure_data_process() since it's a callee
> of ses_intf_add(), which is executed under that lock in device_add().
> This ends up blocking the parallel SCSI scan functionality because
> device_add() depends on that lock, and also the removal of devices
> via device_del() (e.g., during an enclosure reboot).
>
> Back to the commit discussion.. on the ses_get_power_status() call
> introduced in ses_enclosure_data_process(): impact of removing it.
>
> That may possibly be in place to initialize the power status value
> on device probe.  However, those 2 functions available to retrieve
> that value _do_ automatically refresh/update it.  So the potential
> benefit would be a direct access of the 'power_status' field which
> does not use the callbacks...
>
> But the only reader of 'struct enclosure_component::power_status'
> is the get_component_power_status() callback for sysfs attribute,
> and it _does_ check for and call the .get_power_status callback,
> (which indeed is defined and implemented by that commit), so the
> power status value is, again, automatically updated.
>
> So, the remaining potential for a direct/non-callback access to
> the power_status attribute would be out-of-tree modules -- well,
> for those, if they are for whatever reason interested in values
> that are set during device probe and not up-to-date by the time
> they need it.. well, that would be curious.
>
> So, for out-of-tree modules and people still interested in this
> behavior (in case I might have missed something about it, other
> than the total probe time impact): let's add a module parameter
> to keep that check turned on (disabled by default).
>
> Also, to handle that more properly, set the initial power state
> value to '-1' (i.e., uninitialized) instead of '1' (power 'on'),
> and check for it in that callback which may do an direct access
> to the field value _if_ a callback function is not defined.
>
> Signed-off-by: Mauricio Faria de Oliveira <mauricfo@linux.vnet.ibm.com>
> Fixes: 08024885a2a3 ("ses: Add power_status to SES device slot")
> ---
>  drivers/misc/enclosure.c | 6 +++++-
>  drivers/scsi/ses.c       | 9 ++++++++-
>  2 files changed, 13 insertions(+), 2 deletions(-)
>
> diff --git a/drivers/misc/enclosure.c b/drivers/misc/enclosure.c
> index 65fed7146e9b..d3a8a13c4247 100644
> --- a/drivers/misc/enclosure.c
> +++ b/drivers/misc/enclosure.c
> @@ -148,7 +148,7 @@ struct enclosure_device *
>         for (i = 0; i < components; i++) {
>                 edev->component[i].number = -1;
>                 edev->component[i].slot = -1;
> -               edev->component[i].power_status = 1;
> +               edev->component[i].power_status = -1;
>         }
>
>         mutex_lock(&container_list_lock);
> @@ -594,6 +594,10 @@ static ssize_t get_component_power_status(struct device *cdev,
>
>         if (edev->cb->get_power_status)
>                 edev->cb->get_power_status(edev, ecomp);
> +
> +       if (ecomp->power_status == -1)
> +               return -EINVAL;
> +

Can we ever hit this if get_power_status is non-null?

>         return snprintf(buf, 40, "%s\n", ecomp->power_status ? "on" : "off");
>  }
>
> diff --git a/drivers/scsi/ses.c b/drivers/scsi/ses.c
> index 50adabbb5808..2fafefd419c1 100644
> --- a/drivers/scsi/ses.c
> +++ b/drivers/scsi/ses.c
> @@ -36,6 +36,12 @@
>
>  #include <scsi/scsi_transport_sas.h>
>
> +static bool power_status_on_probe;     /* default: false, 0. */
> +module_param(power_status_on_probe, bool, 0644);
> +MODULE_PARM_DESC(power_status_on_probe, "get power status of SES device slots "
> +                                       "(enclosure components) at probe time "
> +                                       "(warning: may delay total probe time)");
> +

I don't think we need this module parameter as long as the only way to
read the power status is through sysfs. We can always just leave it to
be read on-demand when needed.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] scsi: ses: don't get power status of SES device slot on probe
  2017-04-04 21:07 ` Dan Williams
@ 2017-04-05 13:13   ` Mauricio Faria de Oliveira
  2017-04-05 14:41     ` Dan Williams
  0 siblings, 1 reply; 5+ messages in thread
From: Mauricio Faria de Oliveira @ 2017-04-05 13:13 UTC (permalink / raw)
  To: Dan Williams
  Cc: linux-scsi, James E.J. Bottomley, Martin K. Petersen, Song Liu,
	Jens Axboe, Hannes Reinecke, Christoph Hellwig, gpiccoli

Hi Dan,

Thanks for reviewing.

On 04/04/2017 06:07 PM, Dan Williams wrote:
>> @@ -594,6 +594,10 @@ static ssize_t get_component_power_status(struct device *cdev,
>>
>>         if (edev->cb->get_power_status)
>>                 edev->cb->get_power_status(edev, ecomp);
>> +
>> +       if (ecomp->power_status == -1)
>> +               return -EINVAL;
>> +
> Can we ever hit this if get_power_status is non-null?

I admit that is cautionary, and more for consistency with the chance
of an uninitialized state being still in place, so not to return any
(incorrect) value in that state.

But, yes, in a few cases -- although unlikely.

1) imagine .get_power_status couldn't update the 'power_status' field
    (it's a bit unlikely with the in-tree ses driver, but in the case
     that ses_get_page2_descriptor() returns NULL, ses_get_power_status()
     doesn't update 'power_status').

2) an out-of-tree module employs another enclosure_component_callbacks
    structure, which doesn't implement a .get_power_status hook.

3) or it does, but that failed, and didn't update 'power_status' (e.g.,
    like case 1)


>> +static bool power_status_on_probe;     /* default: false, 0. */
>> +module_param(power_status_on_probe, bool, 0644);
>> +MODULE_PARM_DESC(power_status_on_probe, "get power status of SES device slots "
>> +                                       "(enclosure components) at probe time "
>> +                                       "(warning: may delay total probe time)");
>> +
> I don't think we need this module parameter as long as the only way to
> read the power status is through sysfs. We can always just leave it to
> be read on-demand when needed.

I agree for the in-tree module.  My only concern, as described in the
commit message, is the case someone is using an out-of-tree module /
has an implementation that depends on that (e.g., reading the 'power_
status' field directly, instead of using .get_power_status.

I've been a bit overzealous when considering why that call was inserted
in ses_enclosure_data_process(), and didn't want to break them.

However, for an in-tree usage, it's clear that just dropping that line
seemed to be sufficient -- the rest in the patch is just consideration
for whoever might be using it in out-of-tree ways.

(and if such consideration is deemed not to be required in this case,
  I'd be fine with dropping the related changes and making this patch
  a one-line. :- )

cheers,

-- 
Mauricio Faria de Oliveira
IBM Linux Technology Center

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] scsi: ses: don't get power status of SES device slot on probe
  2017-04-05 13:13   ` Mauricio Faria de Oliveira
@ 2017-04-05 14:41     ` Dan Williams
  2017-04-05 14:59       ` Mauricio Faria de Oliveira
  0 siblings, 1 reply; 5+ messages in thread
From: Dan Williams @ 2017-04-05 14:41 UTC (permalink / raw)
  To: Mauricio Faria de Oliveira
  Cc: linux-scsi, James E.J. Bottomley, Martin K. Petersen, Song Liu,
	Jens Axboe, Hannes Reinecke, Christoph Hellwig, gpiccoli

On Wed, Apr 5, 2017 at 6:13 AM, Mauricio Faria de Oliveira
<mauricfo@linux.vnet.ibm.com> wrote:
> Hi Dan,
>
> Thanks for reviewing.
>
> On 04/04/2017 06:07 PM, Dan Williams wrote:
>>>
>>> @@ -594,6 +594,10 @@ static ssize_t get_component_power_status(struct
>>> device *cdev,
>>>
>>>         if (edev->cb->get_power_status)
>>>                 edev->cb->get_power_status(edev, ecomp);
>>> +
>>> +       if (ecomp->power_status == -1)
>>> +               return -EINVAL;
>>> +
>>
>> Can we ever hit this if get_power_status is non-null?
>
>
> I admit that is cautionary, and more for consistency with the chance
> of an uninitialized state being still in place, so not to return any
> (incorrect) value in that state.
>
> But, yes, in a few cases -- although unlikely.
>
> 1) imagine .get_power_status couldn't update the 'power_status' field
>    (it's a bit unlikely with the in-tree ses driver, but in the case
>     that ses_get_page2_descriptor() returns NULL, ses_get_power_status()
>     doesn't update 'power_status').

Ok, in that case we should probably return -EIO, if get_power_status
is non-NULL, but the power_status is still -1.

> 2) an out-of-tree module employs another enclosure_component_callbacks
>    structure, which doesn't implement a .get_power_status hook.

If get_power_status is null and power_status is -1 I think we should
return -ENOTTY to indicate the failure.

> 3) or it does, but that failed, and didn't update 'power_status' (e.g.,
>    like case 1)

I think we're back -EIO for this.

>
>
>>> +static bool power_status_on_probe;     /* default: false, 0. */
>>> +module_param(power_status_on_probe, bool, 0644);
>>> +MODULE_PARM_DESC(power_status_on_probe, "get power status of SES device
>>> slots "
>>> +                                       "(enclosure components) at probe
>>> time "
>>> +                                       "(warning: may delay total probe
>>> time)");
>>> +
>>
>> I don't think we need this module parameter as long as the only way to
>> read the power status is through sysfs. We can always just leave it to
>> be read on-demand when needed.
>
>
> I agree for the in-tree module.  My only concern, as described in the
> commit message, is the case someone is using an out-of-tree module /
> has an implementation that depends on that (e.g., reading the 'power_
> status' field directly, instead of using .get_power_status.
>
> I've been a bit overzealous when considering why that call was inserted
> in ses_enclosure_data_process(), and didn't want to break them.
>
> However, for an in-tree usage, it's clear that just dropping that line
> seemed to be sufficient -- the rest in the patch is just consideration
> for whoever might be using it in out-of-tree ways.
>
> (and if such consideration is deemed not to be required in this case,
>  I'd be fine with dropping the related changes and making this patch
>  a one-line. :- )

Let's not carry module parameters that only theoretically help an out
of tree module. If such a driver exists its owner can just holler if
this policy change breaks their usage, and then we can have a
discussion about why their driver isn't upstream already.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] scsi: ses: don't get power status of SES device slot on probe
  2017-04-05 14:41     ` Dan Williams
@ 2017-04-05 14:59       ` Mauricio Faria de Oliveira
  0 siblings, 0 replies; 5+ messages in thread
From: Mauricio Faria de Oliveira @ 2017-04-05 14:59 UTC (permalink / raw)
  To: Dan Williams, Song Liu
  Cc: linux-scsi, James E.J. Bottomley, Martin K. Petersen, Jens Axboe,
	Hannes Reinecke, Christoph Hellwig, gpiccoli

On 04/05/2017 11:41 AM, Dan Williams wrote:
> On Wed, Apr 5, 2017 at 6:13 AM, Mauricio Faria de Oliveira
> <mauricfo@linux.vnet.ibm.com> wrote:

>> 1) imagine .get_power_status couldn't update the 'power_status' field
>>    (it's a bit unlikely with the in-tree ses driver, but in the case
>>     that ses_get_page2_descriptor() returns NULL, ses_get_power_status()
>>     doesn't update 'power_status').
> Ok, in that case we should probably return -EIO, if get_power_status
> is non-NULL, but the power_status is still -1.
>
>> 2) an out-of-tree module employs another enclosure_component_callbacks
>>    structure, which doesn't implement a .get_power_status hook.
> If get_power_status is null and power_status is -1 I think we should
> return -ENOTTY to indicate the failure.
>
>> 3) or it does, but that failed, and didn't update 'power_status' (e.g.,
>>    like case 1)
> I think we're back -EIO for this.

Absolutely. I see those are better values to convey the failure/reason.


>> However, for an in-tree usage, it's clear that just dropping that line
>> seemed to be sufficient -- the rest in the patch is just consideration
>> for whoever might be using it in out-of-tree ways.
>>
>> (and if such consideration is deemed not to be required in this case,
>>  I'd be fine with dropping the related changes and making this patch
>>  a one-line. :- )
> Let's not carry module parameters that only theoretically help an out
> of tree module. If such a driver exists its owner can just holler if
> this policy change breaks their usage, and then we can have a
> discussion about why their driver isn't upstream already.

Okay, got it.
For the record, the patch author has been in To:/Cc:, for such cases.

I'll submit a v2.

Thanks,

-- 
Mauricio Faria de Oliveira
IBM Linux Technology Center

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2017-04-05 14:59 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-03-30  1:02 [PATCH] scsi: ses: don't get power status of SES device slot on probe Mauricio Faria de Oliveira
2017-04-04 21:07 ` Dan Williams
2017-04-05 13:13   ` Mauricio Faria de Oliveira
2017-04-05 14:41     ` Dan Williams
2017-04-05 14:59       ` Mauricio Faria de Oliveira

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.