All of lore.kernel.org
 help / color / mirror / Atom feed
From: Ren Mingxin <renmx@cn.fujitsu.com>
To: emilne@redhat.com, Hannes Reinecke <hare@suse.de>
Cc: bmr@redhat.com, James Bottomley <jbottomley@parallels.com>,
	linux-scsi@vger.kernel.org, Bart van Assche <bvanassche@acm.org>,
	Joern Engel <joern@logfs.org>
Subject: Re: [PATCHv2 0/7] Limit overall SCSI EH runtime
Date: Mon, 15 Jul 2013 18:33:25 +0800	[thread overview]
Message-ID: <51E3CFF5.2070501@cn.fujitsu.com> (raw)
In-Reply-To: <1373635840.7420.139.camel@localhost.localdomain>

Hi, Ewan:

On 07/12/2013 09:30 PM, Ewan Milne wrote:
> On Fri, 2013-07-12 at 13:54 +0800, Ren Mingxin wrote:
>> I'm wondering how do you test, with a special hardware or self-made
>> module?Would you mind pasting your test method() and result?
> This was tested in a SAN environment with an EMC Symmetrix and
> Brocade FC switches.  The error was injected by the following
> commands:
>
> portcfg rscnsupr<port>  --enable
> portdisable<port>
>
> Where<port>  is the FC port of the Symmetrix target.
>
> Multipath is used and the test records how long I/O from userspace
> takes to complete after the error handling stops and the I/O is
> retried on another path.
>
> What happens is that the target never responds to anything the
> HBA sends, so commands and TMFs just timeout.  The HBA doesn't
> see link down (since it is the target port) and doesn't get an
> RSCN.  When the HBA is finally reset, however, it can't login
> to the target port and so further I/O gets an immediate error.
>
> Unfortunately, not all SAN environments will exhibit the failing
> behavior -- it appears as if in some cases the HBA detects the
> problem regardless of the switch portcfg setting.  But this has
> been verified to solve the problem of seemingly endless EH
> activity in testing at a large customer site.

Thanks in advance for your explanations in detail. I've been able to
reproduce only with this patchset.

> Also, to be clear, we tested with the "Limit overall SCSI EH
> runtime" patchset but not the "New EH command timeout handler".
> I think the changes to issue the abort in the timeout handler
> are a good idea, though, because there really is no need to
> wait for all activity on the host to cease before issuing the
> abort as far as I can see.

Hmm, agree with you. It is much better to issue aborts without
waiting, which can shorten the timeout handling time.

>>> Acked-by: Ewan D. Milne<emilne@redhat.com>
>>>

Hi, Hannes:

I noticed that the dd time had been reduced from 6m+ to 2m+ when the
'eh_deadline' was set as 30s, but the dd time was 6m+(nearly the same
as default - 'eh_deadline' was 0) when the 'eh_deadline' was set as
10s. I havn't been able to dig further, but I guess there is some
restriction when setting this 'eh_deadline' interface. Maybe should
not less than some timeout, otherwise 'eh_deadline' setting will not
work?

Thanks,
Ren

  reply	other threads:[~2013-07-15 10:29 UTC|newest]

Thread overview: 32+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-07-01  6:50 [PATCHv2 0/7] Limit overall SCSI EH runtime Hannes Reinecke
2013-07-01  6:50 ` [PATCH 1/7] dpt_i2o: Remove DPTI_STATE_IOCTL Hannes Reinecke
2013-07-01  6:50 ` [PATCH 2/7] dpt_i2o: return SCSI_MLQUEUE_HOST_BUSY when in reset Hannes Reinecke
2013-07-01  6:50 ` [PATCH 3/7] advansys: Remove 'last_reset' references Hannes Reinecke
2013-07-01  6:50 ` [PATCH 4/7] tmscsim: Move 'last_reset' into host structure Hannes Reinecke
2013-07-01  6:50 ` [PATCH 5/7] dc395: Move 'last_reset' into internal " Hannes Reinecke
2013-07-01  6:50 ` [PATCH 6/7] scsi: remove check for 'resetting' Hannes Reinecke
2013-07-01  6:50 ` [PATCH 7/7] scsi: Add 'eh_deadline' to limit SCSI EH runtime Hannes Reinecke
2013-09-20  7:48   ` Ren Mingxin
2013-10-16 19:22   ` James Bottomley
2013-10-17 14:27     ` Ewan Milne
2013-10-23  9:25     ` Hannes Reinecke
2013-10-23  7:46       ` James Bottomley
2013-10-23  9:49         ` Hannes Reinecke
2013-07-01 17:44 ` [PATCHv2 0/7] Limit overall " Jörn Engel
2013-07-01 19:23   ` James Bottomley
2013-07-01 20:55     ` Jörn Engel
2013-07-02  5:48       ` Hannes Reinecke
2013-07-02  6:37       ` James Bottomley
2013-07-02 14:58         ` Jörn Engel
2013-07-02 16:33           ` James Bottomley
2013-07-02 15:50             ` Jörn Engel
2013-07-10 20:35 ` Ewan Milne
2013-07-12  5:54   ` Ren Mingxin
2013-07-12 13:30     ` Ewan Milne
2013-07-15 10:33       ` Ren Mingxin [this message]
2013-07-26  9:52         ` Ren Mingxin
2013-08-07  6:43   ` Ren Mingxin
2013-08-29 13:06     ` Hannes Reinecke
2013-09-24 20:51       ` Ric Wheeler
2013-09-25  5:48         ` Hannes Reinecke
2013-10-02 16:21           ` Hannes Reinecke

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=51E3CFF5.2070501@cn.fujitsu.com \
    --to=renmx@cn.fujitsu.com \
    --cc=bmr@redhat.com \
    --cc=bvanassche@acm.org \
    --cc=emilne@redhat.com \
    --cc=hare@suse.de \
    --cc=jbottomley@parallels.com \
    --cc=joern@logfs.org \
    --cc=linux-scsi@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.