linux-block.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Hans Holmberg <Hans.Holmberg@wdc.com>
To: Damien Le Moal <Damien.LeMoal@wdc.com>,
	"linux-block@vger.kernel.org" <linux-block@vger.kernel.org>,
	Jens Axboe <axboe@kernel.dk>
Subject: Re: [PATCH] block: mq-deadline: Fix queue restart handling
Date: Wed, 28 Aug 2019 07:41:28 +0000	[thread overview]
Message-ID: <SN6PR04MB38864ED05BEEAB9773F0C8DAEBA30@SN6PR04MB3886.namprd04.prod.outlook.com> (raw)
In-Reply-To: 20190828044020.23915-1-damien.lemoal@wdc.com

On 2019-08-28 06:40, Damien Le Moal wrote:
> Commit 7211aef86f79 ("block: mq-deadline: Fix write completion
> handling") added a call to blk_mq_sched_mark_restart_hctx() in
> dd_dispatch_request() to make sure that write request dispatching does
> not stall when all target zones are locked. This fix left a subtle race
> when a write completion happens during a dispatch execution on another
> CPU:
> 
> CPU 0: Dispatch			CPU1: write completion
> 
> dd_dispatch_request()
>      lock(&dd->lock);
>      ...
>      lock(&dd->zone_lock);	dd_finish_request()
>      rq = find request		lock(&dd->zone_lock);
>      unlock(&dd->zone_lock);
>      				zone write unlock
> 				unlock(&dd->zone_lock);
> 				...
> 				__blk_mq_free_request
>                                        check restart flag (not set)
> 				      -> queue not run
>      ...
>      if (!rq && have writes)
>          blk_mq_sched_mark_restart_hctx()
>      unlock(&dd->lock)
> 
> Since the dispatch context finishes after the write request completion
> handling, marking the queue as needing a restart is not seen from
> __blk_mq_free_request() and blk_mq_sched_restart() not executed leading
> to the dispatch stall under 100% write workloads.
> 
> Fix this by moving the call to blk_mq_sched_mark_restart_hctx() from
> dd_dispatch_request() into dd_finish_request() under the zone lock to
> ensure full mutual exclusion between write request dispatch selection
> and zone unlock on write request completion.
> 
> Fixes: 7211aef86f79 ("block: mq-deadline: Fix write completion handling")
> Cc: stable@vger.kernel.org
> Reported-by: Hans Holmberg <Hans.Holmberg@wdc.com>
> Signed-off-by: Damien Le Moal <damien.lemoal@wdc.com>
> ---
>   block/mq-deadline.c | 19 +++++++++----------
>   1 file changed, 9 insertions(+), 10 deletions(-)
> 
> diff --git a/block/mq-deadline.c b/block/mq-deadline.c
> index a17466f310f4..b490f47fd553 100644
> --- a/block/mq-deadline.c
> +++ b/block/mq-deadline.c
> @@ -377,13 +377,6 @@ static struct request *__dd_dispatch_request(struct deadline_data *dd)
>    * hardware queue, but we may return a request that is for a
>    * different hardware queue. This is because mq-deadline has shared
>    * state for all hardware queues, in terms of sorting, FIFOs, etc.
> - *
> - * For a zoned block device, __dd_dispatch_request() may return NULL
> - * if all the queued write requests are directed at zones that are already
> - * locked due to on-going write requests. In this case, make sure to mark
> - * the queue as needing a restart to ensure that the queue is run again
> - * and the pending writes dispatched once the target zones for the ongoing
> - * write requests are unlocked in dd_finish_request().
>    */
>   static struct request *dd_dispatch_request(struct blk_mq_hw_ctx *hctx)
>   {
> @@ -392,9 +385,6 @@ static struct request *dd_dispatch_request(struct blk_mq_hw_ctx *hctx)
>   
>   	spin_lock(&dd->lock);
>   	rq = __dd_dispatch_request(dd);
> -	if (!rq && blk_queue_is_zoned(hctx->queue) &&
> -	    !list_empty(&dd->fifo_list[WRITE]))
> -		blk_mq_sched_mark_restart_hctx(hctx);
>   	spin_unlock(&dd->lock);
>   
>   	return rq;
> @@ -561,6 +551,13 @@ static void dd_prepare_request(struct request *rq, struct bio *bio)
>    * spinlock so that the zone is never unlocked while deadline_fifo_request()
>    * or deadline_next_request() are executing. This function is called for
>    * all requests, whether or not these requests complete successfully.
> + *
> + * For a zoned block device, __dd_dispatch_request() may have stopped
> + * dispatching requests if all the queued requests are write requests directed
> + * at zones that are already locked due to on-going write requests. To ensure
> + * write request dispatch progress in this case, mark the queue as needing a
> + * restart to ensure that the queue is run again after completion of the
> + * request and zones being unlocked.
>    */
>   static void dd_finish_request(struct request *rq)
>   {
> @@ -572,6 +569,8 @@ static void dd_finish_request(struct request *rq)
>   
>   		spin_lock_irqsave(&dd->zone_lock, flags);
>   		blk_req_zone_write_unlock(rq);
> +		if (!list_empty(&dd->fifo_list[WRITE]))
> +			blk_mq_sched_mark_restart_hctx(rq->mq_hctx);
>   		spin_unlock_irqrestore(&dd->zone_lock, flags);
>   	}
>   }
> 

Looks good to me.

Reviewed-by: Hans Holmberg <hans.holmberg@wdc.com>


  reply	other threads:[~2019-08-28  7:41 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2019-08-28  4:40 [PATCH] block: mq-deadline: Fix queue restart handling Damien Le Moal
2019-08-28  7:41 ` Hans Holmberg [this message]
2019-08-28 15:04 ` Sasha Levin
2019-09-03  9:05 ` Christoph Hellwig
2019-09-03 14:00 ` Jens Axboe

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=SN6PR04MB38864ED05BEEAB9773F0C8DAEBA30@SN6PR04MB3886.namprd04.prod.outlook.com \
    --to=hans.holmberg@wdc.com \
    --cc=Damien.LeMoal@wdc.com \
    --cc=axboe@kernel.dk \
    --cc=linux-block@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).