linux-raid.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH] Fix deadlock with request based dm and some drivers
@ 2012-11-06 11:29 Jens Axboe
  2012-11-06 12:14 ` Jun'ichi Nomura
  0 siblings, 1 reply; 7+ messages in thread
From: Jens Axboe @ 2012-11-06 11:29 UTC (permalink / raw)
  To: Alasdair Kergon, linux-raid

Hi Alasdair,

I recently fixed a deadlock for a customer we have with
the below patch. I have queued it up in my tree as not
to lose it. Can I have an ack from you, or do you want
to submit it yourself? I've marked it stable as well.


From 8ca211056519ac06bc96fb134dca1f8eb2141407 Mon Sep 17 00:00:00 2001
From: Jens Axboe <axboe@kernel.dk>
Date: Tue, 6 Nov 2012 12:24:26 +0100
Subject: [PATCH] dm: fix deadlock with request based dm and queue request_fn
 recursion

Request based dm attempts to re-run the request queue off the
request completion path. If used with a driver that potentially does
end_io from its request_fn, we could deadlock trying to recurse
back into request dispatch. Fix this by punting the request queue
run to kblockd.

Tested to fix a quickly reproducible deadlock in such a scenario.

Cc: stable@kernel.org
Signed-off-by: Jens Axboe <axboe@kernel.dk>
---
 drivers/md/dm.c | 8 +++++++-
 1 file changed, 7 insertions(+), 1 deletion(-)

diff --git a/drivers/md/dm.c b/drivers/md/dm.c
index 02db918..77e6eff 100644
--- a/drivers/md/dm.c
+++ b/drivers/md/dm.c
@@ -740,8 +740,14 @@ static void rq_completed(struct mapped_device *md, int rw, int run_queue)
 	if (!md_in_flight(md))
 		wake_up(&md->wait);
 
+	/*
+	 * Run this off this callpath, as drivers could invoke end_io while
+	 * inside their request_fn (and holding the queue lock). Calling
+	 * back into ->request_fn() could deadlock attempting to grab the
+	 * queue lock again.
+	 */
 	if (run_queue)
-		blk_run_queue(md->queue);
+		blk_run_queue_async(md->queue);
 
 	/*
 	 * dm_put() must be at the end of this function. See the comment above
-- 
1.7.12.rc3


-- 
Jens Axboe


^ permalink raw reply related	[flat|nested] 7+ messages in thread

* Re: [PATCH] Fix deadlock with request based dm and some drivers
  2012-11-06 11:29 [PATCH] Fix deadlock with request based dm and some drivers Jens Axboe
@ 2012-11-06 12:14 ` Jun'ichi Nomura
  2012-11-06 12:35   ` Jens Axboe
  0 siblings, 1 reply; 7+ messages in thread
From: Jun'ichi Nomura @ 2012-11-06 12:14 UTC (permalink / raw)
  To: Jens Axboe, Alasdair Kergon; +Cc: linux-raid, device-mapper development

(Cc to dm-devel)

On 11/06/12 20:29, Jens Axboe wrote:
> I recently fixed a deadlock for a customer we have with
> the below patch. I have queued it up in my tree as not
> to lose it. Can I have an ack from you, or do you want
> to submit it yourself? I've marked it stable as well.

Could you tell details about how the deadlock happened?

dm's end_io always throws the completion handling to softirq:
  end_clone_request()
    dm_complete_request()
      blk_complete_request()

then it is processed in softirq context:
  dm_softirq_done()
    dm_done()
      dm_end_request()
        rq_completed(run_queue=true)
          blk_run_queue()

Since queue_lock is always held with interrupt disabled,
I couldn't see why it could deadlock.

Request-based dm followed the completion model of scsi
mid layer. So similar code path exists in scsi, too.
For example:
  scsi_request_fn()
    scsi_kill_request()
      blk_complete_request()
then:
  scsi_softirq_done()
    scsi_io_completion()
      scsi_next_command()
        scsi_run_queue()
          spin_lock queue_lock
          __blk_run_queue()

If calling run-queue from softirq_done_fn() can cause deadlock,
I'm afraid the problem is not limited to dm.

> From 8ca211056519ac06bc96fb134dca1f8eb2141407 Mon Sep 17 00:00:00 2001
> From: Jens Axboe <axboe@kernel.dk>
> Date: Tue, 6 Nov 2012 12:24:26 +0100
> Subject: [PATCH] dm: fix deadlock with request based dm and queue request_fn
>  recursion
> 
> Request based dm attempts to re-run the request queue off the
> request completion path. If used with a driver that potentially does
> end_io from its request_fn, we could deadlock trying to recurse
> back into request dispatch. Fix this by punting the request queue
> run to kblockd.
> 
> Tested to fix a quickly reproducible deadlock in such a scenario.
> 
> Cc: stable@kernel.org
> Signed-off-by: Jens Axboe <axboe@kernel.dk>
> ---
>  drivers/md/dm.c | 8 +++++++-
>  1 file changed, 7 insertions(+), 1 deletion(-)
> 
> diff --git a/drivers/md/dm.c b/drivers/md/dm.c
> index 02db918..77e6eff 100644
> --- a/drivers/md/dm.c
> +++ b/drivers/md/dm.c
> @@ -740,8 +740,14 @@ static void rq_completed(struct mapped_device *md, int rw, int run_queue)
>  	if (!md_in_flight(md))
>  		wake_up(&md->wait);
>  
> +	/*
> +	 * Run this off this callpath, as drivers could invoke end_io while
> +	 * inside their request_fn (and holding the queue lock). Calling
> +	 * back into ->request_fn() could deadlock attempting to grab the
> +	 * queue lock again.
> +	 */
>  	if (run_queue)
> -		blk_run_queue(md->queue);
> +		blk_run_queue_async(md->queue);
>  
>  	/*
>  	 * dm_put() must be at the end of this function. See the comment above


-- 
Jun'ichi Nomura, NEC Corporation

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] Fix deadlock with request based dm and some drivers
  2012-11-06 12:14 ` Jun'ichi Nomura
@ 2012-11-06 12:35   ` Jens Axboe
  2012-11-07  0:15     ` Jun'ichi Nomura
  0 siblings, 1 reply; 7+ messages in thread
From: Jens Axboe @ 2012-11-06 12:35 UTC (permalink / raw)
  To: Jun'ichi Nomura
  Cc: Alasdair Kergon, linux-raid, device-mapper development

On 2012-11-06 13:14, Jun'ichi Nomura wrote:
> (Cc to dm-devel)
> 
> On 11/06/12 20:29, Jens Axboe wrote:
>> I recently fixed a deadlock for a customer we have with
>> the below patch. I have queued it up in my tree as not
>> to lose it. Can I have an ack from you, or do you want
>> to submit it yourself? I've marked it stable as well.
> 
> Could you tell details about how the deadlock happened?
> 
> dm's end_io always throws the completion handling to softirq:
>   end_clone_request()
>     dm_complete_request()
>       blk_complete_request()
> 
> then it is processed in softirq context:
>   dm_softirq_done()
>     dm_done()
>       dm_end_request()
>         rq_completed(run_queue=true)
>           blk_run_queue()
> 
> Since queue_lock is always held with interrupt disabled,
> I couldn't see why it could deadlock.
> 
> Request-based dm followed the completion model of scsi
> mid layer. So similar code path exists in scsi, too.
> For example:
>   scsi_request_fn()
>     scsi_kill_request()
>       blk_complete_request()
> then:
>   scsi_softirq_done()
>     scsi_io_completion()
>       scsi_next_command()
>         scsi_run_queue()
>           spin_lock queue_lock
>           __blk_run_queue()
> 
> If calling run-queue from softirq_done_fn() can cause deadlock,
> I'm afraid the problem is not limited to dm.

dm_softirq_done()
    rq_completed()
        blk_run_queue()
            dm_request_fn()
                dm_dispatch_request()
                    blk_insert_cloned_request()
                        __elv_add_request()
                            elv_insert()
                                blk_run_queue()
                                    ...

Basically you recurse back into the request handler, since it ends up
running the queue. But I see I've been too focused on the older release,
since we don't actually do that anymore after the plugging rewrite. So
it should actually be safe in current kernels and hence the patch only
needed for the stable series where that is not the case.

-- 
Jens Axboe


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] Fix deadlock with request based dm and some drivers
  2012-11-06 12:35   ` Jens Axboe
@ 2012-11-07  0:15     ` Jun'ichi Nomura
  2012-11-07  8:00       ` Jens Axboe
  0 siblings, 1 reply; 7+ messages in thread
From: Jun'ichi Nomura @ 2012-11-07  0:15 UTC (permalink / raw)
  To: Jens Axboe; +Cc: Alasdair Kergon, linux-raid, device-mapper development

On 11/06/12 21:35, Jens Axboe wrote:
> On 2012-11-06 13:14, Jun'ichi Nomura wrote:
>> (Cc to dm-devel)
>>
>> On 11/06/12 20:29, Jens Axboe wrote:
>>> I recently fixed a deadlock for a customer we have with
>>> the below patch. I have queued it up in my tree as not
>>> to lose it. Can I have an ack from you, or do you want
>>> to submit it yourself? I've marked it stable as well.
>>
>> Could you tell details about how the deadlock happened?
>>
>> dm's end_io always throws the completion handling to softirq:
>>   end_clone_request()
>>     dm_complete_request()
>>       blk_complete_request()
>>
>> then it is processed in softirq context:
>>   dm_softirq_done()
>>     dm_done()
>>       dm_end_request()
>>         rq_completed(run_queue=true)
>>           blk_run_queue()
>>
>> Since queue_lock is always held with interrupt disabled,
>> I couldn't see why it could deadlock.
>>
>> Request-based dm followed the completion model of scsi
>> mid layer. So similar code path exists in scsi, too.
>> For example:
>>   scsi_request_fn()
>>     scsi_kill_request()
>>       blk_complete_request()
>> then:
>>   scsi_softirq_done()
>>     scsi_io_completion()
>>       scsi_next_command()
>>         scsi_run_queue()
>>           spin_lock queue_lock
>>           __blk_run_queue()
>>
>> If calling run-queue from softirq_done_fn() can cause deadlock,
>> I'm afraid the problem is not limited to dm.
> 
> dm_softirq_done()
>     rq_completed()
>         blk_run_queue()
          ^^ this is for dm's queue
>             dm_request_fn()
>                 dm_dispatch_request()
>                     blk_insert_cloned_request()
>                         __elv_add_request()
>                             elv_insert()
>                                 blk_run_queue()
                                  ^^ this is for lower device's queue
>                                     ...
> 
> Basically you recurse back into the request handler, since it ends up
> running the queue.

But the queues are different as commented inline above.
So it should be ok. (from deadlock point of view)

> But I see I've been too focused on the older release,
> since we don't actually do that anymore after the plugging rewrite. So
> it should actually be safe in current kernels and hence the patch only
> needed for the stable series where that is not the case.

BTW, using blk_run_queue_async() in softirq_done_fn() might be good
from performance point of view? It may reduce latency of the softirq
and have an effect of batching request_fn calls.

-- 
Jun'ichi Nomura, NEC Corporation

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] Fix deadlock with request based dm and some drivers
  2012-11-07  0:15     ` Jun'ichi Nomura
@ 2012-11-07  8:00       ` Jens Axboe
  2012-11-07  8:35         ` Jun'ichi Nomura
  0 siblings, 1 reply; 7+ messages in thread
From: Jens Axboe @ 2012-11-07  8:00 UTC (permalink / raw)
  To: Jun'ichi Nomura
  Cc: Alasdair Kergon, linux-raid, device-mapper development

On 2012-11-07 01:15, Jun'ichi Nomura wrote:
> On 11/06/12 21:35, Jens Axboe wrote:
>> On 2012-11-06 13:14, Jun'ichi Nomura wrote:
>>> (Cc to dm-devel)
>>>
>>> On 11/06/12 20:29, Jens Axboe wrote:
>>>> I recently fixed a deadlock for a customer we have with
>>>> the below patch. I have queued it up in my tree as not
>>>> to lose it. Can I have an ack from you, or do you want
>>>> to submit it yourself? I've marked it stable as well.
>>>
>>> Could you tell details about how the deadlock happened?
>>>
>>> dm's end_io always throws the completion handling to softirq:
>>>   end_clone_request()
>>>     dm_complete_request()
>>>       blk_complete_request()
>>>
>>> then it is processed in softirq context:
>>>   dm_softirq_done()
>>>     dm_done()
>>>       dm_end_request()
>>>         rq_completed(run_queue=true)
>>>           blk_run_queue()
>>>
>>> Since queue_lock is always held with interrupt disabled,
>>> I couldn't see why it could deadlock.
>>>
>>> Request-based dm followed the completion model of scsi
>>> mid layer. So similar code path exists in scsi, too.
>>> For example:
>>>   scsi_request_fn()
>>>     scsi_kill_request()
>>>       blk_complete_request()
>>> then:
>>>   scsi_softirq_done()
>>>     scsi_io_completion()
>>>       scsi_next_command()
>>>         scsi_run_queue()
>>>           spin_lock queue_lock
>>>           __blk_run_queue()
>>>
>>> If calling run-queue from softirq_done_fn() can cause deadlock,
>>> I'm afraid the problem is not limited to dm.
>>
>> dm_softirq_done()
>>     rq_completed()
>>         blk_run_queue()
>           ^^ this is for dm's queue
>>             dm_request_fn()
>>                 dm_dispatch_request()
>>                     blk_insert_cloned_request()
>>                         __elv_add_request()
>>                             elv_insert()
>>                                 blk_run_queue()
>                                   ^^ this is for lower device's queue
>>                                     ...
>>
>> Basically you recurse back into the request handler, since it ends up
>> running the queue.
> 
> But the queues are different as commented inline above.
> So it should be ok. (from deadlock point of view)

It's still not OK, some drivers end up doing spin_unlock_irq() in their
request_fn. Running unknown request_fn from ipi/irq is a bad idea, imho,
it'll quickly cause problems.

>> But I see I've been too focused on the older release,
>> since we don't actually do that anymore after the plugging rewrite. So
>> it should actually be safe in current kernels and hence the patch only
>> needed for the stable series where that is not the case.
> 
> BTW, using blk_run_queue_async() in softirq_done_fn() might be good
> from performance point of view? It may reduce latency of the softirq
> and have an effect of batching request_fn calls.

Yes, I argued the same for the original people who saw the problem. It
is quite an extensive and expensive path to have off the IPI handler. So
from that point of view, I'd recommend the patch as well.

-- 
Jens Axboe


^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] Fix deadlock with request based dm and some drivers
  2012-11-07  8:00       ` Jens Axboe
@ 2012-11-07  8:35         ` Jun'ichi Nomura
  2012-11-07 10:59           ` [dm-devel] " Alasdair G Kergon
  0 siblings, 1 reply; 7+ messages in thread
From: Jun'ichi Nomura @ 2012-11-07  8:35 UTC (permalink / raw)
  To: Jens Axboe; +Cc: Alasdair Kergon, linux-raid, device-mapper development

On 11/07/12 17:00, Jens Axboe wrote:
> On 2012-11-07 01:15, Jun'ichi Nomura wrote:
>> On 11/06/12 21:35, Jens Axboe wrote:
>>> On 2012-11-06 13:14, Jun'ichi Nomura wrote:
>>>> (Cc to dm-devel)
>>>>
>>>> On 11/06/12 20:29, Jens Axboe wrote:
>>>>> I recently fixed a deadlock for a customer we have with
>>>>> the below patch. I have queued it up in my tree as not
>>>>> to lose it. Can I have an ack from you, or do you want
>>>>> to submit it yourself? I've marked it stable as well.
>>>>
>>>> Could you tell details about how the deadlock happened?
>>>>
>>>> dm's end_io always throws the completion handling to softirq:
>>>>   end_clone_request()
>>>>     dm_complete_request()
>>>>       blk_complete_request()
>>>>
>>>> then it is processed in softirq context:
>>>>   dm_softirq_done()
>>>>     dm_done()
>>>>       dm_end_request()
>>>>         rq_completed(run_queue=true)
>>>>           blk_run_queue()
>>>>
>>>> Since queue_lock is always held with interrupt disabled,
>>>> I couldn't see why it could deadlock.
>>>>
>>>> Request-based dm followed the completion model of scsi
>>>> mid layer. So similar code path exists in scsi, too.
>>>> For example:
>>>>   scsi_request_fn()
>>>>     scsi_kill_request()
>>>>       blk_complete_request()
>>>> then:
>>>>   scsi_softirq_done()
>>>>     scsi_io_completion()
>>>>       scsi_next_command()
>>>>         scsi_run_queue()
>>>>           spin_lock queue_lock
>>>>           __blk_run_queue()
>>>>
>>>> If calling run-queue from softirq_done_fn() can cause deadlock,
>>>> I'm afraid the problem is not limited to dm.
>>>
>>> dm_softirq_done()
>>>     rq_completed()
>>>         blk_run_queue()
>>           ^^ this is for dm's queue
>>>             dm_request_fn()
>>>                 dm_dispatch_request()
>>>                     blk_insert_cloned_request()
>>>                         __elv_add_request()
>>>                             elv_insert()
>>>                                 blk_run_queue()
>>                                   ^^ this is for lower device's queue
>>>                                     ...
>>>
>>> Basically you recurse back into the request handler, since it ends up
>>> running the queue.
>>
>> But the queues are different as commented inline above.
>> So it should be ok. (from deadlock point of view)
> 
> It's still not OK, some drivers end up doing spin_unlock_irq() in their
> request_fn. Running unknown request_fn from ipi/irq is a bad idea, imho,
> it'll quickly cause problems.

Though I still don't think there is actual deadlock because
dm's queue lock is released before dm_dispatch_request(),
I see your point why it's potentially bad.

>>> But I see I've been too focused on the older release,
>>> since we don't actually do that anymore after the plugging rewrite. So
>>> it should actually be safe in current kernels and hence the patch only
>>> needed for the stable series where that is not the case.
>>
>> BTW, using blk_run_queue_async() in softirq_done_fn() might be good
>> from performance point of view? It may reduce latency of the softirq
>> and have an effect of batching request_fn calls.
> 
> Yes, I argued the same for the original people who saw the problem. It
> is quite an extensive and expensive path to have off the IPI handler. So
> from that point of view, I'd recommend the patch as well.

Thank you for confirmation.
If it's good for performance, the patch is fine with me.

-- 
Jun'ichi Nomura, NEC Corporation

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [dm-devel] [PATCH] Fix deadlock with request based dm and some drivers
  2012-11-07  8:35         ` Jun'ichi Nomura
@ 2012-11-07 10:59           ` Alasdair G Kergon
  0 siblings, 0 replies; 7+ messages in thread
From: Alasdair G Kergon @ 2012-11-07 10:59 UTC (permalink / raw)
  To: Jens Axboe; +Cc: Jun'ichi Nomura, linux-raid, device-mapper development

On Wed, Nov 07, 2012 at 05:35:37PM +0900, Jun'ichi Nomura wrote:
> If it's good for performance, the patch is fine with me.
 
Acked-by: Alasdair G Kergon <agk@redhat.com>

Alasdair


^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2012-11-07 10:59 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2012-11-06 11:29 [PATCH] Fix deadlock with request based dm and some drivers Jens Axboe
2012-11-06 12:14 ` Jun'ichi Nomura
2012-11-06 12:35   ` Jens Axboe
2012-11-07  0:15     ` Jun'ichi Nomura
2012-11-07  8:00       ` Jens Axboe
2012-11-07  8:35         ` Jun'ichi Nomura
2012-11-07 10:59           ` [dm-devel] " Alasdair G Kergon

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).