All of lore.kernel.org
 help / color / mirror / Atom feed
From: Minas Harutyunyan <Minas.Harutyunyan@synopsys.com>
To: Felipe Balbi <balbi@kernel.org>,
	Minas Harutyunyan <Minas.Harutyunyan@synopsys.com>,
	Roger Quadros <rogerq@ti.com>
Cc: "linux-usb@vger.kernel.org" <linux-usb@vger.kernel.org>,
	"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>
Subject: Re: [PATCH v2] usb: dwc3: Prevent indefinite sleep in _dwc3_set_mode during suspend/resume
Date: Sat, 17 Mar 2018 06:33:12 +0000	[thread overview]
Message-ID: <410670D7E743164D87FA6160E7907A560113ABBA4B@am04wembxa.internal.synopsys.com> (raw)
In-Reply-To: 87zi38438h.fsf@linux.intel.com

Hi,

On 3/16/2018 4:25 PM, Felipe Balbi wrote:
> 
> Hi,
> 
> Minas Harutyunyan <Minas.Harutyunyan@synopsys.com> writes:
>>>>> On 09/03/18 14:47, Roger Quadros wrote:
>>>>>> In the following test we get stuck by sleeping forever in _dwc3_set_mode()
>>>>>> after which dual-role switching doesn't work.
>>>>>>
>>>>>> On dra7-evm's dual-role port,
>>>>>> - Load g_zero gadget driver and enumerate to host
>>>>>> - suspend to mem
>>>>>> - disconnect USB cable to host and connect otg cable with Pen drive in it.
>>>>>> - resume system
>>>>>> - we sleep indefinitely in _dwc3_set_mode due to.
>>>>>>     dwc3_gadget_exit()->usb_del_gadget_udc()->udc_stop()->
>>>>>> 	dwc3_gadget_stop()->wait_event_lock_irq()
>>>>>>
>>>>>> To fix this instead of waiting indefinitely with wait_event_lock_irq()
>>>>>> we use wait_event_interruptible_lock_irq_timeout() and print
>>>>>> and error message if there was a timeout.
>>>>>>
>>>>>> Signed-off-by: Roger Quadros <rogerq@ti.com>
>>>>>
>>>>> Thanks for picking this for -next.
>>>>> Is it better to have this in v4.16-rc fixes?
>>>>> and also stable? v4.12+
>>>>
>>>> Well, there was no "Fixes: foobar" or "Cc: stable" lines in the commit
>>>> log ;-)
>>>>
>>>> The best we can do now, is wait for -rc1 and manually send the commit to
>>>> stable.
>>>>
>>>
>>> That's fine. Thanks.
>>>
>>
>> Same issue seen in dwc3_gadget_ep_dequeue() function where also used
>> wait_event_lock_irq() - as result infinite loop.
> 
> how did this happen? During rmmod dwc3? Or, perhaps, after you unloaded
> a gadget driver?
> 
No, not during rmmod's.
We using our internal USB testing tool. Test case; ISOC OUT, transfer 
size N frames. When host starts ISOC OUT traffic then the dwc3 based on 
"Transfer not ready" event in frame F starts transfers staring from 
frame F+4 (for bInterval=1) as result 4 requests, which already queued 
on device side, remain incomplete. Function driver on some timeout 
trying dequeue these 4 requests (without disabling EP) to complete test.
For IN ISOC's these requests completed on MISSED ISOC event, but for 
ISOC OUT required call dequeue on some timeout.

>> Actually to fix this issue I updated condition of wait function
>> from:
>> !(dep->flags & DWC3_EP_END_TRANSFER_PENDING)
>> to:
>> !(dep->flags & DWC3_EP_END_TRANSFER_PENDING & DWC3_EP_ENABLED)
> 
> you're not fixing anything. You're, essentially, removing the entire
> end transfer pending logic. 
yes, you are right, but how to overcome this infinite loop? Replace 
wait_event_lock_irq() by  wait_event_interruptible_lock_irq_timeout()?

The whole idea of this is that we can
> disable the endpoint and wait for the End Transfer interrupt. When you
> add a check for the endpoint being enabled, then that code will never
> run and, thus, never wait for the End Transfer IRQ.
> 
> If you manage to find a more reliable way of reproducing this, then make
> sure to capture dwc3 tracepoints (see the documentation for details) and
> let's start trying to figure out what's going on.
> 
> cheers
> 


WARNING: multiple messages have this Message-ID (diff)
From: Minas Harutyunyan <Minas.Harutyunyan@synopsys.com>
To: Felipe Balbi <balbi@kernel.org>,
	Minas Harutyunyan <Minas.Harutyunyan@synopsys.com>,
	Roger Quadros <rogerq@ti.com>
Cc: "linux-usb@vger.kernel.org" <linux-usb@vger.kernel.org>,
	"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>
Subject: [v2] usb: dwc3: Prevent indefinite sleep in _dwc3_set_mode during suspend/resume
Date: Sat, 17 Mar 2018 06:33:12 +0000	[thread overview]
Message-ID: <410670D7E743164D87FA6160E7907A560113ABBA4B@am04wembxa.internal.synopsys.com> (raw)

Hi,

On 3/16/2018 4:25 PM, Felipe Balbi wrote:
> 
> Hi,
> 
> Minas Harutyunyan <Minas.Harutyunyan@synopsys.com> writes:
>>>>> On 09/03/18 14:47, Roger Quadros wrote:
>>>>>> In the following test we get stuck by sleeping forever in _dwc3_set_mode()
>>>>>> after which dual-role switching doesn't work.
>>>>>>
>>>>>> On dra7-evm's dual-role port,
>>>>>> - Load g_zero gadget driver and enumerate to host
>>>>>> - suspend to mem
>>>>>> - disconnect USB cable to host and connect otg cable with Pen drive in it.
>>>>>> - resume system
>>>>>> - we sleep indefinitely in _dwc3_set_mode due to.
>>>>>>     dwc3_gadget_exit()->usb_del_gadget_udc()->udc_stop()->
>>>>>> 	dwc3_gadget_stop()->wait_event_lock_irq()
>>>>>>
>>>>>> To fix this instead of waiting indefinitely with wait_event_lock_irq()
>>>>>> we use wait_event_interruptible_lock_irq_timeout() and print
>>>>>> and error message if there was a timeout.
>>>>>>
>>>>>> Signed-off-by: Roger Quadros <rogerq@ti.com>
>>>>>
>>>>> Thanks for picking this for -next.
>>>>> Is it better to have this in v4.16-rc fixes?
>>>>> and also stable? v4.12+
>>>>
>>>> Well, there was no "Fixes: foobar" or "Cc: stable" lines in the commit
>>>> log ;-)
>>>>
>>>> The best we can do now, is wait for -rc1 and manually send the commit to
>>>> stable.
>>>>
>>>
>>> That's fine. Thanks.
>>>
>>
>> Same issue seen in dwc3_gadget_ep_dequeue() function where also used
>> wait_event_lock_irq() - as result infinite loop.
> 
> how did this happen? During rmmod dwc3? Or, perhaps, after you unloaded
> a gadget driver?
> 
No, not during rmmod's.
We using our internal USB testing tool. Test case; ISOC OUT, transfer 
size N frames. When host starts ISOC OUT traffic then the dwc3 based on 
"Transfer not ready" event in frame F starts transfers staring from 
frame F+4 (for bInterval=1) as result 4 requests, which already queued 
on device side, remain incomplete. Function driver on some timeout 
trying dequeue these 4 requests (without disabling EP) to complete test.
For IN ISOC's these requests completed on MISSED ISOC event, but for 
ISOC OUT required call dequeue on some timeout.

>> Actually to fix this issue I updated condition of wait function
>> from:
>> !(dep->flags & DWC3_EP_END_TRANSFER_PENDING)
>> to:
>> !(dep->flags & DWC3_EP_END_TRANSFER_PENDING & DWC3_EP_ENABLED)
> 
> you're not fixing anything. You're, essentially, removing the entire
> end transfer pending logic. 
yes, you are right, but how to overcome this infinite loop? Replace 
wait_event_lock_irq() by  wait_event_interruptible_lock_irq_timeout()?

The whole idea of this is that we can
> disable the endpoint and wait for the End Transfer interrupt. When you
> add a check for the endpoint being enabled, then that code will never
> run and, thus, never wait for the End Transfer IRQ.
> 
> If you manage to find a more reliable way of reproducing this, then make
> sure to capture dwc3 tracepoints (see the documentation for details) and
> let's start trying to figure out what's going on.
> 
> cheers
>
---
To unsubscribe from this list: send the line "unsubscribe linux-usb" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

  reply	other threads:[~2018-03-17  6:33 UTC|newest]

Thread overview: 64+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2018-02-27 11:22 [PATCH] usb: dwc3: Prevent indefinite sleep in _dwc3_set_mode during suspend/resume Roger Quadros
2018-02-27 11:22 ` Roger Quadros
2018-02-28  3:04 ` [PATCH] " Baolin Wang
2018-02-28  3:04   ` Baolin Wang
2018-02-28  9:55   ` [PATCH] " Roger Quadros
2018-02-28  9:55     ` Roger Quadros
2018-02-28  7:53 ` [PATCH] " Felipe Balbi
2018-02-28  7:53   ` Felipe Balbi
2018-02-28  9:59   ` [PATCH] " Roger Quadros
2018-02-28  9:59     ` Roger Quadros
2018-03-05  8:49     ` [PATCH] " Felipe Balbi
2018-03-05  8:49       ` Felipe Balbi
2018-03-05  9:45       ` [PATCH] " Roger Quadros
2018-03-05  9:45         ` Roger Quadros
2018-03-05 10:41         ` [PATCH] " Baolin Wang
2018-03-05 10:41           ` Baolin Wang
2018-03-05 11:03           ` [PATCH] " Roger Quadros
2018-03-05 11:03             ` Roger Quadros
2018-03-05 11:06           ` [PATCH] " Felipe Balbi
2018-03-05 11:06             ` Felipe Balbi
2018-03-05 11:14             ` [PATCH] " Roger Quadros
2018-03-05 11:14               ` Roger Quadros
2018-03-05 11:25               ` [PATCH] " Baolin Wang
2018-03-05 11:25                 ` Baolin Wang
2018-03-05 11:27                 ` [PATCH] " Felipe Balbi
2018-03-05 11:27                   ` Felipe Balbi
2018-03-09  9:19                   ` [PATCH] " Roger Quadros
2018-03-09  9:19                     ` Roger Quadros
2018-03-09  9:23                     ` [PATCH] " Felipe Balbi
2018-03-09  9:23                       ` Felipe Balbi
2018-03-09  9:26                       ` [PATCH] " Roger Quadros
2018-03-09  9:26                         ` Roger Quadros
2018-03-09  9:49                         ` [PATCH] " Roger Quadros
2018-03-09  9:49                           ` Roger Quadros
2018-03-09 10:39                           ` [PATCH] " Felipe Balbi
2018-03-09 10:39                             ` Felipe Balbi
2018-03-09 10:36                         ` [PATCH] " Felipe Balbi
2018-03-09 10:36                           ` Felipe Balbi
2018-03-05 11:25               ` [PATCH] " Felipe Balbi
2018-03-05 11:25                 ` Felipe Balbi
2018-03-09 12:47 ` [PATCH v2] " Roger Quadros
2018-03-09 12:47   ` [v2] " Roger Quadros
2018-03-16 10:34   ` [PATCH v2] " Roger Quadros
2018-03-16 10:34     ` [v2] " Roger Quadros
2018-03-16 11:00     ` [PATCH v2] " Felipe Balbi
2018-03-16 11:00       ` [v2] " Felipe Balbi
2018-03-16 11:03       ` [PATCH v2] " Roger Quadros
2018-03-16 11:03         ` [v2] " Roger Quadros
2018-03-16 11:43         ` [PATCH v2] " Minas Harutyunyan
2018-03-16 11:43           ` [v2] " Minas Harutyunyan
2018-03-16 12:25           ` [PATCH v2] " Felipe Balbi
2018-03-16 12:25             ` [v2] " Felipe Balbi
2018-03-17  6:33             ` Minas Harutyunyan [this message]
2018-03-17  6:33               ` Minas Harutyunyan
2018-03-19  8:54               ` [PATCH v2] " Felipe Balbi
2018-03-19  8:54                 ` [v2] " Felipe Balbi
2018-03-19 11:36                 ` [PATCH v2] " Minas Harutyunyan
2018-03-19 11:36                   ` [v2] " Minas Harutyunyan
2018-03-19 13:53                   ` [PATCH v2] " Minas Harutyunyan
2018-03-19 13:53                     ` [v2] " Minas Harutyunyan
2018-04-10  6:29                     ` [PATCH v2] " Minas Harutyunyan
2018-04-10  6:29                       ` [v2] " Minas Harutyunyan
2018-04-10  7:31                       ` [PATCH v2] " Felipe Balbi
2018-04-10  7:31                         ` [v2] " Felipe Balbi

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=410670D7E743164D87FA6160E7907A560113ABBA4B@am04wembxa.internal.synopsys.com \
    --to=minas.harutyunyan@synopsys.com \
    --cc=balbi@kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-usb@vger.kernel.org \
    --cc=rogerq@ti.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.