All of lore.kernel.org
 help / color / mirror / Atom feed
* ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
@ 2016-06-09  7:46 A. Benz
  2016-06-10  8:50 ` Michal Kazior
  0 siblings, 1 reply; 15+ messages in thread
From: A. Benz @ 2016-06-09  7:46 UTC (permalink / raw)
  To: linux-wireless

Dear All,

I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600).
With compat-wireless-2016-05-12, I observed traces attached below.
The router is unstable and eventually reboots by itself (randomly).

Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing 
else is changed (software-wise or hardware).
This was confirmed with other users.

A new compile with the fixes below:
https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0

Did not solve the problem.

Please let me know if I need to provide any further information.

------------[ cut here ]------------
WARNING: CPU: 0 PID: 558 at 
compat-wireless-2016-05-12/net/mac80211/rx.c:4068 
ieee80211_rx_napi+0x8c/0x8a4 [mac80211]()
Modules linked in: pppoe ppp_async iptable_nat pppox ppp_generic 
nf_nat_ipv4 nf_conntrack_ipv6 nf_conntrack_ipv4 ipt_REJECT 
ipt_MASQUERADE xt_time xt_tcpudp xt_state xt_nat xt_multiport xt_mark 
xt_mac xt_limit xt_id xt_conntrack xt_comment xt_TCPMSS xt_REDIRECT 
xt_LOG xt_CT slhc nf_reject_ipv4 nf_nat_redirect nf_nat_masquerade_ipv4 
nf_nat nf_log_ipv4 nf_defrag_ipv6 nf_defrag_ipv4 nf_conntrack_rtcache 
nf_conntrack iptable_raw iptable_mangle iptable_fWed May 25 21:21:57 
2016 kern.warn kernel: [24187.498347] CPU: 0 PID: 558 Comm: hostapd 
Tainted: G        W       4.4.11 #2
Hardware name: Qualcomm (Flattened Device Tree)
[<c021ff34>] (unwind_backtrace) from [<c021cb9c>] (show_stack+0x10/0x14)
[<c021cb9c>] (show_stack) from [<c03a2218>] (dump_stack+0x88/0x9c)
[<c03a2218>] (dump_stack) from [<c0227adc>] (warn_slowpath_common+0x94/0xb0)
[<c0227adc>] (warn_slowpath_common) from [<c0227b94>] 
(warn_slowpath_null+0x1c/0x24)
[<c0227b94>] (warn_slowpath_null) from [<bf19fa44>] 
(ieee80211_rx_napi+0x8c/0x8a4 [mac80211])
[<bf19fa44>] (ieee80211_rx_napi [mac80211]) from [<bf20e9d0>] 
(ath10k_htt_t2h_msg_handler+0x92c/0x988 [ath10k_core])
[<bf20e9d0>] (ath10k_htt_t2h_msg_handler [ath10k_core]) from 
[<bf20f3e8>] (ath10k_htt_txrx_compl_task+0x9bc/0x117c [ath10k_core])
[<bf20f3e8>] (ath10k_htt_txrx_compl_task [ath10k_core]) from 
[<c022b158>] (tasklet_action+0xb8/0x144)
[<c022b158>] (tasklet_action) from [<c022b32c>] (__do_softirq+0xe0/0x21c)
[<c022b32c>] (__do_softirq) from [<c022b4d4>] (do_softirq.part.2+0x28/0x30)
[<c022b4d4>] (do_softirq.part.2) from [<c022b590>] 
(__local_bh_enable_ip+0xb4/0x104)
[<c022b590>] (__local_bh_enable_ip) from [<c0596684>] 
(packet_poll+0xc0/0x100)
[<c0596684>] (packet_poll) from [<c04b1368>] (sock_poll+0xec/0xf8)
[<c04b1368>] (sock_poll) from [<c02e7fc4>] (do_select+0x2f8/0x62c)
[<c02e7fc4>] (do_select) from [<c02e858c>] (core_sys_select+0x294/0x424)
[<c02e858c>] (core_sys_select) from [<c02e8820>] (SyS_select+0x104/0x130)
[<c02e8820>] (SyS_select) from [<c0209c00>] (ret_fast_syscall+0x0/0x3c)
---[ end trace e55b94e0d302fcd8 ]---

Regards,
A. Benz

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
  2016-06-09  7:46 ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 A. Benz
@ 2016-06-10  8:50 ` Michal Kazior
  2016-06-10  8:55   ` Felix Fietkau
  0 siblings, 1 reply; 15+ messages in thread
From: Michal Kazior @ 2016-06-10  8:50 UTC (permalink / raw)
  To: A. Benz; +Cc: linux-wireless

On 9 June 2016 at 09:46, A. Benz <ash.benz@bk.ru> wrote:
> Dear All,
>
> I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600).
> With compat-wireless-2016-05-12, I observed traces attached below.
> The router is unstable and eventually reboots by itself (randomly).
>
> Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing
> else is changed (software-wise or hardware).
> This was confirmed with other users.
>
> A new compile with the fixes below:
> https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0
>
> Did not solve the problem.
>
> Please let me know if I need to provide any further information.
>
> ------------[ cut here ]------------
> WARNING: CPU: 0 PID: 558 at
> compat-wireless-2016-05-12/net/mac80211/rx.c:4068
> ieee80211_rx_napi+0x8c/0x8a4 [mac80211]()

Can you post what is at rx.c line 4068 (and +/- 3 lines), please?


Michał

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
  2016-06-10  8:50 ` Michal Kazior
@ 2016-06-10  8:55   ` Felix Fietkau
  2016-06-10  9:10     ` Michal Kazior
  2016-06-10 12:22       ` Kalle Valo
  0 siblings, 2 replies; 15+ messages in thread
From: Felix Fietkau @ 2016-06-10  8:55 UTC (permalink / raw)
  To: Michal Kazior, A. Benz; +Cc: linux-wireless

On 2016-06-10 10:50, Michal Kazior wrote:
> On 9 June 2016 at 09:46, A. Benz <ash.benz@bk.ru> wrote:
>> Dear All,
>>
>> I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600).
>> With compat-wireless-2016-05-12, I observed traces attached below.
>> The router is unstable and eventually reboots by itself (randomly).
>>
>> Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing
>> else is changed (software-wise or hardware).
>> This was confirmed with other users.
>>
>> A new compile with the fixes below:
>> https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0
>>
>> Did not solve the problem.
>>
>> Please let me know if I need to provide any further information.
>>
>> ------------[ cut here ]------------
>> WARNING: CPU: 0 PID: 558 at
>> compat-wireless-2016-05-12/net/mac80211/rx.c:4068
>> ieee80211_rx_napi+0x8c/0x8a4 [mac80211]()
> 
> Can you post what is at rx.c line 4068 (and +/- 3 lines), please?
It's early in ieee80211_rx_napi:

    sband = local->hw.wiphy->bands[status->band];
    if (WARN_ON(!sband))
        goto drop;

I could not easily find a scenario under which status->band would not be
set properly by the driver, so my guess is there is some nasty memory
corruption going on.

FWIW, I've received several reports like this from different people on
different devices. They're also confirming that reverting to the
snapshot from January makes things stable again.

- Felix

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
  2016-06-10  8:55   ` Felix Fietkau
@ 2016-06-10  9:10     ` Michal Kazior
  2016-06-10  9:20       ` Johannes Berg
  2016-06-10 12:22       ` Kalle Valo
  1 sibling, 1 reply; 15+ messages in thread
From: Michal Kazior @ 2016-06-10  9:10 UTC (permalink / raw)
  To: Felix Fietkau; +Cc: A. Benz, linux-wireless

On 10 June 2016 at 10:55, Felix Fietkau <nbd@nbd.name> wrote:
> On 2016-06-10 10:50, Michal Kazior wrote:
>> On 9 June 2016 at 09:46, A. Benz <ash.benz@bk.ru> wrote:
>>> Dear All,
>>>
>>> I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600).
>>> With compat-wireless-2016-05-12, I observed traces attached below.
>>> The router is unstable and eventually reboots by itself (randomly).
>>>
>>> Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing
>>> else is changed (software-wise or hardware).
>>> This was confirmed with other users.
>>>
>>> A new compile with the fixes below:
>>> https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0
>>>
>>> Did not solve the problem.
>>>
>>> Please let me know if I need to provide any further information.
>>>
>>> ------------[ cut here ]------------
>>> WARNING: CPU: 0 PID: 558 at
>>> compat-wireless-2016-05-12/net/mac80211/rx.c:4068
>>> ieee80211_rx_napi+0x8c/0x8a4 [mac80211]()
>>
>> Can you post what is at rx.c line 4068 (and +/- 3 lines), please?
> It's early in ieee80211_rx_napi:
>
>     sband = local->hw.wiphy->bands[status->band];
>     if (WARN_ON(!sband))
>         goto drop;

Thanks.


> I could not easily find a scenario under which status->band would not be
> set properly by the driver, so my guess is there is some nasty memory
> corruption going on.

Hmm.. could it be related to ath10k not fulfilling (some) NAPI's
locking requirements and thus ending up with, e.g. linked-list mayhem?


Michał

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
  2016-06-10  9:10     ` Michal Kazior
@ 2016-06-10  9:20       ` Johannes Berg
  0 siblings, 0 replies; 15+ messages in thread
From: Johannes Berg @ 2016-06-10  9:20 UTC (permalink / raw)
  To: Michal Kazior, Felix Fietkau; +Cc: A. Benz, linux-wireless

On Fri, 2016-06-10 at 11:10 +0200, Michal Kazior wrote:
> 
> Hmm.. could it be related to ath10k not fulfilling (some) NAPI's
> locking requirements and thus ending up with, e.g. linked-list
> mayhem?
> 

Shoudln't matter since ath10k doesn't actually use rx_napi()?

johannes

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
  2016-06-10  8:55   ` Felix Fietkau
@ 2016-06-10 12:22       ` Kalle Valo
  2016-06-10 12:22       ` Kalle Valo
  1 sibling, 0 replies; 15+ messages in thread
From: Kalle Valo @ 2016-06-10 12:22 UTC (permalink / raw)
  To: Felix Fietkau; +Cc: Michal Kazior, A. Benz, linux-wireless, ath10k

Felix Fietkau <nbd@nbd.name> writes:

> On 2016-06-10 10:50, Michal Kazior wrote:
>> On 9 June 2016 at 09:46, A. Benz <ash.benz@bk.ru> wrote:
>>> Dear All,
>>>
>>> I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600).
>>> With compat-wireless-2016-05-12, I observed traces attached below.
>>> The router is unstable and eventually reboots by itself (randomly).
>>>
>>> Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing
>>> else is changed (software-wise or hardware).
>>> This was confirmed with other users.
>>>
>>> A new compile with the fixes below:
>>> https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0
>>>
>>> Did not solve the problem.
>>>
>>> Please let me know if I need to provide any further information.
>>>
>>> ------------[ cut here ]------------
>>> WARNING: CPU: 0 PID: 558 at
>>> compat-wireless-2016-05-12/net/mac80211/rx.c:4068
>>> ieee80211_rx_napi+0x8c/0x8a4 [mac80211]()
>> 
>> Can you post what is at rx.c line 4068 (and +/- 3 lines), please?
> It's early in ieee80211_rx_napi:
>
>     sband = local->hw.wiphy->bands[status->band];
>     if (WARN_ON(!sband))
>         goto drop;
>
> I could not easily find a scenario under which status->band would not be
> set properly by the driver, so my guess is there is some nasty memory
> corruption going on.
>
> FWIW, I've received several reports like this from different people on
> different devices. They're also confirming that reverting to the
> snapshot from January makes things stable again.

Adding ath10k list to the loop.

-- 
Kalle Valo

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
@ 2016-06-10 12:22       ` Kalle Valo
  0 siblings, 0 replies; 15+ messages in thread
From: Kalle Valo @ 2016-06-10 12:22 UTC (permalink / raw)
  To: Felix Fietkau; +Cc: linux-wireless, Michal Kazior, ath10k, A. Benz

Felix Fietkau <nbd@nbd.name> writes:

> On 2016-06-10 10:50, Michal Kazior wrote:
>> On 9 June 2016 at 09:46, A. Benz <ash.benz@bk.ru> wrote:
>>> Dear All,
>>>
>>> I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600).
>>> With compat-wireless-2016-05-12, I observed traces attached below.
>>> The router is unstable and eventually reboots by itself (randomly).
>>>
>>> Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing
>>> else is changed (software-wise or hardware).
>>> This was confirmed with other users.
>>>
>>> A new compile with the fixes below:
>>> https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0
>>>
>>> Did not solve the problem.
>>>
>>> Please let me know if I need to provide any further information.
>>>
>>> ------------[ cut here ]------------
>>> WARNING: CPU: 0 PID: 558 at
>>> compat-wireless-2016-05-12/net/mac80211/rx.c:4068
>>> ieee80211_rx_napi+0x8c/0x8a4 [mac80211]()
>> 
>> Can you post what is at rx.c line 4068 (and +/- 3 lines), please?
> It's early in ieee80211_rx_napi:
>
>     sband = local->hw.wiphy->bands[status->band];
>     if (WARN_ON(!sband))
>         goto drop;
>
> I could not easily find a scenario under which status->band would not be
> set properly by the driver, so my guess is there is some nasty memory
> corruption going on.
>
> FWIW, I've received several reports like this from different people on
> different devices. They're also confirming that reverting to the
> snapshot from January makes things stable again.

Adding ath10k list to the loop.

-- 
Kalle Valo

_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
  2016-06-10 12:22       ` Kalle Valo
@ 2016-06-10 12:52         ` Ben Greear
  -1 siblings, 0 replies; 15+ messages in thread
From: Ben Greear @ 2016-06-10 12:52 UTC (permalink / raw)
  To: Kalle Valo, Felix Fietkau; +Cc: linux-wireless, Michal Kazior, ath10k, A. Benz



On 06/10/2016 05:22 AM, Kalle Valo wrote:
> Felix Fietkau <nbd@nbd.name> writes:
>
>> On 2016-06-10 10:50, Michal Kazior wrote:
>>> On 9 June 2016 at 09:46, A. Benz <ash.benz@bk.ru> wrote:
>>>> Dear All,
>>>>
>>>> I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600).
>>>> With compat-wireless-2016-05-12, I observed traces attached below.
>>>> The router is unstable and eventually reboots by itself (randomly).
>>>>
>>>> Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing
>>>> else is changed (software-wise or hardware).
>>>> This was confirmed with other users.
>>>>
>>>> A new compile with the fixes below:
>>>> https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0
>>>>
>>>> Did not solve the problem.
>>>>
>>>> Please let me know if I need to provide any further information.
>>>>
>>>> ------------[ cut here ]------------
>>>> WARNING: CPU: 0 PID: 558 at
>>>> compat-wireless-2016-05-12/net/mac80211/rx.c:4068
>>>> ieee80211_rx_napi+0x8c/0x8a4 [mac80211]()
>>>
>>> Can you post what is at rx.c line 4068 (and +/- 3 lines), please?
>> It's early in ieee80211_rx_napi:
>>
>>      sband = local->hw.wiphy->bands[status->band];
>>      if (WARN_ON(!sband))
>>          goto drop;
>>
>> I could not easily find a scenario under which status->band would not be
>> set properly by the driver, so my guess is there is some nasty memory
>> corruption going on.
>>
>> FWIW, I've received several reports like this from different people on
>> different devices. They're also confirming that reverting to the
>> snapshot from January makes things stable again.
>
> Adding ath10k list to the loop.

This looks a lot like the problems I was having.

Two of these 5 patches recently made it upstream (but may not be in LEDE yet),
but the other patches also were related to memory corruption.

See my patches posted on 4/1/16:

https://patchwork.kernel.org/project/ath10k/list/

I don't know where the 5/5 patch ended up.

Thanks,
Ben

-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
@ 2016-06-10 12:52         ` Ben Greear
  0 siblings, 0 replies; 15+ messages in thread
From: Ben Greear @ 2016-06-10 12:52 UTC (permalink / raw)
  To: Kalle Valo, Felix Fietkau; +Cc: linux-wireless, Michal Kazior, ath10k, A. Benz



On 06/10/2016 05:22 AM, Kalle Valo wrote:
> Felix Fietkau <nbd@nbd.name> writes:
>
>> On 2016-06-10 10:50, Michal Kazior wrote:
>>> On 9 June 2016 at 09:46, A. Benz <ash.benz@bk.ru> wrote:
>>>> Dear All,
>>>>
>>>> I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600).
>>>> With compat-wireless-2016-05-12, I observed traces attached below.
>>>> The router is unstable and eventually reboots by itself (randomly).
>>>>
>>>> Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing
>>>> else is changed (software-wise or hardware).
>>>> This was confirmed with other users.
>>>>
>>>> A new compile with the fixes below:
>>>> https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0
>>>>
>>>> Did not solve the problem.
>>>>
>>>> Please let me know if I need to provide any further information.
>>>>
>>>> ------------[ cut here ]------------
>>>> WARNING: CPU: 0 PID: 558 at
>>>> compat-wireless-2016-05-12/net/mac80211/rx.c:4068
>>>> ieee80211_rx_napi+0x8c/0x8a4 [mac80211]()
>>>
>>> Can you post what is at rx.c line 4068 (and +/- 3 lines), please?
>> It's early in ieee80211_rx_napi:
>>
>>      sband = local->hw.wiphy->bands[status->band];
>>      if (WARN_ON(!sband))
>>          goto drop;
>>
>> I could not easily find a scenario under which status->band would not be
>> set properly by the driver, so my guess is there is some nasty memory
>> corruption going on.
>>
>> FWIW, I've received several reports like this from different people on
>> different devices. They're also confirming that reverting to the
>> snapshot from January makes things stable again.
>
> Adding ath10k list to the loop.

This looks a lot like the problems I was having.

Two of these 5 patches recently made it upstream (but may not be in LEDE yet),
but the other patches also were related to memory corruption.

See my patches posted on 4/1/16:

https://patchwork.kernel.org/project/ath10k/list/

I don't know where the 5/5 patch ended up.

Thanks,
Ben

-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com

_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
  2016-06-10 12:52         ` Ben Greear
@ 2016-06-10 12:57           ` Felix Fietkau
  -1 siblings, 0 replies; 15+ messages in thread
From: Felix Fietkau @ 2016-06-10 12:57 UTC (permalink / raw)
  To: Ben Greear, Kalle Valo; +Cc: linux-wireless, Michal Kazior, ath10k, A. Benz

On 2016-06-10 14:52, Ben Greear wrote:
> This looks a lot like the problems I was having.
> 
> Two of these 5 patches recently made it upstream (but may not be in LEDE yet),
> but the other patches also were related to memory corruption.
> 
> See my patches posted on 4/1/16:
> 
> https://patchwork.kernel.org/project/ath10k/list/
> 
> I don't know where the 5/5 patch ended up.
I had already asked affected users to test with those patches (I have a
commit that adds them in my staging tree), but it did not resolve the issue.

- Felix

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
@ 2016-06-10 12:57           ` Felix Fietkau
  0 siblings, 0 replies; 15+ messages in thread
From: Felix Fietkau @ 2016-06-10 12:57 UTC (permalink / raw)
  To: Ben Greear, Kalle Valo; +Cc: linux-wireless, Michal Kazior, ath10k, A. Benz

On 2016-06-10 14:52, Ben Greear wrote:
> This looks a lot like the problems I was having.
> 
> Two of these 5 patches recently made it upstream (but may not be in LEDE yet),
> but the other patches also were related to memory corruption.
> 
> See my patches posted on 4/1/16:
> 
> https://patchwork.kernel.org/project/ath10k/list/
> 
> I don't know where the 5/5 patch ended up.
I had already asked affected users to test with those patches (I have a
commit that adds them in my staging tree), but it did not resolve the issue.

- Felix

_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
  2016-06-10 12:57           ` Felix Fietkau
@ 2016-06-10 13:16             ` Ben Greear
  -1 siblings, 0 replies; 15+ messages in thread
From: Ben Greear @ 2016-06-10 13:16 UTC (permalink / raw)
  To: Felix Fietkau, Kalle Valo; +Cc: linux-wireless, Michal Kazior, ath10k, A. Benz



On 06/10/2016 05:57 AM, Felix Fietkau wrote:
> On 2016-06-10 14:52, Ben Greear wrote:
>> This looks a lot like the problems I was having.
>>
>> Two of these 5 patches recently made it upstream (but may not be in LEDE yet),
>> but the other patches also were related to memory corruption.
>>
>> See my patches posted on 4/1/16:
>>
>> https://patchwork.kernel.org/project/ath10k/list/
>>
>> I don't know where the 5/5 patch ended up.
> I had already asked affected users to test with those patches (I have a
> commit that adds them in my staging tree), but it did not resolve the issue.

Ok, must be something else then.

If you can run on x86 under KASAN it may provide some clues..that is how I eventually
made progress on the issues I was seeing.  My rebase onto 3.7 has been slow and painful,
but I should be ready to start testing that sometime soon, maybe I can reproduce something
there.

Thanks,
Ben

-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
@ 2016-06-10 13:16             ` Ben Greear
  0 siblings, 0 replies; 15+ messages in thread
From: Ben Greear @ 2016-06-10 13:16 UTC (permalink / raw)
  To: Felix Fietkau, Kalle Valo; +Cc: linux-wireless, Michal Kazior, ath10k, A. Benz



On 06/10/2016 05:57 AM, Felix Fietkau wrote:
> On 2016-06-10 14:52, Ben Greear wrote:
>> This looks a lot like the problems I was having.
>>
>> Two of these 5 patches recently made it upstream (but may not be in LEDE yet),
>> but the other patches also were related to memory corruption.
>>
>> See my patches posted on 4/1/16:
>>
>> https://patchwork.kernel.org/project/ath10k/list/
>>
>> I don't know where the 5/5 patch ended up.
> I had already asked affected users to test with those patches (I have a
> commit that adds them in my staging tree), but it did not resolve the issue.

Ok, must be something else then.

If you can run on x86 under KASAN it may provide some clues..that is how I eventually
made progress on the issues I was seeing.  My rebase onto 3.7 has been slow and painful,
but I should be ready to start testing that sometime soon, maybe I can reproduce something
there.

Thanks,
Ben

-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com

_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 15+ messages in thread

* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
  2016-06-10 13:16             ` Ben Greear
@ 2016-07-15 11:49               ` Nagarajan, Ashok Raj
  -1 siblings, 0 replies; 15+ messages in thread
From: Nagarajan, Ashok Raj @ 2016-07-15 11:49 UTC (permalink / raw)
  To: Ben Greear, nbd, Kalle Valo
  Cc: linux-wireless, michal.kazior, ath10k, A. Benz

> On 06/10/2016 05:57 AM, Felix Fietkau wrote:
>>> On 2016-06-10 14:52, Ben Greear wrote:
>>> This looks a lot like the problems I was having.
>>>
>>> Two of these 5 patches recently made it upstream (but may not be in LEDE yet),
>>> but the other patches also were related to memory corruption.
>>>
>>> See my patches posted on 4/1/16:
>>>
>>> https://patchwork.kernel.org/project/ath10k/list/
>>>
>>> I don't know where the 5/5 patch ended up.
>> I had already asked affected users to test with those patches (I have a
>> commit that adds them in my staging tree), but it did not resolve the issue.
>
>Ok, must be something else then.
>
>If you can run on x86 under KASAN it may provide some clues..that is how I eventually
>made progress on the issues I was seeing.  My rebase onto 3.7 has been slow and painful,
>but I should be ready to start testing that sometime soon, maybe I can reproduce something
>there.

Hi Benz,

Could you please check with the following diff if it is solving your issue?

diff --git a/drivers/net/wireless/ath/ath10k/htt_rx.c b/drivers/net/wireless/ath/ath10k/htt_rx.c
index 6f19fca..c192a41 100644
--- a/drivers/net/wireless/ath/ath10k/htt_rx.c
+++ b/drivers/net/wireless/ath/ath10k/htt_rx.c
@@ -1528,7 +1528,7 @@ static void ath10k_htt_rx_h_filter(struct ath10k *ar,
static int ath10k_htt_rx_handle_amsdu(struct ath10k_htt *htt)
{
        struct ath10k *ar = htt->ar;
-       static struct ieee80211_rx_status rx_status;
+       struct ieee80211_rx_status *rx_status = &htt->rx_status;
        struct sk_buff_head amsdu;
        int ret;
 
@@ -1553,11 +1553,11 @@ static int ath10k_htt_rx_handle_amsdu(struct ath10k_htt *htt)
        }
 
        ath10k_pktlog_rx(ar, &amsdu);
-       ath10k_htt_rx_h_ppdu(ar, &amsdu, &rx_status, 0xffff);
+       ath10k_htt_rx_h_ppdu(ar, &amsdu, rx_status, 0xffff);
        ath10k_htt_rx_h_unchain(ar, &amsdu, ret > 0);
-       ath10k_htt_rx_h_filter(ar, &amsdu, &rx_status);
-       ath10k_htt_rx_h_mpdu(ar, &amsdu, &rx_status);
-       ath10k_htt_rx_h_deliver(ar, &amsdu, &rx_status);
+       ath10k_htt_rx_h_filter(ar, &amsdu, rx_status);
+       ath10k_htt_rx_h_mpdu(ar, &amsdu, rx_status);
+       ath10k_htt_rx_h_deliver(ar, &amsdu, rx_status);

Thanks,
Ashok

>Thanks,
>Ben

^ permalink raw reply related	[flat|nested] 15+ messages in thread

* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05
@ 2016-07-15 11:49               ` Nagarajan, Ashok Raj
  0 siblings, 0 replies; 15+ messages in thread
From: Nagarajan, Ashok Raj @ 2016-07-15 11:49 UTC (permalink / raw)
  To: Ben Greear, nbd, Kalle Valo
  Cc: linux-wireless, michal.kazior, ath10k, A. Benz

> On 06/10/2016 05:57 AM, Felix Fietkau wrote:
>>> On 2016-06-10 14:52, Ben Greear wrote:
>>> This looks a lot like the problems I was having.
>>>
>>> Two of these 5 patches recently made it upstream (but may not be in LEDE yet),
>>> but the other patches also were related to memory corruption.
>>>
>>> See my patches posted on 4/1/16:
>>>
>>> https://patchwork.kernel.org/project/ath10k/list/
>>>
>>> I don't know where the 5/5 patch ended up.
>> I had already asked affected users to test with those patches (I have a
>> commit that adds them in my staging tree), but it did not resolve the issue.
>
>Ok, must be something else then.
>
>If you can run on x86 under KASAN it may provide some clues..that is how I eventually
>made progress on the issues I was seeing.  My rebase onto 3.7 has been slow and painful,
>but I should be ready to start testing that sometime soon, maybe I can reproduce something
>there.

Hi Benz,

Could you please check with the following diff if it is solving your issue?

diff --git a/drivers/net/wireless/ath/ath10k/htt_rx.c b/drivers/net/wireless/ath/ath10k/htt_rx.c
index 6f19fca..c192a41 100644
--- a/drivers/net/wireless/ath/ath10k/htt_rx.c
+++ b/drivers/net/wireless/ath/ath10k/htt_rx.c
@@ -1528,7 +1528,7 @@ static void ath10k_htt_rx_h_filter(struct ath10k *ar,
static int ath10k_htt_rx_handle_amsdu(struct ath10k_htt *htt)
{
        struct ath10k *ar = htt->ar;
-       static struct ieee80211_rx_status rx_status;
+       struct ieee80211_rx_status *rx_status = &htt->rx_status;
        struct sk_buff_head amsdu;
        int ret;
 
@@ -1553,11 +1553,11 @@ static int ath10k_htt_rx_handle_amsdu(struct ath10k_htt *htt)
        }
 
        ath10k_pktlog_rx(ar, &amsdu);
-       ath10k_htt_rx_h_ppdu(ar, &amsdu, &rx_status, 0xffff);
+       ath10k_htt_rx_h_ppdu(ar, &amsdu, rx_status, 0xffff);
        ath10k_htt_rx_h_unchain(ar, &amsdu, ret > 0);
-       ath10k_htt_rx_h_filter(ar, &amsdu, &rx_status);
-       ath10k_htt_rx_h_mpdu(ar, &amsdu, &rx_status);
-       ath10k_htt_rx_h_deliver(ar, &amsdu, &rx_status);
+       ath10k_htt_rx_h_filter(ar, &amsdu, rx_status);
+       ath10k_htt_rx_h_mpdu(ar, &amsdu, rx_status);
+       ath10k_htt_rx_h_deliver(ar, &amsdu, rx_status);

Thanks,
Ashok

>Thanks,
>Ben

_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply related	[flat|nested] 15+ messages in thread

end of thread, other threads:[~2016-07-15 11:50 UTC | newest]

Thread overview: 15+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2016-06-09  7:46 ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 A. Benz
2016-06-10  8:50 ` Michal Kazior
2016-06-10  8:55   ` Felix Fietkau
2016-06-10  9:10     ` Michal Kazior
2016-06-10  9:20       ` Johannes Berg
2016-06-10 12:22     ` Kalle Valo
2016-06-10 12:22       ` Kalle Valo
2016-06-10 12:52       ` Ben Greear
2016-06-10 12:52         ` Ben Greear
2016-06-10 12:57         ` Felix Fietkau
2016-06-10 12:57           ` Felix Fietkau
2016-06-10 13:16           ` Ben Greear
2016-06-10 13:16             ` Ben Greear
2016-07-15 11:49             ` Nagarajan, Ashok Raj
2016-07-15 11:49               ` Nagarajan, Ashok Raj

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.