* ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 @ 2016-06-09 7:46 A. Benz 2016-06-10 8:50 ` Michal Kazior 0 siblings, 1 reply; 15+ messages in thread From: A. Benz @ 2016-06-09 7:46 UTC (permalink / raw) To: linux-wireless Dear All, I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600). With compat-wireless-2016-05-12, I observed traces attached below. The router is unstable and eventually reboots by itself (randomly). Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing else is changed (software-wise or hardware). This was confirmed with other users. A new compile with the fixes below: https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0 Did not solve the problem. Please let me know if I need to provide any further information. ------------[ cut here ]------------ WARNING: CPU: 0 PID: 558 at compat-wireless-2016-05-12/net/mac80211/rx.c:4068 ieee80211_rx_napi+0x8c/0x8a4 [mac80211]() Modules linked in: pppoe ppp_async iptable_nat pppox ppp_generic nf_nat_ipv4 nf_conntrack_ipv6 nf_conntrack_ipv4 ipt_REJECT ipt_MASQUERADE xt_time xt_tcpudp xt_state xt_nat xt_multiport xt_mark xt_mac xt_limit xt_id xt_conntrack xt_comment xt_TCPMSS xt_REDIRECT xt_LOG xt_CT slhc nf_reject_ipv4 nf_nat_redirect nf_nat_masquerade_ipv4 nf_nat nf_log_ipv4 nf_defrag_ipv6 nf_defrag_ipv4 nf_conntrack_rtcache nf_conntrack iptable_raw iptable_mangle iptable_fWed May 25 21:21:57 2016 kern.warn kernel: [24187.498347] CPU: 0 PID: 558 Comm: hostapd Tainted: G W 4.4.11 #2 Hardware name: Qualcomm (Flattened Device Tree) [<c021ff34>] (unwind_backtrace) from [<c021cb9c>] (show_stack+0x10/0x14) [<c021cb9c>] (show_stack) from [<c03a2218>] (dump_stack+0x88/0x9c) [<c03a2218>] (dump_stack) from [<c0227adc>] (warn_slowpath_common+0x94/0xb0) [<c0227adc>] (warn_slowpath_common) from [<c0227b94>] (warn_slowpath_null+0x1c/0x24) [<c0227b94>] (warn_slowpath_null) from [<bf19fa44>] (ieee80211_rx_napi+0x8c/0x8a4 [mac80211]) [<bf19fa44>] (ieee80211_rx_napi [mac80211]) from [<bf20e9d0>] (ath10k_htt_t2h_msg_handler+0x92c/0x988 [ath10k_core]) [<bf20e9d0>] (ath10k_htt_t2h_msg_handler [ath10k_core]) from [<bf20f3e8>] (ath10k_htt_txrx_compl_task+0x9bc/0x117c [ath10k_core]) [<bf20f3e8>] (ath10k_htt_txrx_compl_task [ath10k_core]) from [<c022b158>] (tasklet_action+0xb8/0x144) [<c022b158>] (tasklet_action) from [<c022b32c>] (__do_softirq+0xe0/0x21c) [<c022b32c>] (__do_softirq) from [<c022b4d4>] (do_softirq.part.2+0x28/0x30) [<c022b4d4>] (do_softirq.part.2) from [<c022b590>] (__local_bh_enable_ip+0xb4/0x104) [<c022b590>] (__local_bh_enable_ip) from [<c0596684>] (packet_poll+0xc0/0x100) [<c0596684>] (packet_poll) from [<c04b1368>] (sock_poll+0xec/0xf8) [<c04b1368>] (sock_poll) from [<c02e7fc4>] (do_select+0x2f8/0x62c) [<c02e7fc4>] (do_select) from [<c02e858c>] (core_sys_select+0x294/0x424) [<c02e858c>] (core_sys_select) from [<c02e8820>] (SyS_select+0x104/0x130) [<c02e8820>] (SyS_select) from [<c0209c00>] (ret_fast_syscall+0x0/0x3c) ---[ end trace e55b94e0d302fcd8 ]--- Regards, A. Benz ^ permalink raw reply [flat|nested] 15+ messages in thread
* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 2016-06-09 7:46 ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 A. Benz @ 2016-06-10 8:50 ` Michal Kazior 2016-06-10 8:55 ` Felix Fietkau 0 siblings, 1 reply; 15+ messages in thread From: Michal Kazior @ 2016-06-10 8:50 UTC (permalink / raw) To: A. Benz; +Cc: linux-wireless On 9 June 2016 at 09:46, A. Benz <ash.benz@bk.ru> wrote: > Dear All, > > I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600). > With compat-wireless-2016-05-12, I observed traces attached below. > The router is unstable and eventually reboots by itself (randomly). > > Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing > else is changed (software-wise or hardware). > This was confirmed with other users. > > A new compile with the fixes below: > https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0 > > Did not solve the problem. > > Please let me know if I need to provide any further information. > > ------------[ cut here ]------------ > WARNING: CPU: 0 PID: 558 at > compat-wireless-2016-05-12/net/mac80211/rx.c:4068 > ieee80211_rx_napi+0x8c/0x8a4 [mac80211]() Can you post what is at rx.c line 4068 (and +/- 3 lines), please? Michał ^ permalink raw reply [flat|nested] 15+ messages in thread
* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 2016-06-10 8:50 ` Michal Kazior @ 2016-06-10 8:55 ` Felix Fietkau 2016-06-10 9:10 ` Michal Kazior 2016-06-10 12:22 ` Kalle Valo 0 siblings, 2 replies; 15+ messages in thread From: Felix Fietkau @ 2016-06-10 8:55 UTC (permalink / raw) To: Michal Kazior, A. Benz; +Cc: linux-wireless On 2016-06-10 10:50, Michal Kazior wrote: > On 9 June 2016 at 09:46, A. Benz <ash.benz@bk.ru> wrote: >> Dear All, >> >> I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600). >> With compat-wireless-2016-05-12, I observed traces attached below. >> The router is unstable and eventually reboots by itself (randomly). >> >> Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing >> else is changed (software-wise or hardware). >> This was confirmed with other users. >> >> A new compile with the fixes below: >> https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0 >> >> Did not solve the problem. >> >> Please let me know if I need to provide any further information. >> >> ------------[ cut here ]------------ >> WARNING: CPU: 0 PID: 558 at >> compat-wireless-2016-05-12/net/mac80211/rx.c:4068 >> ieee80211_rx_napi+0x8c/0x8a4 [mac80211]() > > Can you post what is at rx.c line 4068 (and +/- 3 lines), please? It's early in ieee80211_rx_napi: sband = local->hw.wiphy->bands[status->band]; if (WARN_ON(!sband)) goto drop; I could not easily find a scenario under which status->band would not be set properly by the driver, so my guess is there is some nasty memory corruption going on. FWIW, I've received several reports like this from different people on different devices. They're also confirming that reverting to the snapshot from January makes things stable again. - Felix ^ permalink raw reply [flat|nested] 15+ messages in thread
* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 2016-06-10 8:55 ` Felix Fietkau @ 2016-06-10 9:10 ` Michal Kazior 2016-06-10 9:20 ` Johannes Berg 2016-06-10 12:22 ` Kalle Valo 1 sibling, 1 reply; 15+ messages in thread From: Michal Kazior @ 2016-06-10 9:10 UTC (permalink / raw) To: Felix Fietkau; +Cc: A. Benz, linux-wireless On 10 June 2016 at 10:55, Felix Fietkau <nbd@nbd.name> wrote: > On 2016-06-10 10:50, Michal Kazior wrote: >> On 9 June 2016 at 09:46, A. Benz <ash.benz@bk.ru> wrote: >>> Dear All, >>> >>> I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600). >>> With compat-wireless-2016-05-12, I observed traces attached below. >>> The router is unstable and eventually reboots by itself (randomly). >>> >>> Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing >>> else is changed (software-wise or hardware). >>> This was confirmed with other users. >>> >>> A new compile with the fixes below: >>> https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0 >>> >>> Did not solve the problem. >>> >>> Please let me know if I need to provide any further information. >>> >>> ------------[ cut here ]------------ >>> WARNING: CPU: 0 PID: 558 at >>> compat-wireless-2016-05-12/net/mac80211/rx.c:4068 >>> ieee80211_rx_napi+0x8c/0x8a4 [mac80211]() >> >> Can you post what is at rx.c line 4068 (and +/- 3 lines), please? > It's early in ieee80211_rx_napi: > > sband = local->hw.wiphy->bands[status->band]; > if (WARN_ON(!sband)) > goto drop; Thanks. > I could not easily find a scenario under which status->band would not be > set properly by the driver, so my guess is there is some nasty memory > corruption going on. Hmm.. could it be related to ath10k not fulfilling (some) NAPI's locking requirements and thus ending up with, e.g. linked-list mayhem? Michał ^ permalink raw reply [flat|nested] 15+ messages in thread
* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 2016-06-10 9:10 ` Michal Kazior @ 2016-06-10 9:20 ` Johannes Berg 0 siblings, 0 replies; 15+ messages in thread From: Johannes Berg @ 2016-06-10 9:20 UTC (permalink / raw) To: Michal Kazior, Felix Fietkau; +Cc: A. Benz, linux-wireless On Fri, 2016-06-10 at 11:10 +0200, Michal Kazior wrote: > > Hmm.. could it be related to ath10k not fulfilling (some) NAPI's > locking requirements and thus ending up with, e.g. linked-list > mayhem? > Shoudln't matter since ath10k doesn't actually use rx_napi()? johannes ^ permalink raw reply [flat|nested] 15+ messages in thread
* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 2016-06-10 8:55 ` Felix Fietkau @ 2016-06-10 12:22 ` Kalle Valo 2016-06-10 12:22 ` Kalle Valo 1 sibling, 0 replies; 15+ messages in thread From: Kalle Valo @ 2016-06-10 12:22 UTC (permalink / raw) To: Felix Fietkau; +Cc: Michal Kazior, A. Benz, linux-wireless, ath10k Felix Fietkau <nbd@nbd.name> writes: > On 2016-06-10 10:50, Michal Kazior wrote: >> On 9 June 2016 at 09:46, A. Benz <ash.benz@bk.ru> wrote: >>> Dear All, >>> >>> I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600). >>> With compat-wireless-2016-05-12, I observed traces attached below. >>> The router is unstable and eventually reboots by itself (randomly). >>> >>> Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing >>> else is changed (software-wise or hardware). >>> This was confirmed with other users. >>> >>> A new compile with the fixes below: >>> https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0 >>> >>> Did not solve the problem. >>> >>> Please let me know if I need to provide any further information. >>> >>> ------------[ cut here ]------------ >>> WARNING: CPU: 0 PID: 558 at >>> compat-wireless-2016-05-12/net/mac80211/rx.c:4068 >>> ieee80211_rx_napi+0x8c/0x8a4 [mac80211]() >> >> Can you post what is at rx.c line 4068 (and +/- 3 lines), please? > It's early in ieee80211_rx_napi: > > sband = local->hw.wiphy->bands[status->band]; > if (WARN_ON(!sband)) > goto drop; > > I could not easily find a scenario under which status->band would not be > set properly by the driver, so my guess is there is some nasty memory > corruption going on. > > FWIW, I've received several reports like this from different people on > different devices. They're also confirming that reverting to the > snapshot from January makes things stable again. Adding ath10k list to the loop. -- Kalle Valo ^ permalink raw reply [flat|nested] 15+ messages in thread
* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 @ 2016-06-10 12:22 ` Kalle Valo 0 siblings, 0 replies; 15+ messages in thread From: Kalle Valo @ 2016-06-10 12:22 UTC (permalink / raw) To: Felix Fietkau; +Cc: linux-wireless, Michal Kazior, ath10k, A. Benz Felix Fietkau <nbd@nbd.name> writes: > On 2016-06-10 10:50, Michal Kazior wrote: >> On 9 June 2016 at 09:46, A. Benz <ash.benz@bk.ru> wrote: >>> Dear All, >>> >>> I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600). >>> With compat-wireless-2016-05-12, I observed traces attached below. >>> The router is unstable and eventually reboots by itself (randomly). >>> >>> Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing >>> else is changed (software-wise or hardware). >>> This was confirmed with other users. >>> >>> A new compile with the fixes below: >>> https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0 >>> >>> Did not solve the problem. >>> >>> Please let me know if I need to provide any further information. >>> >>> ------------[ cut here ]------------ >>> WARNING: CPU: 0 PID: 558 at >>> compat-wireless-2016-05-12/net/mac80211/rx.c:4068 >>> ieee80211_rx_napi+0x8c/0x8a4 [mac80211]() >> >> Can you post what is at rx.c line 4068 (and +/- 3 lines), please? > It's early in ieee80211_rx_napi: > > sband = local->hw.wiphy->bands[status->band]; > if (WARN_ON(!sband)) > goto drop; > > I could not easily find a scenario under which status->band would not be > set properly by the driver, so my guess is there is some nasty memory > corruption going on. > > FWIW, I've received several reports like this from different people on > different devices. They're also confirming that reverting to the > snapshot from January makes things stable again. Adding ath10k list to the loop. -- Kalle Valo _______________________________________________ ath10k mailing list ath10k@lists.infradead.org http://lists.infradead.org/mailman/listinfo/ath10k ^ permalink raw reply [flat|nested] 15+ messages in thread
* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 2016-06-10 12:22 ` Kalle Valo @ 2016-06-10 12:52 ` Ben Greear -1 siblings, 0 replies; 15+ messages in thread From: Ben Greear @ 2016-06-10 12:52 UTC (permalink / raw) To: Kalle Valo, Felix Fietkau; +Cc: linux-wireless, Michal Kazior, ath10k, A. Benz On 06/10/2016 05:22 AM, Kalle Valo wrote: > Felix Fietkau <nbd@nbd.name> writes: > >> On 2016-06-10 10:50, Michal Kazior wrote: >>> On 9 June 2016 at 09:46, A. Benz <ash.benz@bk.ru> wrote: >>>> Dear All, >>>> >>>> I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600). >>>> With compat-wireless-2016-05-12, I observed traces attached below. >>>> The router is unstable and eventually reboots by itself (randomly). >>>> >>>> Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing >>>> else is changed (software-wise or hardware). >>>> This was confirmed with other users. >>>> >>>> A new compile with the fixes below: >>>> https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0 >>>> >>>> Did not solve the problem. >>>> >>>> Please let me know if I need to provide any further information. >>>> >>>> ------------[ cut here ]------------ >>>> WARNING: CPU: 0 PID: 558 at >>>> compat-wireless-2016-05-12/net/mac80211/rx.c:4068 >>>> ieee80211_rx_napi+0x8c/0x8a4 [mac80211]() >>> >>> Can you post what is at rx.c line 4068 (and +/- 3 lines), please? >> It's early in ieee80211_rx_napi: >> >> sband = local->hw.wiphy->bands[status->band]; >> if (WARN_ON(!sband)) >> goto drop; >> >> I could not easily find a scenario under which status->band would not be >> set properly by the driver, so my guess is there is some nasty memory >> corruption going on. >> >> FWIW, I've received several reports like this from different people on >> different devices. They're also confirming that reverting to the >> snapshot from January makes things stable again. > > Adding ath10k list to the loop. This looks a lot like the problems I was having. Two of these 5 patches recently made it upstream (but may not be in LEDE yet), but the other patches also were related to memory corruption. See my patches posted on 4/1/16: https://patchwork.kernel.org/project/ath10k/list/ I don't know where the 5/5 patch ended up. Thanks, Ben -- Ben Greear <greearb@candelatech.com> Candela Technologies Inc http://www.candelatech.com ^ permalink raw reply [flat|nested] 15+ messages in thread
* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 @ 2016-06-10 12:52 ` Ben Greear 0 siblings, 0 replies; 15+ messages in thread From: Ben Greear @ 2016-06-10 12:52 UTC (permalink / raw) To: Kalle Valo, Felix Fietkau; +Cc: linux-wireless, Michal Kazior, ath10k, A. Benz On 06/10/2016 05:22 AM, Kalle Valo wrote: > Felix Fietkau <nbd@nbd.name> writes: > >> On 2016-06-10 10:50, Michal Kazior wrote: >>> On 9 June 2016 at 09:46, A. Benz <ash.benz@bk.ru> wrote: >>>> Dear All, >>>> >>>> I am using LEDE on my IPQ806x (QCA9980) system (Archer C2600). >>>> With compat-wireless-2016-05-12, I observed traces attached below. >>>> The router is unstable and eventually reboots by itself (randomly). >>>> >>>> Upon reverting to compat-wireless-2016-01, the issue disappears. Nothing >>>> else is changed (software-wise or hardware). >>>> This was confirmed with other users. >>>> >>>> A new compile with the fixes below: >>>> https://git.lede-project.org/?p=lede/nbd/staging.git;a=commit;h=858e26f3c0fc11231f25497cbb2ddca1e5f101e0 >>>> >>>> Did not solve the problem. >>>> >>>> Please let me know if I need to provide any further information. >>>> >>>> ------------[ cut here ]------------ >>>> WARNING: CPU: 0 PID: 558 at >>>> compat-wireless-2016-05-12/net/mac80211/rx.c:4068 >>>> ieee80211_rx_napi+0x8c/0x8a4 [mac80211]() >>> >>> Can you post what is at rx.c line 4068 (and +/- 3 lines), please? >> It's early in ieee80211_rx_napi: >> >> sband = local->hw.wiphy->bands[status->band]; >> if (WARN_ON(!sband)) >> goto drop; >> >> I could not easily find a scenario under which status->band would not be >> set properly by the driver, so my guess is there is some nasty memory >> corruption going on. >> >> FWIW, I've received several reports like this from different people on >> different devices. They're also confirming that reverting to the >> snapshot from January makes things stable again. > > Adding ath10k list to the loop. This looks a lot like the problems I was having. Two of these 5 patches recently made it upstream (but may not be in LEDE yet), but the other patches also were related to memory corruption. See my patches posted on 4/1/16: https://patchwork.kernel.org/project/ath10k/list/ I don't know where the 5/5 patch ended up. Thanks, Ben -- Ben Greear <greearb@candelatech.com> Candela Technologies Inc http://www.candelatech.com _______________________________________________ ath10k mailing list ath10k@lists.infradead.org http://lists.infradead.org/mailman/listinfo/ath10k ^ permalink raw reply [flat|nested] 15+ messages in thread
* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 2016-06-10 12:52 ` Ben Greear @ 2016-06-10 12:57 ` Felix Fietkau -1 siblings, 0 replies; 15+ messages in thread From: Felix Fietkau @ 2016-06-10 12:57 UTC (permalink / raw) To: Ben Greear, Kalle Valo; +Cc: linux-wireless, Michal Kazior, ath10k, A. Benz On 2016-06-10 14:52, Ben Greear wrote: > This looks a lot like the problems I was having. > > Two of these 5 patches recently made it upstream (but may not be in LEDE yet), > but the other patches also were related to memory corruption. > > See my patches posted on 4/1/16: > > https://patchwork.kernel.org/project/ath10k/list/ > > I don't know where the 5/5 patch ended up. I had already asked affected users to test with those patches (I have a commit that adds them in my staging tree), but it did not resolve the issue. - Felix ^ permalink raw reply [flat|nested] 15+ messages in thread
* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 @ 2016-06-10 12:57 ` Felix Fietkau 0 siblings, 0 replies; 15+ messages in thread From: Felix Fietkau @ 2016-06-10 12:57 UTC (permalink / raw) To: Ben Greear, Kalle Valo; +Cc: linux-wireless, Michal Kazior, ath10k, A. Benz On 2016-06-10 14:52, Ben Greear wrote: > This looks a lot like the problems I was having. > > Two of these 5 patches recently made it upstream (but may not be in LEDE yet), > but the other patches also were related to memory corruption. > > See my patches posted on 4/1/16: > > https://patchwork.kernel.org/project/ath10k/list/ > > I don't know where the 5/5 patch ended up. I had already asked affected users to test with those patches (I have a commit that adds them in my staging tree), but it did not resolve the issue. - Felix _______________________________________________ ath10k mailing list ath10k@lists.infradead.org http://lists.infradead.org/mailman/listinfo/ath10k ^ permalink raw reply [flat|nested] 15+ messages in thread
* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 2016-06-10 12:57 ` Felix Fietkau @ 2016-06-10 13:16 ` Ben Greear -1 siblings, 0 replies; 15+ messages in thread From: Ben Greear @ 2016-06-10 13:16 UTC (permalink / raw) To: Felix Fietkau, Kalle Valo; +Cc: linux-wireless, Michal Kazior, ath10k, A. Benz On 06/10/2016 05:57 AM, Felix Fietkau wrote: > On 2016-06-10 14:52, Ben Greear wrote: >> This looks a lot like the problems I was having. >> >> Two of these 5 patches recently made it upstream (but may not be in LEDE yet), >> but the other patches also were related to memory corruption. >> >> See my patches posted on 4/1/16: >> >> https://patchwork.kernel.org/project/ath10k/list/ >> >> I don't know where the 5/5 patch ended up. > I had already asked affected users to test with those patches (I have a > commit that adds them in my staging tree), but it did not resolve the issue. Ok, must be something else then. If you can run on x86 under KASAN it may provide some clues..that is how I eventually made progress on the issues I was seeing. My rebase onto 3.7 has been slow and painful, but I should be ready to start testing that sometime soon, maybe I can reproduce something there. Thanks, Ben -- Ben Greear <greearb@candelatech.com> Candela Technologies Inc http://www.candelatech.com ^ permalink raw reply [flat|nested] 15+ messages in thread
* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 @ 2016-06-10 13:16 ` Ben Greear 0 siblings, 0 replies; 15+ messages in thread From: Ben Greear @ 2016-06-10 13:16 UTC (permalink / raw) To: Felix Fietkau, Kalle Valo; +Cc: linux-wireless, Michal Kazior, ath10k, A. Benz On 06/10/2016 05:57 AM, Felix Fietkau wrote: > On 2016-06-10 14:52, Ben Greear wrote: >> This looks a lot like the problems I was having. >> >> Two of these 5 patches recently made it upstream (but may not be in LEDE yet), >> but the other patches also were related to memory corruption. >> >> See my patches posted on 4/1/16: >> >> https://patchwork.kernel.org/project/ath10k/list/ >> >> I don't know where the 5/5 patch ended up. > I had already asked affected users to test with those patches (I have a > commit that adds them in my staging tree), but it did not resolve the issue. Ok, must be something else then. If you can run on x86 under KASAN it may provide some clues..that is how I eventually made progress on the issues I was seeing. My rebase onto 3.7 has been slow and painful, but I should be ready to start testing that sometime soon, maybe I can reproduce something there. Thanks, Ben -- Ben Greear <greearb@candelatech.com> Candela Technologies Inc http://www.candelatech.com _______________________________________________ ath10k mailing list ath10k@lists.infradead.org http://lists.infradead.org/mailman/listinfo/ath10k ^ permalink raw reply [flat|nested] 15+ messages in thread
* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 2016-06-10 13:16 ` Ben Greear @ 2016-07-15 11:49 ` Nagarajan, Ashok Raj -1 siblings, 0 replies; 15+ messages in thread From: Nagarajan, Ashok Raj @ 2016-07-15 11:49 UTC (permalink / raw) To: Ben Greear, nbd, Kalle Valo Cc: linux-wireless, michal.kazior, ath10k, A. Benz > On 06/10/2016 05:57 AM, Felix Fietkau wrote: >>> On 2016-06-10 14:52, Ben Greear wrote: >>> This looks a lot like the problems I was having. >>> >>> Two of these 5 patches recently made it upstream (but may not be in LEDE yet), >>> but the other patches also were related to memory corruption. >>> >>> See my patches posted on 4/1/16: >>> >>> https://patchwork.kernel.org/project/ath10k/list/ >>> >>> I don't know where the 5/5 patch ended up. >> I had already asked affected users to test with those patches (I have a >> commit that adds them in my staging tree), but it did not resolve the issue. > >Ok, must be something else then. > >If you can run on x86 under KASAN it may provide some clues..that is how I eventually >made progress on the issues I was seeing. My rebase onto 3.7 has been slow and painful, >but I should be ready to start testing that sometime soon, maybe I can reproduce something >there. Hi Benz, Could you please check with the following diff if it is solving your issue? diff --git a/drivers/net/wireless/ath/ath10k/htt_rx.c b/drivers/net/wireless/ath/ath10k/htt_rx.c index 6f19fca..c192a41 100644 --- a/drivers/net/wireless/ath/ath10k/htt_rx.c +++ b/drivers/net/wireless/ath/ath10k/htt_rx.c @@ -1528,7 +1528,7 @@ static void ath10k_htt_rx_h_filter(struct ath10k *ar, static int ath10k_htt_rx_handle_amsdu(struct ath10k_htt *htt) { struct ath10k *ar = htt->ar; - static struct ieee80211_rx_status rx_status; + struct ieee80211_rx_status *rx_status = &htt->rx_status; struct sk_buff_head amsdu; int ret; @@ -1553,11 +1553,11 @@ static int ath10k_htt_rx_handle_amsdu(struct ath10k_htt *htt) } ath10k_pktlog_rx(ar, &amsdu); - ath10k_htt_rx_h_ppdu(ar, &amsdu, &rx_status, 0xffff); + ath10k_htt_rx_h_ppdu(ar, &amsdu, rx_status, 0xffff); ath10k_htt_rx_h_unchain(ar, &amsdu, ret > 0); - ath10k_htt_rx_h_filter(ar, &amsdu, &rx_status); - ath10k_htt_rx_h_mpdu(ar, &amsdu, &rx_status); - ath10k_htt_rx_h_deliver(ar, &amsdu, &rx_status); + ath10k_htt_rx_h_filter(ar, &amsdu, rx_status); + ath10k_htt_rx_h_mpdu(ar, &amsdu, rx_status); + ath10k_htt_rx_h_deliver(ar, &amsdu, rx_status); Thanks, Ashok >Thanks, >Ben ^ permalink raw reply related [flat|nested] 15+ messages in thread
* Re: ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 @ 2016-07-15 11:49 ` Nagarajan, Ashok Raj 0 siblings, 0 replies; 15+ messages in thread From: Nagarajan, Ashok Raj @ 2016-07-15 11:49 UTC (permalink / raw) To: Ben Greear, nbd, Kalle Valo Cc: linux-wireless, michal.kazior, ath10k, A. Benz > On 06/10/2016 05:57 AM, Felix Fietkau wrote: >>> On 2016-06-10 14:52, Ben Greear wrote: >>> This looks a lot like the problems I was having. >>> >>> Two of these 5 patches recently made it upstream (but may not be in LEDE yet), >>> but the other patches also were related to memory corruption. >>> >>> See my patches posted on 4/1/16: >>> >>> https://patchwork.kernel.org/project/ath10k/list/ >>> >>> I don't know where the 5/5 patch ended up. >> I had already asked affected users to test with those patches (I have a >> commit that adds them in my staging tree), but it did not resolve the issue. > >Ok, must be something else then. > >If you can run on x86 under KASAN it may provide some clues..that is how I eventually >made progress on the issues I was seeing. My rebase onto 3.7 has been slow and painful, >but I should be ready to start testing that sometime soon, maybe I can reproduce something >there. Hi Benz, Could you please check with the following diff if it is solving your issue? diff --git a/drivers/net/wireless/ath/ath10k/htt_rx.c b/drivers/net/wireless/ath/ath10k/htt_rx.c index 6f19fca..c192a41 100644 --- a/drivers/net/wireless/ath/ath10k/htt_rx.c +++ b/drivers/net/wireless/ath/ath10k/htt_rx.c @@ -1528,7 +1528,7 @@ static void ath10k_htt_rx_h_filter(struct ath10k *ar, static int ath10k_htt_rx_handle_amsdu(struct ath10k_htt *htt) { struct ath10k *ar = htt->ar; - static struct ieee80211_rx_status rx_status; + struct ieee80211_rx_status *rx_status = &htt->rx_status; struct sk_buff_head amsdu; int ret; @@ -1553,11 +1553,11 @@ static int ath10k_htt_rx_handle_amsdu(struct ath10k_htt *htt) } ath10k_pktlog_rx(ar, &amsdu); - ath10k_htt_rx_h_ppdu(ar, &amsdu, &rx_status, 0xffff); + ath10k_htt_rx_h_ppdu(ar, &amsdu, rx_status, 0xffff); ath10k_htt_rx_h_unchain(ar, &amsdu, ret > 0); - ath10k_htt_rx_h_filter(ar, &amsdu, &rx_status); - ath10k_htt_rx_h_mpdu(ar, &amsdu, &rx_status); - ath10k_htt_rx_h_deliver(ar, &amsdu, &rx_status); + ath10k_htt_rx_h_filter(ar, &amsdu, rx_status); + ath10k_htt_rx_h_mpdu(ar, &amsdu, rx_status); + ath10k_htt_rx_h_deliver(ar, &amsdu, rx_status); Thanks, Ashok >Thanks, >Ben _______________________________________________ ath10k mailing list ath10k@lists.infradead.org http://lists.infradead.org/mailman/listinfo/ath10k ^ permalink raw reply related [flat|nested] 15+ messages in thread
end of thread, other threads:[~2016-07-15 11:50 UTC | newest] Thread overview: 15+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2016-06-09 7:46 ath10k/QCA9980 - Issues introduced in wireless testing 2016-05 A. Benz 2016-06-10 8:50 ` Michal Kazior 2016-06-10 8:55 ` Felix Fietkau 2016-06-10 9:10 ` Michal Kazior 2016-06-10 9:20 ` Johannes Berg 2016-06-10 12:22 ` Kalle Valo 2016-06-10 12:22 ` Kalle Valo 2016-06-10 12:52 ` Ben Greear 2016-06-10 12:52 ` Ben Greear 2016-06-10 12:57 ` Felix Fietkau 2016-06-10 12:57 ` Felix Fietkau 2016-06-10 13:16 ` Ben Greear 2016-06-10 13:16 ` Ben Greear 2016-07-15 11:49 ` Nagarajan, Ashok Raj 2016-07-15 11:49 ` Nagarajan, Ashok Raj
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.