All of lore.kernel.org
 help / color / mirror / Atom feed
* More than one ath10k NIC in 3.19.0-rc5?
@ 2015-01-20 21:53 Ben Greear
  2015-01-21  2:13 ` Sebastian Gottschall
  0 siblings, 1 reply; 16+ messages in thread
From: Ben Greear @ 2015-01-20 21:53 UTC (permalink / raw)
  To: ath10k

I tried running a patched version of 3.19.0-rc5 kernel, and it seems to work
fine when there is one ath10k NIC in the system, but when there is two it
doesn't try to load the firmware.

Hacked 3.17.8+ works fine.

Anyone tried more than one ath10k NIC in 3.19 kernel?

Thanks,
Ben

-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com


_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: More than one ath10k NIC in 3.19.0-rc5?
  2015-01-20 21:53 More than one ath10k NIC in 3.19.0-rc5? Ben Greear
@ 2015-01-21  2:13 ` Sebastian Gottschall
  2015-01-21  2:49   ` Ben Greear
  0 siblings, 1 reply; 16+ messages in thread
From: Sebastian Gottschall @ 2015-01-21  2:13 UTC (permalink / raw)
  To: ath10k

Am 20.01.2015 um 22:53 schrieb Ben Greear:
> I tried running a patched version of 3.19.0-rc5 kernel, and it seems to work
> fine when there is one ath10k NIC in the system, but when there is two it
> doesn't try to load the firmware.
>
> Hacked 3.17.8+ works fine.
>
> Anyone tried more than one ath10k NIC in 3.19 kernel?
>
> Thanks,
> Ben
standard x86 system? i just can say that with 3.18 i have big troubles 
with the new pci bus code introduced on several embedded devices.
so maybe the reason is here?
>


_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: More than one ath10k NIC in 3.19.0-rc5?
  2015-01-21  2:13 ` Sebastian Gottschall
@ 2015-01-21  2:49   ` Ben Greear
  2015-01-21 19:01     ` Ben Greear
  0 siblings, 1 reply; 16+ messages in thread
From: Ben Greear @ 2015-01-21  2:49 UTC (permalink / raw)
  To: Sebastian Gottschall; +Cc: ath10k

On 01/20/2015 06:13 PM, Sebastian Gottschall wrote:
> Am 20.01.2015 um 22:53 schrieb Ben Greear:
>> I tried running a patched version of 3.19.0-rc5 kernel, and it seems to work
>> fine when there is one ath10k NIC in the system, but when there is two it
>> doesn't try to load the firmware.
>>
>> Hacked 3.17.8+ works fine.
>>
>> Anyone tried more than one ath10k NIC in 3.19 kernel?
>>
>> Thanks,
>> Ben
> standard x86 system? i just can say that with 3.18 i have big troubles with the new pci bus code introduced on several embedded devices.
> so maybe the reason is here?

Standard 64-bit x86.

Thanks,
Ben


-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com


_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: More than one ath10k NIC in 3.19.0-rc5?
  2015-01-21  2:49   ` Ben Greear
@ 2015-01-21 19:01     ` Ben Greear
  2015-01-21 19:47       ` Ben Greear
  0 siblings, 1 reply; 16+ messages in thread
From: Ben Greear @ 2015-01-21 19:01 UTC (permalink / raw)
  To: Sebastian Gottschall; +Cc: ath10k

On 01/20/2015 06:49 PM, Ben Greear wrote:
> On 01/20/2015 06:13 PM, Sebastian Gottschall wrote:
>> Am 20.01.2015 um 22:53 schrieb Ben Greear:
>>> I tried running a patched version of 3.19.0-rc5 kernel, and it seems to work
>>> fine when there is one ath10k NIC in the system, but when there is two it
>>> doesn't try to load the firmware.
>>>
>>> Hacked 3.17.8+ works fine.
>>>
>>> Anyone tried more than one ath10k NIC in 3.19 kernel?
>>>
>>> Thanks,
>>> Ben
>> standard x86 system? i just can say that with 3.18 i have big troubles with the new pci bus code introduced on several embedded devices.
>> so maybe the reason is here?
> 
> Standard 64-bit x86.

Looks like 3.19.0-rc2 is OK, -rc5 is busted.  I'll bisect...

Thanks,
Ben

> 
> Thanks,
> Ben
> 
> 


-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com


_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: More than one ath10k NIC in 3.19.0-rc5?
  2015-01-21 19:01     ` Ben Greear
@ 2015-01-21 19:47       ` Ben Greear
  2015-01-22 21:08         ` More than one ath10k NIC will not load (bisected) Ben Greear
  0 siblings, 1 reply; 16+ messages in thread
From: Ben Greear @ 2015-01-21 19:47 UTC (permalink / raw)
  To: ath10k

On 01/21/2015 11:01 AM, Ben Greear wrote:
> On 01/20/2015 06:49 PM, Ben Greear wrote:
>> On 01/20/2015 06:13 PM, Sebastian Gottschall wrote:
>>> Am 20.01.2015 um 22:53 schrieb Ben Greear:
>>>> I tried running a patched version of 3.19.0-rc5 kernel, and it seems to work
>>>> fine when there is one ath10k NIC in the system, but when there is two it
>>>> doesn't try to load the firmware.
>>>>
>>>> Hacked 3.17.8+ works fine.
>>>>
>>>> Anyone tried more than one ath10k NIC in 3.19 kernel?
>>>>
>>>> Thanks,
>>>> Ben
>>> standard x86 system? i just can say that with 3.18 i have big troubles with the new pci bus code introduced on several embedded devices.
>>> so maybe the reason is here?
>>
>> Standard 64-bit x86.
> 
> Looks like 3.19.0-rc2 is OK, -rc5 is busted.  I'll bisect...

Err, no.  rc2 is busted too...sometimes first NIC loads, sometimes
second, but not both at once....doing bigger bisect...

Ben


-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com


_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: More than one ath10k NIC will not load (bisected)
  2015-01-21 19:47       ` Ben Greear
@ 2015-01-22 21:08         ` Ben Greear
  2015-01-23  6:43           ` Michal Kazior
  0 siblings, 1 reply; 16+ messages in thread
From: Ben Greear @ 2015-01-22 21:08 UTC (permalink / raw)
  To: ath10k, Michal Kazior


On 01/21/2015 11:47 AM, Ben Greear wrote:
> On 01/21/2015 11:01 AM, Ben Greear wrote:
>> On 01/20/2015 06:49 PM, Ben Greear wrote:
>>> On 01/20/2015 06:13 PM, Sebastian Gottschall wrote:
>>>> Am 20.01.2015 um 22:53 schrieb Ben Greear:
>>>>> I tried running a patched version of 3.19.0-rc5 kernel, and it seems to work
>>>>> fine when there is one ath10k NIC in the system, but when there is two it
>>>>> doesn't try to load the firmware.

Ok, this bisects to this commit.  First (or sometimes second) NIC will load,
but the other one will not load firmware.  No obvious errors in dmesg.

Platform is x86-64, Fedora 20.  NICS:

03:00.0 Network controller: Qualcomm Atheros QCA988x 802.11ac Wireless Network Adapter
04:00.0 Network controller: Qualcomm Atheros QCA988x 802.11ac Wireless Network Adapter


7c0f0e3c9797e5a08d15da68812f24eadaf9e82b is the first bad commit
commit 7c0f0e3c9797e5a08d15da68812f24eadaf9e82b
Author: Michal Kazior <michal.kazior@tieto.com>
Date:   Mon Oct 20 14:14:38 2014 +0200

    ath10k: mask/unmask msi fw irq

    This was the final missing bit to making sure the
    device doesn't assert interrupts to host.

    This should fix possible race when target crashes
    during driver teardown.

    This also removes an early warm reset workaround
    during pci probing.

    Signed-off-by: Michal Kazior <michal.kazior@tieto.com>
    Signed-off-by: Kalle Valo <kvalo@qca.qualcomm.com>

:040000 040000 f7ab5ff9f7e3713118dab6f23d02f11003de6fef fa5cb5ee76bdee2c654fcb8e6b8b1283c225bcc2 M	drivers

Thanks,
Ben

-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com


_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: More than one ath10k NIC will not load (bisected)
  2015-01-22 21:08         ` More than one ath10k NIC will not load (bisected) Ben Greear
@ 2015-01-23  6:43           ` Michal Kazior
  2015-01-23 15:54             ` Ben Greear
  0 siblings, 1 reply; 16+ messages in thread
From: Michal Kazior @ 2015-01-23  6:43 UTC (permalink / raw)
  To: Ben Greear; +Cc: ath10k

On 22 January 2015 at 22:08, Ben Greear <greearb@candelatech.com> wrote:
>
> On 01/21/2015 11:47 AM, Ben Greear wrote:
>> On 01/21/2015 11:01 AM, Ben Greear wrote:
>>> On 01/20/2015 06:49 PM, Ben Greear wrote:
>>>> On 01/20/2015 06:13 PM, Sebastian Gottschall wrote:
>>>>> Am 20.01.2015 um 22:53 schrieb Ben Greear:
>>>>>> I tried running a patched version of 3.19.0-rc5 kernel, and it seems to work
>>>>>> fine when there is one ath10k NIC in the system, but when there is two it
>>>>>> doesn't try to load the firmware.
>
> Ok, this bisects to this commit.  First (or sometimes second) NIC will load,
> but the other one will not load firmware.  No obvious errors in dmesg.

So I'm confused. If it doesn't load firmware then there has to be some
error in dmesg. If there are no errors then how come you know it
doesn't load firmware? Is it being probed at all?


> Platform is x86-64, Fedora 20.  NICS:
>
> 03:00.0 Network controller: Qualcomm Atheros QCA988x 802.11ac Wireless Network Adapter
> 04:00.0 Network controller: Qualcomm Atheros QCA988x 802.11ac Wireless Network Adapter
>
>
> 7c0f0e3c9797e5a08d15da68812f24eadaf9e82b is the first bad commit
> commit 7c0f0e3c9797e5a08d15da68812f24eadaf9e82b
> Author: Michal Kazior <michal.kazior@tieto.com>
> Date:   Mon Oct 20 14:14:38 2014 +0200
>
>     ath10k: mask/unmask msi fw irq
>
>     This was the final missing bit to making sure the
>     device doesn't assert interrupts to host.
>
>     This should fix possible race when target crashes
>     during driver teardown.
>
>     This also removes an early warm reset workaround
>     during pci probing.
>
>     Signed-off-by: Michal Kazior <michal.kazior@tieto.com>
>     Signed-off-by: Kalle Valo <kvalo@qca.qualcomm.com>
>
> :040000 040000 f7ab5ff9f7e3713118dab6f23d02f11003de6fef fa5cb5ee76bdee2c654fcb8e6b8b1283c225bcc2 M      drivers

Hmm.. This removes warm_reset in probe function but I fail to see how
this could end up not loading one of the NIC *silently*?

Anyway there's a pending patch which adds the reset back:
https://github.com/kvalo/ath/commit/bdcd6f4e4ac5d2d2a56da4813f56655e6db0ee45
. You might want to try it and see if it helps.


Michał

_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: More than one ath10k NIC will not load (bisected)
  2015-01-23  6:43           ` Michal Kazior
@ 2015-01-23 15:54             ` Ben Greear
  2015-01-24 18:53               ` Janusz Dziedzic
  2015-02-04  9:07               ` Kalle Valo
  0 siblings, 2 replies; 16+ messages in thread
From: Ben Greear @ 2015-01-23 15:54 UTC (permalink / raw)
  To: Michal Kazior; +Cc: ath10k



On 01/22/2015 10:43 PM, Michal Kazior wrote:
> On 22 January 2015 at 22:08, Ben Greear <greearb@candelatech.com> wrote:
>>
>> On 01/21/2015 11:47 AM, Ben Greear wrote:
>>> On 01/21/2015 11:01 AM, Ben Greear wrote:
>>>> On 01/20/2015 06:49 PM, Ben Greear wrote:
>>>>> On 01/20/2015 06:13 PM, Sebastian Gottschall wrote:
>>>>>> Am 20.01.2015 um 22:53 schrieb Ben Greear:
>>>>>>> I tried running a patched version of 3.19.0-rc5 kernel, and it seems to work
>>>>>>> fine when there is one ath10k NIC in the system, but when there is two it
>>>>>>> doesn't try to load the firmware.
>>
>> Ok, this bisects to this commit.  First (or sometimes second) NIC will load,
>> but the other one will not load firmware.  No obvious errors in dmesg.
>
> So I'm confused. If it doesn't load firmware then there has to be some
> error in dmesg. If there are no errors then how come you know it
> doesn't load firmware? Is it being probed at all?
>
>
>> Platform is x86-64, Fedora 20.  NICS:
>>
>> 03:00.0 Network controller: Qualcomm Atheros QCA988x 802.11ac Wireless Network Adapter
>> 04:00.0 Network controller: Qualcomm Atheros QCA988x 802.11ac Wireless Network Adapter
>>
>>
>> 7c0f0e3c9797e5a08d15da68812f24eadaf9e82b is the first bad commit
>> commit 7c0f0e3c9797e5a08d15da68812f24eadaf9e82b
>> Author: Michal Kazior <michal.kazior@tieto.com>
>> Date:   Mon Oct 20 14:14:38 2014 +0200
>>
>>      ath10k: mask/unmask msi fw irq
>>
>>      This was the final missing bit to making sure the
>>      device doesn't assert interrupts to host.
>>
>>      This should fix possible race when target crashes
>>      during driver teardown.
>>
>>      This also removes an early warm reset workaround
>>      during pci probing.
>>
>>      Signed-off-by: Michal Kazior <michal.kazior@tieto.com>
>>      Signed-off-by: Kalle Valo <kvalo@qca.qualcomm.com>
>>
>> :040000 040000 f7ab5ff9f7e3713118dab6f23d02f11003de6fef fa5cb5ee76bdee2c654fcb8e6b8b1283c225bcc2 M      drivers
>
> Hmm.. This removes warm_reset in probe function but I fail to see how
> this could end up not loading one of the NIC *silently*?
>
> Anyway there's a pending patch which adds the reset back:
> https://github.com/kvalo/ath/commit/bdcd6f4e4ac5d2d2a56da4813f56655e6db0ee45
> . You might want to try it and see if it helps.

Reverting the patch made it work again for me.

I don't understand that code well, but perhaps you are disabling
a shared interrupt that silently stops the second NIC from
being able to do it's thing?

Do you have a PC with 2 NICs in it that you could try yourself?

I can grab you the logs of a failure to boot later today.

Thanks,
Ben

>
>
> Michał
>

-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com

_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: More than one ath10k NIC will not load (bisected)
  2015-01-23 15:54             ` Ben Greear
@ 2015-01-24 18:53               ` Janusz Dziedzic
  2015-02-05 20:50                 ` AW: " Conrad Kostecki
  2015-02-04  9:07               ` Kalle Valo
  1 sibling, 1 reply; 16+ messages in thread
From: Janusz Dziedzic @ 2015-01-24 18:53 UTC (permalink / raw)
  To: Ben Greear; +Cc: Michal Kazior, ath10k

On 23 January 2015 at 16:54, Ben Greear <greearb@candelatech.com> wrote:
>
>
> On 01/22/2015 10:43 PM, Michal Kazior wrote:
>>
>> On 22 January 2015 at 22:08, Ben Greear <greearb@candelatech.com> wrote:
>>>
>>>
>>> On 01/21/2015 11:47 AM, Ben Greear wrote:
>>>>
>>>> On 01/21/2015 11:01 AM, Ben Greear wrote:
>>>>>
>>>>> On 01/20/2015 06:49 PM, Ben Greear wrote:
>>>>>>
>>>>>> On 01/20/2015 06:13 PM, Sebastian Gottschall wrote:
>>>>>>>
>>>>>>> Am 20.01.2015 um 22:53 schrieb Ben Greear:
>>>>>>>>
>>>>>>>> I tried running a patched version of 3.19.0-rc5 kernel, and it seems
>>>>>>>> to work
>>>>>>>> fine when there is one ath10k NIC in the system, but when there is
>>>>>>>> two it
>>>>>>>> doesn't try to load the firmware.
>>>
>>>
>>> Ok, this bisects to this commit.  First (or sometimes second) NIC will
>>> load,
>>> but the other one will not load firmware.  No obvious errors in dmesg.
>>
>>
>> So I'm confused. If it doesn't load firmware then there has to be some
>> error in dmesg. If there are no errors then how come you know it
>> doesn't load firmware? Is it being probed at all?
>>
>>
>>> Platform is x86-64, Fedora 20.  NICS:
>>>
>>> 03:00.0 Network controller: Qualcomm Atheros QCA988x 802.11ac Wireless
>>> Network Adapter
>>> 04:00.0 Network controller: Qualcomm Atheros QCA988x 802.11ac Wireless
>>> Network Adapter
>>>
>>>
>>> 7c0f0e3c9797e5a08d15da68812f24eadaf9e82b is the first bad commit
>>> commit 7c0f0e3c9797e5a08d15da68812f24eadaf9e82b
>>> Author: Michal Kazior <michal.kazior@tieto.com>
>>> Date:   Mon Oct 20 14:14:38 2014 +0200
>>>
>>>      ath10k: mask/unmask msi fw irq
>>>
>>>      This was the final missing bit to making sure the
>>>      device doesn't assert interrupts to host.
>>>
>>>      This should fix possible race when target crashes
>>>      during driver teardown.
>>>
>>>      This also removes an early warm reset workaround
>>>      during pci probing.
>>>
>>>      Signed-off-by: Michal Kazior <michal.kazior@tieto.com>
>>>      Signed-off-by: Kalle Valo <kvalo@qca.qualcomm.com>
>>>
>>> :040000 040000 f7ab5ff9f7e3713118dab6f23d02f11003de6fef
>>> fa5cb5ee76bdee2c654fcb8e6b8b1283c225bcc2 M      drivers
>>
>>
>> Hmm.. This removes warm_reset in probe function but I fail to see how
>> this could end up not loading one of the NIC *silently*?
>>
>> Anyway there's a pending patch which adds the reset back:
>>
>> https://github.com/kvalo/ath/commit/bdcd6f4e4ac5d2d2a56da4813f56655e6db0ee45
>> . You might want to try it and see if it helps.
>
>
> Reverting the patch made it work again for me.
>
> I don't understand that code well, but perhaps you are disabling
> a shared interrupt that silently stops the second NIC from
> being able to do it's thing?
>
> Do you have a PC with 2 NICs in it that you could try yourself?
>
I have laptop with 2 ath10k cards (one internal one external), and yes
I had some problems, but that was timeout issue (I am not sure,
reading chip_id), but we fixed this.

> I can grab you the logs of a failure to boot later today.
>
Please send logs.

BR
Janusz

_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: More than one ath10k NIC will not load (bisected)
  2015-01-23 15:54             ` Ben Greear
  2015-01-24 18:53               ` Janusz Dziedzic
@ 2015-02-04  9:07               ` Kalle Valo
  2015-02-04  9:35                 ` Michal Kazior
  1 sibling, 1 reply; 16+ messages in thread
From: Kalle Valo @ 2015-02-04  9:07 UTC (permalink / raw)
  To: Ben Greear; +Cc: Michal Kazior, ath10k

Ben Greear <greearb@candelatech.com> writes:

>> Hmm.. This removes warm_reset in probe function but I fail to see how
>> this could end up not loading one of the NIC *silently*?
>>
>> Anyway there's a pending patch which adds the reset back:
>> https://github.com/kvalo/ath/commit/bdcd6f4e4ac5d2d2a56da4813f56655e6db0ee45
>> . You might want to try it and see if it helps.
>
> Reverting the patch made it work again for me.
>
> I don't understand that code well, but perhaps you are disabling
> a shared interrupt that silently stops the second NIC from
> being able to do it's thing?
>
> Do you have a PC with 2 NICs in it that you could try yourself?
>
> I can grab you the logs of a failure to boot later today.

What should we do with this one? I didn't look at the details yet, but
do we have any other option than to revert?

-- 
Kalle Valo

_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: More than one ath10k NIC will not load (bisected)
  2015-02-04  9:07               ` Kalle Valo
@ 2015-02-04  9:35                 ` Michal Kazior
  2015-02-04 12:02                   ` Kalle Valo
  2015-02-04 15:23                   ` Ben Greear
  0 siblings, 2 replies; 16+ messages in thread
From: Michal Kazior @ 2015-02-04  9:35 UTC (permalink / raw)
  To: Kalle Valo; +Cc: Ben Greear, ath10k

On 4 February 2015 at 10:07, Kalle Valo <kvalo@qca.qualcomm.com> wrote:
> Ben Greear <greearb@candelatech.com> writes:
>
>>> Hmm.. This removes warm_reset in probe function but I fail to see how
>>> this could end up not loading one of the NIC *silently*?
>>>
>>> Anyway there's a pending patch which adds the reset back:
>>> https://github.com/kvalo/ath/commit/bdcd6f4e4ac5d2d2a56da4813f56655e6db0ee45
>>> . You might want to try it and see if it helps.
>>
>> Reverting the patch made it work again for me.
>>
>> I don't understand that code well, but perhaps you are disabling
>> a shared interrupt that silently stops the second NIC from
>> being able to do it's thing?
>>
>> Do you have a PC with 2 NICs in it that you could try yourself?
>>
>> I can grab you the logs of a failure to boot later today.
>
> What should we do with this one? I didn't look at the details yet, but
> do we have any other option than to revert?

I believe this is an issue in Ben's userspace (he sent me logs
privately) or some sort of kernel event bug. It basically looked like
this: both devices were detected by ath10k and both started
register_work. One of the devices loaded all the way while the other
tried to load a few non-existing firmware files and it stopped. Few
minutes later there was a hung task splat pointing to
request_firmware() called from ath10k suggesting userspace didn't
handle firmware request.

The "offending" patch effectively removed 200ms from probe() in
ath10k. This could've change timing on request_firmware() calls on
Ben's system. Btw. the 200ms is back again now with
1a7fecb766c83dace747f42b25bbb544b00a0163 ("ath10k: reset chip before
reading chip_id in probe").

Marek tried running 2 qca988x on his laptop some time ago (with and
without the extra timing) and didn't have any issues.


Michał

_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: More than one ath10k NIC will not load (bisected)
  2015-02-04  9:35                 ` Michal Kazior
@ 2015-02-04 12:02                   ` Kalle Valo
  2015-02-04 15:23                   ` Ben Greear
  1 sibling, 0 replies; 16+ messages in thread
From: Kalle Valo @ 2015-02-04 12:02 UTC (permalink / raw)
  To: Michal Kazior; +Cc: Ben Greear, ath10k

Michal Kazior <michal.kazior@tieto.com> writes:

> On 4 February 2015 at 10:07, Kalle Valo <kvalo@qca.qualcomm.com> wrote:
>> Ben Greear <greearb@candelatech.com> writes:
>>
>>>> Hmm.. This removes warm_reset in probe function but I fail to see how
>>>> this could end up not loading one of the NIC *silently*?
>>>>
>>>> Anyway there's a pending patch which adds the reset back:
>>>> https://github.com/kvalo/ath/commit/bdcd6f4e4ac5d2d2a56da4813f56655e6db0ee45
>>>> . You might want to try it and see if it helps.
>>>
>>> Reverting the patch made it work again for me.
>>>
>>> I don't understand that code well, but perhaps you are disabling
>>> a shared interrupt that silently stops the second NIC from
>>> being able to do it's thing?
>>>
>>> Do you have a PC with 2 NICs in it that you could try yourself?
>>>
>>> I can grab you the logs of a failure to boot later today.
>>
>> What should we do with this one? I didn't look at the details yet, but
>> do we have any other option than to revert?
>
> I believe this is an issue in Ben's userspace (he sent me logs
> privately) or some sort of kernel event bug. It basically looked like
> this: both devices were detected by ath10k and both started
> register_work. One of the devices loaded all the way while the other
> tried to load a few non-existing firmware files and it stopped. Few
> minutes later there was a hung task splat pointing to
> request_firmware() called from ath10k suggesting userspace didn't
> handle firmware request.
>
> The "offending" patch effectively removed 200ms from probe() in
> ath10k. This could've change timing on request_firmware() calls on
> Ben's system. Btw. the 200ms is back again now with
> 1a7fecb766c83dace747f42b25bbb544b00a0163 ("ath10k: reset chip before
> reading chip_id in probe").
>
> Marek tried running 2 qca988x on his laptop some time ago (with and
> without the extra timing) and didn't have any issues.

Good, thanks for checking this.

-- 
Kalle Valo

_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: More than one ath10k NIC will not load (bisected)
  2015-02-04  9:35                 ` Michal Kazior
  2015-02-04 12:02                   ` Kalle Valo
@ 2015-02-04 15:23                   ` Ben Greear
  2015-02-05  6:02                     ` Michal Kazior
  1 sibling, 1 reply; 16+ messages in thread
From: Ben Greear @ 2015-02-04 15:23 UTC (permalink / raw)
  To: Michal Kazior, Kalle Valo; +Cc: ath10k

On 02/04/2015 01:35 AM, Michal Kazior wrote:
> On 4 February 2015 at 10:07, Kalle Valo <kvalo@qca.qualcomm.com> wrote:
>> Ben Greear <greearb@candelatech.com> writes:
>>
>>>> Hmm.. This removes warm_reset in probe function but I fail to see how
>>>> this could end up not loading one of the NIC *silently*?
>>>>
>>>> Anyway there's a pending patch which adds the reset back:
>>>> https://github.com/kvalo/ath/commit/bdcd6f4e4ac5d2d2a56da4813f56655e6db0ee45
>>>> . You might want to try it and see if it helps.
>>>
>>> Reverting the patch made it work again for me.
>>>
>>> I don't understand that code well, but perhaps you are disabling
>>> a shared interrupt that silently stops the second NIC from
>>> being able to do it's thing?
>>>
>>> Do you have a PC with 2 NICs in it that you could try yourself?
>>>
>>> I can grab you the logs of a failure to boot later today.
>>
>> What should we do with this one? I didn't look at the details yet, but
>> do we have any other option than to revert?
>
> I believe this is an issue in Ben's userspace (he sent me logs
> privately) or some sort of kernel event bug. It basically looked like
> this: both devices were detected by ath10k and both started
> register_work. One of the devices loaded all the way while the other
> tried to load a few non-existing firmware files and it stopped. Few
> minutes later there was a hung task splat pointing to
> request_firmware() called from ath10k suggesting userspace didn't
> handle firmware request.
>
> The "offending" patch effectively removed 200ms from probe() in
> ath10k. This could've change timing on request_firmware() calls on
> Ben's system. Btw. the 200ms is back again now with
> 1a7fecb766c83dace747f42b25bbb544b00a0163 ("ath10k: reset chip before
> reading chip_id in probe").
>
> Marek tried running 2 qca988x on his laptop some time ago (with and
> without the extra timing) and didn't have any issues.

I can retry my system with stock Fedora 20 and see if it works there.

If not, then I think it still needs to be worked on...you agree?

Thanks,
Ben



-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com


_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: More than one ath10k NIC will not load (bisected)
  2015-02-04 15:23                   ` Ben Greear
@ 2015-02-05  6:02                     ` Michal Kazior
  2015-02-05  6:41                       ` Ben Greear
  0 siblings, 1 reply; 16+ messages in thread
From: Michal Kazior @ 2015-02-05  6:02 UTC (permalink / raw)
  To: Ben Greear; +Cc: Kalle Valo, ath10k

On 4 February 2015 at 16:23, Ben Greear <greearb@candelatech.com> wrote:
> On 02/04/2015 01:35 AM, Michal Kazior wrote:
>>
>> On 4 February 2015 at 10:07, Kalle Valo <kvalo@qca.qualcomm.com> wrote:
>>>
>>> Ben Greear <greearb@candelatech.com> writes:
>>>
>>>>> Hmm.. This removes warm_reset in probe function but I fail to see how
>>>>> this could end up not loading one of the NIC *silently*?
>>>>>
>>>>> Anyway there's a pending patch which adds the reset back:
>>>>>
>>>>> https://github.com/kvalo/ath/commit/bdcd6f4e4ac5d2d2a56da4813f56655e6db0ee45
>>>>> . You might want to try it and see if it helps.
>>>>
>>>>
>>>> Reverting the patch made it work again for me.
>>>>
>>>> I don't understand that code well, but perhaps you are disabling
>>>> a shared interrupt that silently stops the second NIC from
>>>> being able to do it's thing?
>>>>
>>>> Do you have a PC with 2 NICs in it that you could try yourself?
>>>>
>>>> I can grab you the logs of a failure to boot later today.
>>>
>>>
>>> What should we do with this one? I didn't look at the details yet, but
>>> do we have any other option than to revert?
>>
>>
>> I believe this is an issue in Ben's userspace (he sent me logs
>> privately) or some sort of kernel event bug. It basically looked like
>> this: both devices were detected by ath10k and both started
>> register_work. One of the devices loaded all the way while the other
>> tried to load a few non-existing firmware files and it stopped. Few
>> minutes later there was a hung task splat pointing to
>> request_firmware() called from ath10k suggesting userspace didn't
>> handle firmware request.
>>
>> The "offending" patch effectively removed 200ms from probe() in
>> ath10k. This could've change timing on request_firmware() calls on
>> Ben's system. Btw. the 200ms is back again now with
>> 1a7fecb766c83dace747f42b25bbb544b00a0163 ("ath10k: reset chip before
>> reading chip_id in probe").
>>
>> Marek tried running 2 qca988x on his laptop some time ago (with and
>> without the extra timing) and didn't have any issues.
>
>
> I can retry my system with stock Fedora 20 and see if it works there.
>
> If not, then I think it still needs to be worked on...you agree?

In the driver? I'd argue. I don't see how ath10k could make
request_firmware() hang, do you?


Michał

_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* Re: More than one ath10k NIC will not load (bisected)
  2015-02-05  6:02                     ` Michal Kazior
@ 2015-02-05  6:41                       ` Ben Greear
  0 siblings, 0 replies; 16+ messages in thread
From: Ben Greear @ 2015-02-05  6:41 UTC (permalink / raw)
  To: Michal Kazior; +Cc: Kalle Valo, ath10k

On 02/04/2015 10:02 PM, Michal Kazior wrote:
> On 4 February 2015 at 16:23, Ben Greear <greearb@candelatech.com> wrote:
>> On 02/04/2015 01:35 AM, Michal Kazior wrote:
>>>
>>> On 4 February 2015 at 10:07, Kalle Valo <kvalo@qca.qualcomm.com> wrote:
>>>>
>>>> Ben Greear <greearb@candelatech.com> writes:
>>>>
>>>>>> Hmm.. This removes warm_reset in probe function but I fail to see how
>>>>>> this could end up not loading one of the NIC *silently*?
>>>>>>
>>>>>> Anyway there's a pending patch which adds the reset back:
>>>>>>
>>>>>> https://github.com/kvalo/ath/commit/bdcd6f4e4ac5d2d2a56da4813f56655e6db0ee45
>>>>>> . You might want to try it and see if it helps.
>>>>>
>>>>>
>>>>> Reverting the patch made it work again for me.
>>>>>
>>>>> I don't understand that code well, but perhaps you are disabling
>>>>> a shared interrupt that silently stops the second NIC from
>>>>> being able to do it's thing?
>>>>>
>>>>> Do you have a PC with 2 NICs in it that you could try yourself?
>>>>>
>>>>> I can grab you the logs of a failure to boot later today.
>>>>
>>>>
>>>> What should we do with this one? I didn't look at the details yet, but
>>>> do we have any other option than to revert?
>>>
>>>
>>> I believe this is an issue in Ben's userspace (he sent me logs
>>> privately) or some sort of kernel event bug. It basically looked like
>>> this: both devices were detected by ath10k and both started
>>> register_work. One of the devices loaded all the way while the other
>>> tried to load a few non-existing firmware files and it stopped. Few
>>> minutes later there was a hung task splat pointing to
>>> request_firmware() called from ath10k suggesting userspace didn't
>>> handle firmware request.
>>>
>>> The "offending" patch effectively removed 200ms from probe() in
>>> ath10k. This could've change timing on request_firmware() calls on
>>> Ben's system. Btw. the 200ms is back again now with
>>> 1a7fecb766c83dace747f42b25bbb544b00a0163 ("ath10k: reset chip before
>>> reading chip_id in probe").
>>>
>>> Marek tried running 2 qca988x on his laptop some time ago (with and
>>> without the extra timing) and didn't have any issues.
>>
>>
>> I can retry my system with stock Fedora 20 and see if it works there.
>>
>> If not, then I think it still needs to be worked on...you agree?
>
> In the driver? I'd argue. I don't see how ath10k could make
> request_firmware() hang, do you?

I can lard up the kernel with lockdep and related things and see if
that offers some clues.  Will be a bit though, I'm pretty busy
with other things at the moment.

Thanks,
Ben

>
>
> Michał
>


-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com


_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

* AW: More than one ath10k NIC will not load (bisected)
  2015-01-24 18:53               ` Janusz Dziedzic
@ 2015-02-05 20:50                 ` Conrad Kostecki
  0 siblings, 0 replies; 16+ messages in thread
From: Conrad Kostecki @ 2015-02-05 20:50 UTC (permalink / raw)
  To: ath10k

> Do you have a PC with 2 NICs in it that you could try yourself?

I have two ath10k cards build in my soekris.
Those are working fine without any problems.
Both cards are loading fine.

[    2.731750] ath10k_pci 0000:07:00.0: PCI->APIC IRQ transform: INT A -> IRQ 17
[    3.037142] ath10k_pci 0000:07:00.0: irq 42 for MSI/MSI-X
[    3.037192] ath10k_pci 0000:07:00.0: pci irq msi interrupts 1 irq_mode 0 reset_mode 0
[    3.037564] ath10k_pci 0000:0c:00.0: PCI->APIC IRQ transform: INT A -> IRQ 18
[    3.340106] ath10k_pci 0000:0c:00.0: irq 43 for MSI/MSI-X
[    3.340145] ath10k_pci 0000:0c:00.0: pci irq msi interrupts 1 irq_mode 0 reset_mode 0
[    4.461521] ath10k_pci 0000:07:00.0: qca988x hw2.0 (0x4100016c, 0x043202ff) fw 10.2-00082-4-2 api 3 htt 2.1
[    4.469965] ath10k_pci 0000:07:00.0: debug 0 debugfs 0 tracing 0 dfs 1 testmode 0
[    5.070084] ath: EEPROM regdomain: 0x0
[    5.070086] ath: EEPROM indicates default country code should be used
[    5.070087] ath: doing EEPROM country->regdmn map search
[    5.070090] ath: country maps to regdmn code: 0x37
[    5.070092] ath: Country alpha2 being used: DE
[    5.070093] ath: Regpair used: 0x37
[    5.085625] ath10k_pci 0000:0c:00.0: qca988x hw2.0 (0x4100016c, 0x043202ff) fw 10.2-00082-4-2 api 3 htt 2.1
[    5.085631] ath10k_pci 0000:0c:00.0: debug 0 debugfs 0 tracing 0 dfs 1 testmode 0
[    5.680073] ath: EEPROM regdomain: 0x0
[    5.680078] ath: EEPROM indicates default country code should be used
[    5.680080] ath: doing EEPROM country->regdmn map search
[    5.680083] ath: country maps to regdmn code: 0x37
[    5.680085] ath: Country alpha2 being used: DE
[    5.680087] ath: Regpair used: 0x37

_______________________________________________
ath10k mailing list
ath10k@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/ath10k

^ permalink raw reply	[flat|nested] 16+ messages in thread

end of thread, other threads:[~2015-02-05 20:51 UTC | newest]

Thread overview: 16+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2015-01-20 21:53 More than one ath10k NIC in 3.19.0-rc5? Ben Greear
2015-01-21  2:13 ` Sebastian Gottschall
2015-01-21  2:49   ` Ben Greear
2015-01-21 19:01     ` Ben Greear
2015-01-21 19:47       ` Ben Greear
2015-01-22 21:08         ` More than one ath10k NIC will not load (bisected) Ben Greear
2015-01-23  6:43           ` Michal Kazior
2015-01-23 15:54             ` Ben Greear
2015-01-24 18:53               ` Janusz Dziedzic
2015-02-05 20:50                 ` AW: " Conrad Kostecki
2015-02-04  9:07               ` Kalle Valo
2015-02-04  9:35                 ` Michal Kazior
2015-02-04 12:02                   ` Kalle Valo
2015-02-04 15:23                   ` Ben Greear
2015-02-05  6:02                     ` Michal Kazior
2015-02-05  6:41                       ` Ben Greear

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.