All of lore.kernel.org
 help / color / mirror / Atom feed
* [ath9k-devel] WiFi Failure with AR9280
@ 2013-03-24 16:26 Pannirselvam Kanagaratnam
  2013-03-24 16:39 ` Felix Fietkau
  0 siblings, 1 reply; 10+ messages in thread
From: Pannirselvam Kanagaratnam @ 2013-03-24 16:26 UTC (permalink / raw)
  To: ath9k-devel

Hello,

My WiFi connection drops while running the AR9280 on a Freescale 
MPC8315E platform in AP mode with hostapd. I get the following error 
within 15 seconds and the WiFi connection drops. However, I do not 
observe any issues when using the AR9382. Can you let me know what could 
be causing this and if there is a workaround for this. I am using 
Compat-Wireless-3.5.4.1.

ath: phy0: DMA failed to stop in 10 ms AR_CR=0x00000024 
AR_DIAG_SW=0x42000020 DMADBG_7=0x000062c0
ath: phy0: Could not stop RX, we could be confusing the DMA engine when 
we start RX up
------------[ cut here ]------------
Badness at 
/compat-wireless-3.5.4-1/drivers/net/wireless/ath/ath9k/recv.c:531
NIP: c9adabe0 LR: c9adabd4 CTR: c01b8068
REGS: c7b47e30 TRAP: 0700   Not tainted  (2.6.35)
MSR: 00029032 <EE,ME,CE,IR,DR>  CR: 22002022  XER: 20000000
TASK = c783db20[916] 'phy0' THREAD: c7b46000
GPR00: 00000001 c7b47ee0 c783db20 0000004c 00005554 ffffffff c01b55d8 
00004000
GPR08: 00000000 c9af0000 00005554 c03f4d58 22002082 100b1254 07ff9000 
00000000
GPR16: 100a9c50 00000000 00000000 00000000 00000000 00000000 00000000 
100a9048
GPR24: 00000000 c7b4a6cc 00000000 00000001 00000001 00000000 c7b49720 
c7b4c000
Call Trace:
[c7b47ee0] [c9adabd4] 0xc9adabd4 (unreliable)
[c7b47f10] [c9ad547c] 0xc9ad547c
[c7b47f30] [c9ad5eec] 0xc9ad5eec
[c7b47f60] [c9ad64e0] 0xc9ad64e0
[c7b47f80] [c003939c] 0xc003939c
[c7b47fc0] [c003d328] 0xc003d328
[c7b47ff0] [c0010f08] 0xc0010f08
Instruction dump:
4082ffac 3c60c9af 3ca0c9af 3863b45c 38a5b590 419eff98 809e0888 3884001c
4beeb431 3d20c9af 8809d59c 68000001 <0f000000> 2f800000 419eff74 38000001
ath: phy0: Failed to stop TX DMA, queues=0x004!
ath: phy0: DMA failed to stop in 10 ms AR_CR=0x00000024 
AR_DIAG_SW=0x42000020 DMADBG_7=0x00006040
ath: phy0: Could not stop RX, we could be confusing the DMA engine when 
we start RX up
ath: phy0: Failed to stop TX DMA, queues=0x004!
ath: phy0: DMA failed to stop in 10 ms AR_CR=0x00000024 
AR_DIAG_SW=0x42000020 DMADBG_7=0x00006040
ath: phy0: Could not stop RX, we could be confusing the DMA engine when 
we start RX up


Regards,

Pannir

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [ath9k-devel] WiFi Failure with AR9280
  2013-03-24 16:26 [ath9k-devel] WiFi Failure with AR9280 Pannirselvam Kanagaratnam
@ 2013-03-24 16:39 ` Felix Fietkau
  2013-03-27  3:27   ` Pannirselvam Kanagaratnam
  0 siblings, 1 reply; 10+ messages in thread
From: Felix Fietkau @ 2013-03-24 16:39 UTC (permalink / raw)
  To: ath9k-devel

On 2013-03-24 5:26 PM, Pannirselvam Kanagaratnam wrote:
> Hello,
> 
> My WiFi connection drops while running the AR9280 on a Freescale 
> MPC8315E platform in AP mode with hostapd. I get the following error 
> within 15 seconds and the WiFi connection drops. However, I do not 
> observe any issues when using the AR9382. Can you let me know what could 
> be causing this and if there is a workaround for this. I am using 
> Compat-Wireless-3.5.4.1.
I suggest updating to a more recent version of compat-wireless - 3.5.4.1
is quite old.
If a similar failure still occurs there, make sure you turn on
CONFIG_KALLSYMS so that the trace is not just a long list of meaningless
numbers.

- Felix

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [ath9k-devel] WiFi Failure with AR9280
  2013-03-24 16:39 ` Felix Fietkau
@ 2013-03-27  3:27   ` Pannirselvam Kanagaratnam
  2013-04-15  2:07     ` Pannirselvam Kanagaratnam
  0 siblings, 1 reply; 10+ messages in thread
From: Pannirselvam Kanagaratnam @ 2013-03-27  3:27 UTC (permalink / raw)
  To: ath9k-devel

On 25/3/2013 12:39 AM, Felix Fietkau wrote:
> On 2013-03-24 5:26 PM, Pannirselvam Kanagaratnam wrote:
>> Hello,
>>
>> My WiFi connection drops while running the AR9280 on a Freescale
>> MPC8315E platform in AP mode with hostapd. I get the following error
>> within 15 seconds and the WiFi connection drops. However, I do not
>> observe any issues when using the AR9382. Can you let me know what could
>> be causing this and if there is a workaround for this. I am using
>> Compat-Wireless-3.5.4.1.
> I suggest updating to a more recent version of compat-wireless - 3.5.4.1
> is quite old.
> If a similar failure still occurs there, make sure you turn on
> CONFIG_KALLSYMS so that the trace is not just a long list of meaningless
> numbers.
>
> - Felix
>
Here are my logs with Compat 3.9-rc2 and CONFIG_KALLSYMS enabled.

ath: phy0: Failed to stop TX DMA, queues=0x004!
ath: phy0: DMA failed to stop in 10 ms AR_CR=0x00000024 
AR_DIAG_SW=0x42000020 DMADBG_7=0x000062c0
ath: phy0: Could not stop RX, we could be confusing the DMA engine when 
we start RX up
------------[ cut here ]------------
Badness at 
/build/compat-wireless-3.9-rc2-2-su/drivers/net/wireless/ath/ath9k/recv.c:487
NIP: c9b72eec LR: c9b72ee0 CTR: c01bc900
REGS: c6c63e30 TRAP: 0700   Not tainted  (2.6.35)
MSR: 00029032 <EE,ME,CE,IR,DR>  CR: 22002022  XER: 20000000
TASK = c7859360[930] 'phy0' THREAD: c6c62000
GPR00: 00000001 c6c63ee0 c7859360 0000004c 00005665 ffffffff c01b9e70 
00004000
GPR08: 00000000 c9b90000 00005665 c046ad78 22002082 00000000 07ff9000 
00000000
GPR16: 00000000 00000000 00000000 00000000 00000000 00000000 00000000 
0000000a
GPR24: 00000000 c6c66b4c c6c68030 00000000 00000000 00000000 c6c68010 
c6c65740
NIP [c9b72eec] ath_stoprecv+0x13c/0x154 [ath9k]
LR [c9b72ee0] ath_stoprecv+0x130/0x154 [ath9k]
Call Trace:
[c6c63ee0] [c9b72ee0] ath_stoprecv+0x130/0x154 [ath9k] (unreliable)
[c6c63f10] [c9b6ec20] ath_prepare_reset+0x54/0x90 [ath9k]
[c6c63f30] [c9b6ee90] ath_reset_internal+0x6c/0x220 [ath9k]
[c6c63f60] [c9b6f948] ath_reset+0x28/0x9c [ath9k]
[c6c63f80] [c0039398] worker_thread+0x108/0x1b0
[c6c63fc0] [c003d338] kthread+0x78/0x7c
[c6c63ff0] [c0010f08] kernel_thread+0x4c/0x68
Instruction dump:
2f800000 419eff74 809f0888 3c60c9b8 3ca0c9b8 38637194 38a572c8 38840020
4bec2125 3d20c9b9 88099475 68000001 <0f000000> 2f800000 419eff40 38000001
ath: phy0: Failed to stop TX DMA, queues=0x004!
ath: phy0: DMA failed to stop in 10 ms AR_CR=0x00000024 
AR_DIAG_SW=0x42000020 DMADBG_7=0x00006040
ath: phy0: Could not stop RX, we could be confusing the DMA engine when 
we start RX up

Regards,

Pannir

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [ath9k-devel] WiFi Failure with AR9280
  2013-03-27  3:27   ` Pannirselvam Kanagaratnam
@ 2013-04-15  2:07     ` Pannirselvam Kanagaratnam
  2013-04-15  2:31       ` Adrian Chadd
  0 siblings, 1 reply; 10+ messages in thread
From: Pannirselvam Kanagaratnam @ 2013-04-15  2:07 UTC (permalink / raw)
  To: ath9k-devel

I have now tried 4 different chipsets on my board (AR9280, AR9287, 
AR9380 and AR9382). Only AR9280 is failing. Debugfs interrupt stats is 
as follows when using the AR9280:

SYNC_CAUSE stats:
              Sync-All:         48
               RTC-IRQ:          0
               MAC-IRQ:          0
EEPROM-Illegal-Access:          0
           APB-Timeout:          0
     PCI-Mode-Conflict:          0
           HOST1-Fatal:          0
            HOST1-Perr:          0
        TRCV-FIFO-Perr:          0
           RADM-CPL-EP:          0
   RADM-CPL-DLLP-Abort:         11
    RADM-CPL-TLP-Abort:          0
     RADM-CPL-ECRC-Err:          0
      RADM-CPL-Timeout:          2
     Local-Bus-Timeout:         35
             PM-Access:          0
             MAC-Awake:          0
            MAC-Asleep:          0
      MAC-Sleep-Access:          0

I have managed to get access to an Agilent Protocol Exerciser & Analyzer 
(E2960A). Any tips on what I should be zooming into?

Regards,

Pannir

On 27/3/2013 11:27 AM, Pannirselvam Kanagaratnam wrote:
> On 25/3/2013 12:39 AM, Felix Fietkau wrote:
>> On 2013-03-24 5:26 PM, Pannirselvam Kanagaratnam wrote:
>>> Hello,
>>>
>>> My WiFi connection drops while running the AR9280 on a Freescale
>>> MPC8315E platform in AP mode with hostapd. I get the following error
>>> within 15 seconds and the WiFi connection drops. However, I do not
>>> observe any issues when using the AR9382. Can you let me know what 
>>> could
>>> be causing this and if there is a workaround for this. I am using
>>> Compat-Wireless-3.5.4.1.
>> I suggest updating to a more recent version of compat-wireless - 3.5.4.1
>> is quite old.
>> If a similar failure still occurs there, make sure you turn on
>> CONFIG_KALLSYMS so that the trace is not just a long list of meaningless
>> numbers.
>>
>> - Felix
>>
> Here are my logs with Compat 3.9-rc2 and CONFIG_KALLSYMS enabled.
>
> ath: phy0: Failed to stop TX DMA, queues=0x004!
> ath: phy0: DMA failed to stop in 10 ms AR_CR=0x00000024 
> AR_DIAG_SW=0x42000020 DMADBG_7=0x000062c0
> ath: phy0: Could not stop RX, we could be confusing the DMA engine 
> when we start RX up
> ------------[ cut here ]------------
> Badness at 
> /build/compat-wireless-3.9-rc2-2-su/drivers/net/wireless/ath/ath9k/recv.c:487
> NIP: c9b72eec LR: c9b72ee0 CTR: c01bc900
> REGS: c6c63e30 TRAP: 0700   Not tainted  (2.6.35)
> MSR: 00029032 <EE,ME,CE,IR,DR>  CR: 22002022  XER: 20000000
> TASK = c7859360[930] 'phy0' THREAD: c6c62000
> GPR00: 00000001 c6c63ee0 c7859360 0000004c 00005665 ffffffff c01b9e70 
> 00004000
> GPR08: 00000000 c9b90000 00005665 c046ad78 22002082 00000000 07ff9000 
> 00000000
> GPR16: 00000000 00000000 00000000 00000000 00000000 00000000 00000000 
> 0000000a
> GPR24: 00000000 c6c66b4c c6c68030 00000000 00000000 00000000 c6c68010 
> c6c65740
> NIP [c9b72eec] ath_stoprecv+0x13c/0x154 [ath9k]
> LR [c9b72ee0] ath_stoprecv+0x130/0x154 [ath9k]
> Call Trace:
> [c6c63ee0] [c9b72ee0] ath_stoprecv+0x130/0x154 [ath9k] (unreliable)
> [c6c63f10] [c9b6ec20] ath_prepare_reset+0x54/0x90 [ath9k]
> [c6c63f30] [c9b6ee90] ath_reset_internal+0x6c/0x220 [ath9k]
> [c6c63f60] [c9b6f948] ath_reset+0x28/0x9c [ath9k]
> [c6c63f80] [c0039398] worker_thread+0x108/0x1b0
> [c6c63fc0] [c003d338] kthread+0x78/0x7c
> [c6c63ff0] [c0010f08] kernel_thread+0x4c/0x68
> Instruction dump:
> 2f800000 419eff74 809f0888 3c60c9b8 3ca0c9b8 38637194 38a572c8 38840020
> 4bec2125 3d20c9b9 88099475 68000001 <0f000000> 2f800000 419eff40 38000001
> ath: phy0: Failed to stop TX DMA, queues=0x004!
> ath: phy0: DMA failed to stop in 10 ms AR_CR=0x00000024 
> AR_DIAG_SW=0x42000020 DMADBG_7=0x00006040
> ath: phy0: Could not stop RX, we could be confusing the DMA engine 
> when we start RX up
>
> Regards,
>
> Pannir

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [ath9k-devel] WiFi Failure with AR9280
  2013-04-15  2:07     ` Pannirselvam Kanagaratnam
@ 2013-04-15  2:31       ` Adrian Chadd
  2013-04-18 10:16         ` Pannirselvam Kanagaratnam
  0 siblings, 1 reply; 10+ messages in thread
From: Adrian Chadd @ 2013-04-15  2:31 UTC (permalink / raw)
  To: ath9k-devel

well, those radm errors and local bus errors are actually pcie errors.

So look at PCIe errors with your bus analyser. :)



adrian

On 14 April 2013 19:07, Pannirselvam Kanagaratnam <pannir@xsmail.com> wrote:
> I have now tried 4 different chipsets on my board (AR9280, AR9287,
> AR9380 and AR9382). Only AR9280 is failing. Debugfs interrupt stats is
> as follows when using the AR9280:
>
> SYNC_CAUSE stats:
>               Sync-All:         48
>                RTC-IRQ:          0
>                MAC-IRQ:          0
> EEPROM-Illegal-Access:          0
>            APB-Timeout:          0
>      PCI-Mode-Conflict:          0
>            HOST1-Fatal:          0
>             HOST1-Perr:          0
>         TRCV-FIFO-Perr:          0
>            RADM-CPL-EP:          0
>    RADM-CPL-DLLP-Abort:         11
>     RADM-CPL-TLP-Abort:          0
>      RADM-CPL-ECRC-Err:          0
>       RADM-CPL-Timeout:          2
>      Local-Bus-Timeout:         35
>              PM-Access:          0
>              MAC-Awake:          0
>             MAC-Asleep:          0
>       MAC-Sleep-Access:          0
>
> I have managed to get access to an Agilent Protocol Exerciser & Analyzer
> (E2960A). Any tips on what I should be zooming into?
>
> Regards,
>
> Pannir
>
> On 27/3/2013 11:27 AM, Pannirselvam Kanagaratnam wrote:
>> On 25/3/2013 12:39 AM, Felix Fietkau wrote:
>>> On 2013-03-24 5:26 PM, Pannirselvam Kanagaratnam wrote:
>>>> Hello,
>>>>
>>>> My WiFi connection drops while running the AR9280 on a Freescale
>>>> MPC8315E platform in AP mode with hostapd. I get the following error
>>>> within 15 seconds and the WiFi connection drops. However, I do not
>>>> observe any issues when using the AR9382. Can you let me know what
>>>> could
>>>> be causing this and if there is a workaround for this. I am using
>>>> Compat-Wireless-3.5.4.1.
>>> I suggest updating to a more recent version of compat-wireless - 3.5.4.1
>>> is quite old.
>>> If a similar failure still occurs there, make sure you turn on
>>> CONFIG_KALLSYMS so that the trace is not just a long list of meaningless
>>> numbers.
>>>
>>> - Felix
>>>
>> Here are my logs with Compat 3.9-rc2 and CONFIG_KALLSYMS enabled.
>>
>> ath: phy0: Failed to stop TX DMA, queues=0x004!
>> ath: phy0: DMA failed to stop in 10 ms AR_CR=0x00000024
>> AR_DIAG_SW=0x42000020 DMADBG_7=0x000062c0
>> ath: phy0: Could not stop RX, we could be confusing the DMA engine
>> when we start RX up
>> ------------[ cut here ]------------
>> Badness at
>> /build/compat-wireless-3.9-rc2-2-su/drivers/net/wireless/ath/ath9k/recv.c:487
>> NIP: c9b72eec LR: c9b72ee0 CTR: c01bc900
>> REGS: c6c63e30 TRAP: 0700   Not tainted  (2.6.35)
>> MSR: 00029032 <EE,ME,CE,IR,DR>  CR: 22002022  XER: 20000000
>> TASK = c7859360[930] 'phy0' THREAD: c6c62000
>> GPR00: 00000001 c6c63ee0 c7859360 0000004c 00005665 ffffffff c01b9e70
>> 00004000
>> GPR08: 00000000 c9b90000 00005665 c046ad78 22002082 00000000 07ff9000
>> 00000000
>> GPR16: 00000000 00000000 00000000 00000000 00000000 00000000 00000000
>> 0000000a
>> GPR24: 00000000 c6c66b4c c6c68030 00000000 00000000 00000000 c6c68010
>> c6c65740
>> NIP [c9b72eec] ath_stoprecv+0x13c/0x154 [ath9k]
>> LR [c9b72ee0] ath_stoprecv+0x130/0x154 [ath9k]
>> Call Trace:
>> [c6c63ee0] [c9b72ee0] ath_stoprecv+0x130/0x154 [ath9k] (unreliable)
>> [c6c63f10] [c9b6ec20] ath_prepare_reset+0x54/0x90 [ath9k]
>> [c6c63f30] [c9b6ee90] ath_reset_internal+0x6c/0x220 [ath9k]
>> [c6c63f60] [c9b6f948] ath_reset+0x28/0x9c [ath9k]
>> [c6c63f80] [c0039398] worker_thread+0x108/0x1b0
>> [c6c63fc0] [c003d338] kthread+0x78/0x7c
>> [c6c63ff0] [c0010f08] kernel_thread+0x4c/0x68
>> Instruction dump:
>> 2f800000 419eff74 809f0888 3c60c9b8 3ca0c9b8 38637194 38a572c8 38840020
>> 4bec2125 3d20c9b9 88099475 68000001 <0f000000> 2f800000 419eff40 38000001
>> ath: phy0: Failed to stop TX DMA, queues=0x004!
>> ath: phy0: DMA failed to stop in 10 ms AR_CR=0x00000024
>> AR_DIAG_SW=0x42000020 DMADBG_7=0x00006040
>> ath: phy0: Could not stop RX, we could be confusing the DMA engine
>> when we start RX up
>>
>> Regards,
>>
>> Pannir
>
> _______________________________________________
> ath9k-devel mailing list
> ath9k-devel at lists.ath9k.org
> https://lists.ath9k.org/mailman/listinfo/ath9k-devel

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [ath9k-devel] WiFi Failure with AR9280
  2013-04-15  2:31       ` Adrian Chadd
@ 2013-04-18 10:16         ` Pannirselvam Kanagaratnam
  2013-04-19  6:36           ` Adrian Chadd
  0 siblings, 1 reply; 10+ messages in thread
From: Pannirselvam Kanagaratnam @ 2013-04-18 10:16 UTC (permalink / raw)
  To: ath9k-devel

An HTML attachment was scrubbed...
URL: http://lists.ath9k.org/pipermail/ath9k-devel/attachments/20130418/90640813/attachment.htm 

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [ath9k-devel] WiFi Failure with AR9280
  2013-04-18 10:16         ` Pannirselvam Kanagaratnam
@ 2013-04-19  6:36           ` Adrian Chadd
  2013-04-22  7:41             ` Pannirselvam Kanagaratnam
  0 siblings, 1 reply; 10+ messages in thread
From: Adrian Chadd @ 2013-04-19  6:36 UTC (permalink / raw)
  To: ath9k-devel

Hi,

Thanks for digging into this!

Unfortunately going digging through the PCIe host interface code will
take too much time and I just don't have the time to spare at the
moment.

There _may_ be a workaround? But it's also equally likely that it was
just fixed in later revisions of the chip so to be more tolerant with
known buggy PCIe implementations.

I think the host interface glue has a transaction timeout and
transaction retry counter. I'm not sure what the details are though
and whether or not it'll help here.



Adrian

On 18 April 2013 03:16, Pannirselvam Kanagaratnam <pannir@xsmail.com> wrote:
> Hello,
>
> Upon analyzing the Protocol Analyzer I shared my finding with the CPU
> chipset support. They said it is an errata on their CPU. It reads as
> follows:
>
> **********************************
>
> PCI Express Packets may be transmitted with excess data on x1 link
> Description: An internal error in the PCI Express controller may cause 8
> bytes of packet header or data
> payload to be duplicated on transmit. Depending on the location of the
> duplicate bytes, this
> may cause a malformed packet error or CRC error detected in the link
> partner.
>
> Impact: A PCI Express link partner may see excess correctable errors or
> malformed packets in x1
> links.
>
> Note that any PCI Express specification-compliant recipient at the ultimate
> end of the
> transaction may only recognize the erroneous packet as a Bad TLP and flag
> the error as
> correctable due to the LCRC error. It should respond with a NAK and then
> wait for the device
> to re-try the packet. The ultimate recipient should not recognize the
> erroneous packet as a
> Malformed TLP, since before reaching the transaction layer the Bad TLP
> should have been
> discarded by the ultimate recipient?s link layer.
>
> **********************************
>
> I have noticed that although the error occurs, the AR9287, AR9380 and AR9382
> are able to handle this. However, the AR9280 link fails. Is there something
> that can be done on the driver level to manage this?
>
> You may take a look at the analyzer logs at:
> http://www.yousendit.com/download/UVJqaUNITWNtUUZBSXNUQw
>
> You will need the Analyzer software to read it:
> ftp://ftp.agilent.com/cos/outbound/PCIe_Gen2/Analyzer/6.13%20for%20Gen1%20anlyzer/
>
> The record in question is # 3957101. It is Completion with Data for the
> Memory Read request by the EP at record #3957098. This packet was NAK by the
> EP and so there was a replay of this packet. The reason for NAK was that
> some extra bytes were inserted and transmitted by RC.
>
> Regards,
>
> Pannir
>
>
> On 15/4/2013 10:31 AM, Adrian Chadd wrote:
>
> well, those radm errors and local bus errors are actually pcie errors.
>
> So look at PCIe errors with your bus analyser. :)
>
>
>
> adrian
>
> On 14 April 2013 19:07, Pannirselvam Kanagaratnam <pannir@xsmail.com> wrote:
>
> I have now tried 4 different chipsets on my board (AR9280, AR9287,
> AR9380 and AR9382). Only AR9280 is failing. Debugfs interrupt stats is
> as follows when using the AR9280:
>
> SYNC_CAUSE stats:
>               Sync-All:         48
>                RTC-IRQ:          0
>                MAC-IRQ:          0
> EEPROM-Illegal-Access:          0
>            APB-Timeout:          0
>      PCI-Mode-Conflict:          0
>            HOST1-Fatal:          0
>             HOST1-Perr:          0
>         TRCV-FIFO-Perr:          0
>            RADM-CPL-EP:          0
>    RADM-CPL-DLLP-Abort:         11
>     RADM-CPL-TLP-Abort:          0
>      RADM-CPL-ECRC-Err:          0
>       RADM-CPL-Timeout:          2
>      Local-Bus-Timeout:         35
>              PM-Access:          0
>              MAC-Awake:          0
>             MAC-Asleep:          0
>       MAC-Sleep-Access:          0
>
> I have managed to get access to an Agilent Protocol Exerciser & Analyzer
> (E2960A). Any tips on what I should be zooming into?
>
> Regards,
>
> Pannir
>
> On 27/3/2013 11:27 AM, Pannirselvam Kanagaratnam wrote:
>
> On 25/3/2013 12:39 AM, Felix Fietkau wrote:
>
> On 2013-03-24 5:26 PM, Pannirselvam Kanagaratnam wrote:
>
> Hello,
>
> My WiFi connection drops while running the AR9280 on a Freescale
> MPC8315E platform in AP mode with hostapd. I get the following error
> within 15 seconds and the WiFi connection drops. However, I do not
> observe any issues when using the AR9382. Can you let me know what
> could
> be causing this and if there is a workaround for this. I am using
> Compat-Wireless-3.5.4.1.
>
> I suggest updating to a more recent version of compat-wireless - 3.5.4.1
> is quite old.
> If a similar failure still occurs there, make sure you turn on
> CONFIG_KALLSYMS so that the trace is not just a long list of meaningless
> numbers.
>
> - Felix
>
> Here are my logs with Compat 3.9-rc2 and CONFIG_KALLSYMS enabled.
>
> ath: phy0: Failed to stop TX DMA, queues=0x004!
> ath: phy0: DMA failed to stop in 10 ms AR_CR=0x00000024
> AR_DIAG_SW=0x42000020 DMADBG_7=0x000062c0
> ath: phy0: Could not stop RX, we could be confusing the DMA engine
> when we start RX up
> ------------[ cut here ]------------
> Badness at
> /build/compat-wireless-3.9-rc2-2-su/drivers/net/wireless/ath/ath9k/recv.c:487
> NIP: c9b72eec LR: c9b72ee0 CTR: c01bc900
> REGS: c6c63e30 TRAP: 0700   Not tainted  (2.6.35)
> MSR: 00029032 <EE,ME,CE,IR,DR>  CR: 22002022  XER: 20000000
> TASK = c7859360[930] 'phy0' THREAD: c6c62000
> GPR00: 00000001 c6c63ee0 c7859360 0000004c 00005665 ffffffff c01b9e70
> 00004000
> GPR08: 00000000 c9b90000 00005665 c046ad78 22002082 00000000 07ff9000
> 00000000
> GPR16: 00000000 00000000 00000000 00000000 00000000 00000000 00000000
> 0000000a
> GPR24: 00000000 c6c66b4c c6c68030 00000000 00000000 00000000 c6c68010
> c6c65740
> NIP [c9b72eec] ath_stoprecv+0x13c/0x154 [ath9k]
> LR [c9b72ee0] ath_stoprecv+0x130/0x154 [ath9k]
> Call Trace:
> [c6c63ee0] [c9b72ee0] ath_stoprecv+0x130/0x154 [ath9k] (unreliable)
> [c6c63f10] [c9b6ec20] ath_prepare_reset+0x54/0x90 [ath9k]
> [c6c63f30] [c9b6ee90] ath_reset_internal+0x6c/0x220 [ath9k]
> [c6c63f60] [c9b6f948] ath_reset+0x28/0x9c [ath9k]
> [c6c63f80] [c0039398] worker_thread+0x108/0x1b0
> [c6c63fc0] [c003d338] kthread+0x78/0x7c
> [c6c63ff0] [c0010f08] kernel_thread+0x4c/0x68
> Instruction dump:
> 2f800000 419eff74 809f0888 3c60c9b8 3ca0c9b8 38637194 38a572c8 38840020
> 4bec2125 3d20c9b9 88099475 68000001 <0f000000> 2f800000 419eff40 38000001
> ath: phy0: Failed to stop TX DMA, queues=0x004!
> ath: phy0: DMA failed to stop in 10 ms AR_CR=0x00000024
> AR_DIAG_SW=0x42000020 DMADBG_7=0x00006040
> ath: phy0: Could not stop RX, we could be confusing the DMA engine
> when we start RX up
>
> Regards,
>
> Pannir
>
> _______________________________________________
> ath9k-devel mailing list
> ath9k-devel at lists.ath9k.org
> https://lists.ath9k.org/mailman/listinfo/ath9k-devel
>
>

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [ath9k-devel] WiFi Failure with AR9280
  2013-04-19  6:36           ` Adrian Chadd
@ 2013-04-22  7:41             ` Pannirselvam Kanagaratnam
  2013-04-22  8:38               ` Adrian Chadd
  0 siblings, 1 reply; 10+ messages in thread
From: Pannirselvam Kanagaratnam @ 2013-04-22  7:41 UTC (permalink / raw)
  To: ath9k-devel

Hello,

I am using the latest revision and the chip vendor has confirmed that 
the errata is applicable to the latest revision.

I analyzed the behaviour of the AR9287 and AR9380 and found the 
following when encountering the "Completion with Data" packet with LCRC 
errors:

1) AR9280: Completion Timeout bit: 1. Bad TLP bit: 1.
2) AR9287: Completion Timeout bit: 0. Bad TLP bit: 1.
3) AR9380: Completion Timeout bit: 0. Bad TLP bit: 0.

I can get > 90 Mbps on both the AR9287 and AR9380. Is there a way to 
disable the completion timeout on the AR9280? Can you point me to the 
code which handles the transaction timeout and transaction retries? Is 
this the PCIe host interface code you were referring to?

Thank you.

Pannir

On 19/4/2013 2:36 PM, Adrian Chadd wrote:
> Hi,
>
> Thanks for digging into this!
>
> Unfortunately going digging through the PCIe host interface code will
> take too much time and I just don't have the time to spare at the
> moment.
>
> There _may_ be a workaround? But it's also equally likely that it was
> just fixed in later revisions of the chip so to be more tolerant with
> known buggy PCIe implementations.
>
> I think the host interface glue has a transaction timeout and
> transaction retry counter. I'm not sure what the details are though
> and whether or not it'll help here.
>
>
>
> Adrian
>
> On 18 April 2013 03:16, Pannirselvam Kanagaratnam <pannir@xsmail.com> wrote:
>> Hello,
>>
>> Upon analyzing the Protocol Analyzer I shared my finding with the CPU
>> chipset support. They said it is an errata on their CPU. It reads as
>> follows:
>>
>> **********************************
>>
>> PCI Express Packets may be transmitted with excess data on x1 link
>> Description: An internal error in the PCI Express controller may cause 8
>> bytes of packet header or data
>> payload to be duplicated on transmit. Depending on the location of the
>> duplicate bytes, this
>> may cause a malformed packet error or CRC error detected in the link
>> partner.
>>
>> Impact: A PCI Express link partner may see excess correctable errors or
>> malformed packets in x1
>> links.
>>
>> Note that any PCI Express specification-compliant recipient at the ultimate
>> end of the
>> transaction may only recognize the erroneous packet as a Bad TLP and flag
>> the error as
>> correctable due to the LCRC error. It should respond with a NAK and then
>> wait for the device
>> to re-try the packet. The ultimate recipient should not recognize the
>> erroneous packet as a
>> Malformed TLP, since before reaching the transaction layer the Bad TLP
>> should have been
>> discarded by the ultimate recipient?s link layer.
>>
>> **********************************
>>
>> I have noticed that although the error occurs, the AR9287, AR9380 and AR9382
>> are able to handle this. However, the AR9280 link fails. Is there something
>> that can be done on the driver level to manage this?
>>
>> You may take a look at the analyzer logs at:
>> http://www.yousendit.com/download/UVJqaUNITWNtUUZBSXNUQw
>>
>> You will need the Analyzer software to read it:
>> ftp://ftp.agilent.com/cos/outbound/PCIe_Gen2/Analyzer/6.13%20for%20Gen1%20anlyzer/
>>
>> The record in question is # 3957101. It is Completion with Data for the
>> Memory Read request by the EP at record #3957098. This packet was NAK by the
>> EP and so there was a replay of this packet. The reason for NAK was that
>> some extra bytes were inserted and transmitted by RC.
>>
>> Regards,
>>
>> Pannir
>>

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [ath9k-devel] WiFi Failure with AR9280
  2013-04-22  7:41             ` Pannirselvam Kanagaratnam
@ 2013-04-22  8:38               ` Adrian Chadd
  2013-04-22 14:58                 ` Pannirselvam Kanagaratnam
  0 siblings, 1 reply; 10+ messages in thread
From: Adrian Chadd @ 2013-04-22  8:38 UTC (permalink / raw)
  To: ath9k-devel

Hi,

Please wrap all of this up in a bug report at bugzilla.kernel.org .

I'll then punt it around internally to see if someone can dig up
what's going on.

Thanks,



Adrian

On 22 April 2013 00:41, Pannirselvam Kanagaratnam <pannir@xsmail.com> wrote:
> Hello,
>
> I am using the latest revision and the chip vendor has confirmed that the
> errata is applicable to the latest revision.
>
> I analyzed the behaviour of the AR9287 and AR9380 and found the following
> when encountering the "Completion with Data" packet with LCRC errors:
>
> 1) AR9280: Completion Timeout bit: 1. Bad TLP bit: 1.
> 2) AR9287: Completion Timeout bit: 0. Bad TLP bit: 1.
> 3) AR9380: Completion Timeout bit: 0. Bad TLP bit: 0.
>
> I can get > 90 Mbps on both the AR9287 and AR9380. Is there a way to disable
> the completion timeout on the AR9280? Can you point me to the code which
> handles the transaction timeout and transaction retries? Is this the PCIe
> host interface code you were referring to?
>
> Thank you.
>
> Pannir
>
>
> On 19/4/2013 2:36 PM, Adrian Chadd wrote:
>>
>> Hi,
>>
>> Thanks for digging into this!
>>
>> Unfortunately going digging through the PCIe host interface code will
>> take too much time and I just don't have the time to spare at the
>> moment.
>>
>> There _may_ be a workaround? But it's also equally likely that it was
>> just fixed in later revisions of the chip so to be more tolerant with
>> known buggy PCIe implementations.
>>
>> I think the host interface glue has a transaction timeout and
>> transaction retry counter. I'm not sure what the details are though
>> and whether or not it'll help here.
>>
>>
>>
>> Adrian
>>
>> On 18 April 2013 03:16, Pannirselvam Kanagaratnam <pannir@xsmail.com>
>> wrote:
>>>
>>> Hello,
>>>
>>> Upon analyzing the Protocol Analyzer I shared my finding with the CPU
>>> chipset support. They said it is an errata on their CPU. It reads as
>>> follows:
>>>
>>> **********************************
>>>
>>> PCI Express Packets may be transmitted with excess data on x1 link
>>> Description: An internal error in the PCI Express controller may cause 8
>>> bytes of packet header or data
>>> payload to be duplicated on transmit. Depending on the location of the
>>> duplicate bytes, this
>>> may cause a malformed packet error or CRC error detected in the link
>>> partner.
>>>
>>> Impact: A PCI Express link partner may see excess correctable errors or
>>> malformed packets in x1
>>> links.
>>>
>>> Note that any PCI Express specification-compliant recipient at the
>>> ultimate
>>> end of the
>>> transaction may only recognize the erroneous packet as a Bad TLP and flag
>>> the error as
>>> correctable due to the LCRC error. It should respond with a NAK and then
>>> wait for the device
>>> to re-try the packet. The ultimate recipient should not recognize the
>>> erroneous packet as a
>>> Malformed TLP, since before reaching the transaction layer the Bad TLP
>>> should have been
>>> discarded by the ultimate recipient?s link layer.
>>>
>>> **********************************
>>>
>>> I have noticed that although the error occurs, the AR9287, AR9380 and
>>> AR9382
>>> are able to handle this. However, the AR9280 link fails. Is there
>>> something
>>> that can be done on the driver level to manage this?
>>>
>>> You may take a look at the analyzer logs at:
>>> http://www.yousendit.com/download/UVJqaUNITWNtUUZBSXNUQw
>>>
>>> You will need the Analyzer software to read it:
>>>
>>> ftp://ftp.agilent.com/cos/outbound/PCIe_Gen2/Analyzer/6.13%20for%20Gen1%20anlyzer/
>>>
>>> The record in question is # 3957101. It is Completion with Data for the
>>> Memory Read request by the EP at record #3957098. This packet was NAK by
>>> the
>>> EP and so there was a replay of this packet. The reason for NAK was that
>>> some extra bytes were inserted and transmitted by RC.
>>>
>>> Regards,
>>>
>>> Pannir
>>>
>

^ permalink raw reply	[flat|nested] 10+ messages in thread

* [ath9k-devel] WiFi Failure with AR9280
  2013-04-22  8:38               ` Adrian Chadd
@ 2013-04-22 14:58                 ` Pannirselvam Kanagaratnam
  0 siblings, 0 replies; 10+ messages in thread
From: Pannirselvam Kanagaratnam @ 2013-04-22 14:58 UTC (permalink / raw)
  To: ath9k-devel

Hello,

I have created a bug report at:
https://bugzilla.kernel.org/show_bug.cgi?id=56991

Thanks for your support.

Regards,

Pannir

On 22/4/2013 4:38 PM, Adrian Chadd wrote:
> Hi,
>
> Please wrap all of this up in a bug report at bugzilla.kernel.org .
>
> I'll then punt it around internally to see if someone can dig up
> what's going on.
>
> Thanks,
>
>
>
> Adrian
>
> On 22 April 2013 00:41, Pannirselvam Kanagaratnam <pannir@xsmail.com> wrote:
>> Hello,
>>
>> I am using the latest revision and the chip vendor has confirmed that the
>> errata is applicable to the latest revision.
>>
>> I analyzed the behaviour of the AR9287 and AR9380 and found the following
>> when encountering the "Completion with Data" packet with LCRC errors:
>>
>> 1) AR9280: Completion Timeout bit: 1. Bad TLP bit: 1.
>> 2) AR9287: Completion Timeout bit: 0. Bad TLP bit: 1.
>> 3) AR9380: Completion Timeout bit: 0. Bad TLP bit: 0.
>>
>> I can get > 90 Mbps on both the AR9287 and AR9380. Is there a way to disable
>> the completion timeout on the AR9280? Can you point me to the code which
>> handles the transaction timeout and transaction retries? Is this the PCIe
>> host interface code you were referring to?
>>
>> Thank you.
>>
>> Pannir
>>
>>
>> On 19/4/2013 2:36 PM, Adrian Chadd wrote:
>>> Hi,
>>>
>>> Thanks for digging into this!
>>>
>>> Unfortunately going digging through the PCIe host interface code will
>>> take too much time and I just don't have the time to spare at the
>>> moment.
>>>
>>> There _may_ be a workaround? But it's also equally likely that it was
>>> just fixed in later revisions of the chip so to be more tolerant with
>>> known buggy PCIe implementations.
>>>
>>> I think the host interface glue has a transaction timeout and
>>> transaction retry counter. I'm not sure what the details are though
>>> and whether or not it'll help here.
>>>
>>>
>>>
>>> Adrian
>>>
>>> On 18 April 2013 03:16, Pannirselvam Kanagaratnam <pannir@xsmail.com>
>>> wrote:
>>>> Hello,
>>>>
>>>> Upon analyzing the Protocol Analyzer I shared my finding with the CPU
>>>> chipset support. They said it is an errata on their CPU. It reads as
>>>> follows:
>>>>
>>>> **********************************
>>>>
>>>> PCI Express Packets may be transmitted with excess data on x1 link
>>>> Description: An internal error in the PCI Express controller may cause 8
>>>> bytes of packet header or data
>>>> payload to be duplicated on transmit. Depending on the location of the
>>>> duplicate bytes, this
>>>> may cause a malformed packet error or CRC error detected in the link
>>>> partner.
>>>>
>>>> Impact: A PCI Express link partner may see excess correctable errors or
>>>> malformed packets in x1
>>>> links.
>>>>
>>>> Note that any PCI Express specification-compliant recipient at the
>>>> ultimate
>>>> end of the
>>>> transaction may only recognize the erroneous packet as a Bad TLP and flag
>>>> the error as
>>>> correctable due to the LCRC error. It should respond with a NAK and then
>>>> wait for the device
>>>> to re-try the packet. The ultimate recipient should not recognize the
>>>> erroneous packet as a
>>>> Malformed TLP, since before reaching the transaction layer the Bad TLP
>>>> should have been
>>>> discarded by the ultimate recipient?s link layer.
>>>>
>>>> **********************************
>>>>
>>>> I have noticed that although the error occurs, the AR9287, AR9380 and
>>>> AR9382
>>>> are able to handle this. However, the AR9280 link fails. Is there
>>>> something
>>>> that can be done on the driver level to manage this?
>>>>
>>>> You may take a look at the analyzer logs at:
>>>> http://www.yousendit.com/download/UVJqaUNITWNtUUZBSXNUQw
>>>>
>>>> You will need the Analyzer software to read it:
>>>>
>>>> ftp://ftp.agilent.com/cos/outbound/PCIe_Gen2/Analyzer/6.13%20for%20Gen1%20anlyzer/
>>>>
>>>> The record in question is # 3957101. It is Completion with Data for the
>>>> Memory Read request by the EP at record #3957098. This packet was NAK by
>>>> the
>>>> EP and so there was a replay of this packet. The reason for NAK was that
>>>> some extra bytes were inserted and transmitted by RC.
>>>>
>>>> Regards,
>>>>
>>>> Pannir
>>>>

^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2013-04-22 14:58 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-03-24 16:26 [ath9k-devel] WiFi Failure with AR9280 Pannirselvam Kanagaratnam
2013-03-24 16:39 ` Felix Fietkau
2013-03-27  3:27   ` Pannirselvam Kanagaratnam
2013-04-15  2:07     ` Pannirselvam Kanagaratnam
2013-04-15  2:31       ` Adrian Chadd
2013-04-18 10:16         ` Pannirselvam Kanagaratnam
2013-04-19  6:36           ` Adrian Chadd
2013-04-22  7:41             ` Pannirselvam Kanagaratnam
2013-04-22  8:38               ` Adrian Chadd
2013-04-22 14:58                 ` Pannirselvam Kanagaratnam

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.