regressions.lists.linux.dev archive mirror
 help / color / mirror / Atom feed
* Fwd: ath11k: QCN9074: ce desc not available for wmi command
@ 2023-06-15  7:07 Bagas Sanjaya
  2023-06-26 12:49 ` Linux regression tracking (Thorsten Leemhuis)
  0 siblings, 1 reply; 9+ messages in thread
From: Bagas Sanjaya @ 2023-06-15  7:07 UTC (permalink / raw)
  To: Manikanta Pubbisetty, Kalle Valo, Johannes Berg, Jakub Kicinski
  Cc: Linux Kernel Mailing List, Linux Regressions, Linux Atheros 11K

Hi,

I notice a regression report on Bugzilla [1]. Quoting from it:

> Hello, 
> 
> We are trying to connect 2x QCN9074 together (one as AP, the other as client). 
> 
> Using Ubuntu 22.04 hwe 5.19 generic kernel allows to pair both units in 800.11ac 80MHz only. Any other combinations of 802.11ax or ac/ax with 160MHz bandwidth does not work. The client kernel freezes when associating to QCN9074 AP without specific logs and requires reboot. I'll post another bug once I can get more logs. 
> 
> Since quite some patches came through since 5.19 - some of them related to 160MHz bandwidth. I tried multiple newer mainline kernels without success and usually the same error. 
> 
> Building kernel from latest ath master branch: 6.4.0-rc4-wt-ath+ gives the following dmesg output:
> 
> [  353.587072] ath11k_pci 0000:04:00.0: BAR 0: assigned [mem 0xa4200000-0xa43fffff 64bit]
> [  353.587180] ath11k_pci 0000:04:00.0: MSI vectors: 1
> [  353.587186] ath11k_pci 0000:04:00.0: qcn9074 hw1.0
> [  353.741799] mhi mhi0: Requested to power ON
> [  353.741806] mhi mhi0: Power on setup success
> [  353.912479] mhi mhi0: Wait for device to enter SBL or Mission mode
> [  354.007221] ath11k_pci 0000:04:00.0: chip_id 0x0 chip_family 0x0 board_id 0xff soc_id 0xffffffff
> [  354.007225] ath11k_pci 0000:04:00.0: fw_version 0x2403072e fw_build_timestamp 2021-06-06 23:27 fw_build_id 
> [  355.333791] ath11k_pci 0000:04:00.0: leaving PCI ASPM disabled to avoid MHI M2 problems
> [  355.729786] ath11k_pci 0000:04:00.0 wlp4s0: renamed from wlan0
> [  358.960477] ath11k_pci 0000:04:00.0: ce desc not available for wmi command 36866
> [  358.960481] ath11k_pci 0000:04:00.0: failed to send WMI_STA_POWERSAVE_PARAM_CMDID
> [  358.960484] ath11k_pci 0000:04:00.0: could not set uapsd params -105
> [  358.960485] ath11k_pci 0000:04:00.0: failed to set sta uapsd: -105
> [  362.032472] ath11k_pci 0000:04:00.0: ce desc not available for wmi command 90113
> [  362.032477] ath11k_pci 0000:04:00.0: failed to send WMI_REQUEST_STATS cmd
> [  362.032479] ath11k_pci 0000:04:00.0: could not request fw stats (-105)
> [  362.032480] ath11k_pci 0000:04:00.0: failed to request fw pdev stats: -105
> [  365.104479] ath11k_pci 0000:04:00.0: ce desc not available for wmi command 20482
> [  365.104483] ath11k_pci 0000:04:00.0: failed to submit WMI_VDEV_DELETE_CMDID
> [  365.104485] ath11k_pci 0000:04:00.0: failed to delete WMI vdev 0: -105
> [  365.104487] ath11k_pci 0000:04:00.0: failed to delete vdev 0: -105
> [  368.176472] ath11k_pci 0000:04:00.0: ce desc not available for wmi command 16387
> [  368.176476] ath11k_pci 0000:04:00.0: failed to send WMI_PDEV_SET_PARAM cmd
> [  368.176479] ath11k_pci 0000:04:00.0: failed to enable PMF QOS: (-105
> [  371.248474] ath11k_pci 0000:04:00.0: ce desc not available for wmi command 16387
> [  371.248478] ath11k_pci 0000:04:00.0: failed to send WMI_PDEV_SET_PARAM cmd
> [  371.248480] ath11k_pci 0000:04:00.0: failed to enable PMF QOS: (-105
> [  374.320393] ath11k_pci 0000:04:00.0: ce desc not available for wmi command 16387
> [  374.320397] ath11k_pci 0000:04:00.0: failed to send WMI_PDEV_SET_PARAM cmd
> [  374.320400] ath11k_pci 0000:04:00.0: failed to enable PMF QOS: (-105
> 
> Both PCs are Intel x86 (same bug for AMD). We have multiple references of QCN9074 that we tested:
> - Sparklan WPEQ-405AX (our preferred one, as they are the only vendor I know that went through FCC certification) - This unit can associate to APs only with the firmware Sparklan provided. Otherwise, link strength is reported low and barely no APs are listed after scanning.
> - Emwicon WMX7406 - has better performances with Sparklan's vendor FW. Works with ath11k-firmware 2.7.0.1 but shows lower TX mostly.
> 
> Tested FW (non exhaustive):
> # ath11k-firmware 2.5.0.1
> 823915206101779f8cab6b89066e1040  /lib/firmware/ath11k/QCN9074/hw1.0/amss.bin
> 668f53050a92db5b4281ae5f26c7e35d  /lib/firmware/ath11k/QCN9074/hw1.0/board-2.bin
> fcca36959c5f56f9f0fb7015083dc806  /lib/firmware/ath11k/QCN9074/hw1.0/m3.bin
> 
> # ath11k-firmware 2.7.0.1
> 465d0a063d049f7e4b79d267a035c6c7  /lib/firmware/ath11k/QCN9074/hw1.0/amss.bin
> 668f53050a92db5b4281ae5f26c7e35d  /lib/firmware/ath11k/QCN9074/hw1.0/board-2.bin
> ad8fafb9c1deab744c972469be916e72  /lib/firmware/ath11k/QCN9074/hw1.0/m3.bin
> 
> # Vendor firmware 
> 1e88ff2e2b5bcf7f130397cb5b21ef39  /lib/firmware/ath11k/QCN9074/hw1.0/amss.bin
> 7b3ce8686713a724946466ec1fefc2f4  /lib/firmware/ath11k/QCN9074/hw1.0/board.bin
> d0a6f7ccd52f9e3886f0bc96309f7b9a  /lib/firmware/ath11k/QCN9074/hw1.0/m3.bin
> 
> 
> Attached dmesg log with ath11k debug_mask=0xFFFF and lspci. 
> 
> Thank you

See Bugzilla for the full thread and attached dmesg.

Manikanta: This regression is apparently caused by a commit of yours.
Would you like to take a look on it?

Anyway, I'm adding it to regzbot:

#regzbot introduced: 13aa2fb692d371 https://bugzilla.kernel.org/show_bug.cgi?id=217536
#regzbot title: Threaded NAPI causes ce desc unavailable error on ath11k

Thanks.

[1]: https://bugzilla.kernel.org/show_bug.cgi?id=217536

-- 
An old man doll... just what I always wanted! - Clara

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Fwd: ath11k: QCN9074: ce desc not available for wmi command
  2023-06-15  7:07 Fwd: ath11k: QCN9074: ce desc not available for wmi command Bagas Sanjaya
@ 2023-06-26 12:49 ` Linux regression tracking (Thorsten Leemhuis)
  2023-07-11  6:57   ` Kalle Valo
  2023-07-25  9:17   ` Manikanta Pubbisetty
  0 siblings, 2 replies; 9+ messages in thread
From: Linux regression tracking (Thorsten Leemhuis) @ 2023-06-26 12:49 UTC (permalink / raw)
  To: Bagas Sanjaya, Manikanta Pubbisetty, Kalle Valo, Johannes Berg,
	Jakub Kicinski
  Cc: Linux Kernel Mailing List, Linux Regressions, Linux Atheros 11K

Hi, Thorsten here, the Linux kernel's regression tracker. Top-posting
for once, to make this easily accessible to everyone.

Hmmm, there afaics was no real progress and not even a single reply from
a developer (neither here or in bugzilla) since the issue was reported
~10 days ago. :-/

Manikanta, did you maybe just miss that this is caused by change of
yours (and thus is something you should look into)?

Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
--
Everything you wanna know about Linux kernel regression tracking:
https://linux-regtracking.leemhuis.info/about/#tldr
If I did something stupid, please tell me, as explained on that page.

#regzbot poke

On 15.06.23 09:07, Bagas Sanjaya wrote:
> Hi,
> 
> I notice a regression report on Bugzilla [1]. Quoting from it:
> 
>> Hello, 
>>
>> We are trying to connect 2x QCN9074 together (one as AP, the other as client). 
>>
>> Using Ubuntu 22.04 hwe 5.19 generic kernel allows to pair both units in 800.11ac 80MHz only. Any other combinations of 802.11ax or ac/ax with 160MHz bandwidth does not work. The client kernel freezes when associating to QCN9074 AP without specific logs and requires reboot. I'll post another bug once I can get more logs. 
>>
>> Since quite some patches came through since 5.19 - some of them related to 160MHz bandwidth. I tried multiple newer mainline kernels without success and usually the same error. 
>>
>> Building kernel from latest ath master branch: 6.4.0-rc4-wt-ath+ gives the following dmesg output:
>>
>> [  353.587072] ath11k_pci 0000:04:00.0: BAR 0: assigned [mem 0xa4200000-0xa43fffff 64bit]
>> [  353.587180] ath11k_pci 0000:04:00.0: MSI vectors: 1
>> [  353.587186] ath11k_pci 0000:04:00.0: qcn9074 hw1.0
>> [  353.741799] mhi mhi0: Requested to power ON
>> [  353.741806] mhi mhi0: Power on setup success
>> [  353.912479] mhi mhi0: Wait for device to enter SBL or Mission mode
>> [  354.007221] ath11k_pci 0000:04:00.0: chip_id 0x0 chip_family 0x0 board_id 0xff soc_id 0xffffffff
>> [  354.007225] ath11k_pci 0000:04:00.0: fw_version 0x2403072e fw_build_timestamp 2021-06-06 23:27 fw_build_id 
>> [  355.333791] ath11k_pci 0000:04:00.0: leaving PCI ASPM disabled to avoid MHI M2 problems
>> [  355.729786] ath11k_pci 0000:04:00.0 wlp4s0: renamed from wlan0
>> [  358.960477] ath11k_pci 0000:04:00.0: ce desc not available for wmi command 36866
>> [  358.960481] ath11k_pci 0000:04:00.0: failed to send WMI_STA_POWERSAVE_PARAM_CMDID
>> [  358.960484] ath11k_pci 0000:04:00.0: could not set uapsd params -105
>> [  358.960485] ath11k_pci 0000:04:00.0: failed to set sta uapsd: -105
>> [  362.032472] ath11k_pci 0000:04:00.0: ce desc not available for wmi command 90113
>> [  362.032477] ath11k_pci 0000:04:00.0: failed to send WMI_REQUEST_STATS cmd
>> [  362.032479] ath11k_pci 0000:04:00.0: could not request fw stats (-105)
>> [  362.032480] ath11k_pci 0000:04:00.0: failed to request fw pdev stats: -105
>> [  365.104479] ath11k_pci 0000:04:00.0: ce desc not available for wmi command 20482
>> [  365.104483] ath11k_pci 0000:04:00.0: failed to submit WMI_VDEV_DELETE_CMDID
>> [  365.104485] ath11k_pci 0000:04:00.0: failed to delete WMI vdev 0: -105
>> [  365.104487] ath11k_pci 0000:04:00.0: failed to delete vdev 0: -105
>> [  368.176472] ath11k_pci 0000:04:00.0: ce desc not available for wmi command 16387
>> [  368.176476] ath11k_pci 0000:04:00.0: failed to send WMI_PDEV_SET_PARAM cmd
>> [  368.176479] ath11k_pci 0000:04:00.0: failed to enable PMF QOS: (-105
>> [  371.248474] ath11k_pci 0000:04:00.0: ce desc not available for wmi command 16387
>> [  371.248478] ath11k_pci 0000:04:00.0: failed to send WMI_PDEV_SET_PARAM cmd
>> [  371.248480] ath11k_pci 0000:04:00.0: failed to enable PMF QOS: (-105
>> [  374.320393] ath11k_pci 0000:04:00.0: ce desc not available for wmi command 16387
>> [  374.320397] ath11k_pci 0000:04:00.0: failed to send WMI_PDEV_SET_PARAM cmd
>> [  374.320400] ath11k_pci 0000:04:00.0: failed to enable PMF QOS: (-105
>>
>> Both PCs are Intel x86 (same bug for AMD). We have multiple references of QCN9074 that we tested:
>> - Sparklan WPEQ-405AX (our preferred one, as they are the only vendor I know that went through FCC certification) - This unit can associate to APs only with the firmware Sparklan provided. Otherwise, link strength is reported low and barely no APs are listed after scanning.
>> - Emwicon WMX7406 - has better performances with Sparklan's vendor FW. Works with ath11k-firmware 2.7.0.1 but shows lower TX mostly.
>>
>> Tested FW (non exhaustive):
>> # ath11k-firmware 2.5.0.1
>> 823915206101779f8cab6b89066e1040  /lib/firmware/ath11k/QCN9074/hw1.0/amss.bin
>> 668f53050a92db5b4281ae5f26c7e35d  /lib/firmware/ath11k/QCN9074/hw1.0/board-2.bin
>> fcca36959c5f56f9f0fb7015083dc806  /lib/firmware/ath11k/QCN9074/hw1.0/m3.bin
>>
>> # ath11k-firmware 2.7.0.1
>> 465d0a063d049f7e4b79d267a035c6c7  /lib/firmware/ath11k/QCN9074/hw1.0/amss.bin
>> 668f53050a92db5b4281ae5f26c7e35d  /lib/firmware/ath11k/QCN9074/hw1.0/board-2.bin
>> ad8fafb9c1deab744c972469be916e72  /lib/firmware/ath11k/QCN9074/hw1.0/m3.bin
>>
>> # Vendor firmware 
>> 1e88ff2e2b5bcf7f130397cb5b21ef39  /lib/firmware/ath11k/QCN9074/hw1.0/amss.bin
>> 7b3ce8686713a724946466ec1fefc2f4  /lib/firmware/ath11k/QCN9074/hw1.0/board.bin
>> d0a6f7ccd52f9e3886f0bc96309f7b9a  /lib/firmware/ath11k/QCN9074/hw1.0/m3.bin
>>
>>
>> Attached dmesg log with ath11k debug_mask=0xFFFF and lspci. 
>>
>> Thank you
> 
> See Bugzilla for the full thread and attached dmesg.
> 
> Manikanta: This regression is apparently caused by a commit of yours.
> Would you like to take a look on it?
> 
> Anyway, I'm adding it to regzbot:
> 
> #regzbot introduced: 13aa2fb692d371 https://bugzilla.kernel.org/show_bug.cgi?id=217536
> #regzbot title: Threaded NAPI causes ce desc unavailable error on ath11k
> 
> Thanks.
> 
> [1]: https://bugzilla.kernel.org/show_bug.cgi?id=217536
> 

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Fwd: ath11k: QCN9074: ce desc not available for wmi command
  2023-06-26 12:49 ` Linux regression tracking (Thorsten Leemhuis)
@ 2023-07-11  6:57   ` Kalle Valo
  2023-07-25  9:17   ` Manikanta Pubbisetty
  1 sibling, 0 replies; 9+ messages in thread
From: Kalle Valo @ 2023-07-11  6:57 UTC (permalink / raw)
  To: Linux regression tracking (Thorsten Leemhuis)
  Cc: Bagas Sanjaya, Manikanta Pubbisetty, Johannes Berg,
	Jakub Kicinski, Linux regressions mailing list,
	Linux Kernel Mailing List, Linux Atheros 11K

"Linux regression tracking (Thorsten Leemhuis)"
<regressions@leemhuis.info> writes:

> Hi, Thorsten here, the Linux kernel's regression tracker. Top-posting
> for once, to make this easily accessible to everyone.
>
> Hmmm, there afaics was no real progress and not even a single reply from
> a developer (neither here or in bugzilla) since the issue was reported
> ~10 days ago. :-/
>
> Manikanta, did you maybe just miss that this is caused by change of
> yours (and thus is something you should look into)?

No reply from Manikanta so I think I'll just revert the commit. I have
assigned bug #217536 to me now.

The wireless trees are closed for July but my plan is that I submit the
revert directly to net tree.

-- 
https://patchwork.kernel.org/project/linux-wireless/list/

https://wireless.wiki.kernel.org/en/developers/documentation/submittingpatches

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Fwd: ath11k: QCN9074: ce desc not available for wmi command
  2023-06-26 12:49 ` Linux regression tracking (Thorsten Leemhuis)
  2023-07-11  6:57   ` Kalle Valo
@ 2023-07-25  9:17   ` Manikanta Pubbisetty
  2023-07-26  9:21     ` Linux regression tracking (Thorsten Leemhuis)
  1 sibling, 1 reply; 9+ messages in thread
From: Manikanta Pubbisetty @ 2023-07-25  9:17 UTC (permalink / raw)
  To: Linux regressions mailing list, Bagas Sanjaya, Kalle Valo,
	Johannes Berg, Jakub Kicinski
  Cc: Linux Kernel Mailing List, Linux Atheros 11K

On 6/26/2023 6:19 PM, Linux regression tracking (Thorsten Leemhuis) wrote:
> Hi, Thorsten here, the Linux kernel's regression tracker. Top-posting
> for once, to make this easily accessible to everyone.
> 
> Hmmm, there afaics was no real progress and not even a single reply from
> a developer (neither here or in bugzilla) since the issue was reported
> ~10 days ago. :-/
> 
> Manikanta, did you maybe just miss that this is caused by change of
> yours (and thus is something you should look into)?
> 

Extremely sorry for having this missed due to incorrect mail filters on 
my machine. I have looked the logs attached to the buganizer.

The issue from the logs looks like it is happening during the boot.
Generally, issues like these "ce desc not available for wmi command" 
occur when there is no room in the copy engine pipe for driver to 
enqueue the command to the firmware and in many cases these would have 
happen when firmware is reaping the ring slowly.

It is puzzling to know that thread NAPI is causing this and reverting 
this got the issue fixed. NAPI generally acts on the RX rings and has 
nothing to do with the TX.

Hi Sanjay,

This issue is seen just with the kernel upgrade alone? Or firmware has 
also been upgraded?

Meanwhile, I'll try to repro the issue on my local setup and try to root 
cause the problem. Pls let me know the firmware version that has been 
used for testing.

Although I'm okay reverting the threaded NAPI patch for now, in the long 
run we want that back as threaded NAPI brings significant improvement on 
the throughput front.

Thanks,
Manikanta

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Fwd: ath11k: QCN9074: ce desc not available for wmi command
  2023-07-25  9:17   ` Manikanta Pubbisetty
@ 2023-07-26  9:21     ` Linux regression tracking (Thorsten Leemhuis)
  2023-07-26  9:49       ` Manikanta Pubbisetty
  0 siblings, 1 reply; 9+ messages in thread
From: Linux regression tracking (Thorsten Leemhuis) @ 2023-07-26  9:21 UTC (permalink / raw)
  To: Manikanta Pubbisetty, Linux regressions mailing list,
	Bagas Sanjaya, Kalle Valo, Johannes Berg, Jakub Kicinski
  Cc: Linux Kernel Mailing List, Linux Atheros 11K

On 25.07.23 11:17, Manikanta Pubbisetty wrote:
> On 6/26/2023 6:19 PM, Linux regression tracking (Thorsten Leemhuis) wrote:
>>
>> Hmmm, there afaics was no real progress and not even a single reply from
>> a developer (neither here or in bugzilla) since the issue was reported
>> ~10 days ago. :-/

BTW: Kalle, many thx for picking this up and posting & applying the revert!

>> Manikanta, did you maybe just miss that this is caused by change of
>> yours (and thus is something you should look into)?
> 
> Extremely sorry for having this missed [...]
> 
> Hi Sanjay, [...]

FWIW, Bagas Sanjaya just forwarded the report and the reporter is not
CCed afaics (bugzilla privacy policy does not allow this, which
complicates things a lot :-/ ). You have to use bugzilla to reach the
reporter: https://bugzilla.kernel.org/show_bug.cgi?id=217536

Bagas Sanjaya: wondering if you should make that "I'm just forwarding"
aspect more obvious in your mails. And it afaics would also be good to
mentioned the author of the culprit quite early in your mails, as there
is a risk that people will miss that aspect otherwise.

Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
--
Everything you wanna know about Linux kernel regression tracking:
https://linux-regtracking.leemhuis.info/about/#tldr
If I did something stupid, please tell me, as explained on that page.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Fwd: ath11k: QCN9074: ce desc not available for wmi command
  2023-07-26  9:21     ` Linux regression tracking (Thorsten Leemhuis)
@ 2023-07-26  9:49       ` Manikanta Pubbisetty
  2023-09-03 23:30         ` Tyler Stachecki
  0 siblings, 1 reply; 9+ messages in thread
From: Manikanta Pubbisetty @ 2023-07-26  9:49 UTC (permalink / raw)
  To: Linux regressions mailing list, Bagas Sanjaya, Kalle Valo,
	Johannes Berg, Jakub Kicinski
  Cc: Linux Kernel Mailing List, Linux Atheros 11K


On 7/26/2023 2:51 PM, Linux regression tracking (Thorsten Leemhuis) wrote:
> On 25.07.23 11:17, Manikanta Pubbisetty wrote:
>> On 6/26/2023 6:19 PM, Linux regression tracking (Thorsten Leemhuis) wrote:
>>>
>>> Hmmm, there afaics was no real progress and not even a single reply from
>>> a developer (neither here or in bugzilla) since the issue was reported
>>> ~10 days ago. :-/
> 
> BTW: Kalle, many thx for picking this up and posting & applying the revert!
> 
>>> Manikanta, did you maybe just miss that this is caused by change of
>>> yours (and thus is something you should look into)?
>>
>> Extremely sorry for having this missed [...]
>>
>> Hi Sanjay, [...]
> 
> FWIW, Bagas Sanjaya just forwarded the report and the reporter is not
> CCed afaics (bugzilla privacy policy does not allow this, which
> complicates things a lot :-/ ). You have to use bugzilla to reach the
> reporter: https://bugzilla.kernel.org/show_bug.cgi?id=217536
> 

Sure, thanks Thorsten.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Fwd: ath11k: QCN9074: ce desc not available for wmi command
  2023-07-26  9:49       ` Manikanta Pubbisetty
@ 2023-09-03 23:30         ` Tyler Stachecki
  2023-09-04  9:47           ` Manikanta Pubbisetty
  0 siblings, 1 reply; 9+ messages in thread
From: Tyler Stachecki @ 2023-09-03 23:30 UTC (permalink / raw)
  To: Manikanta Pubbisetty
  Cc: Linux regressions mailing list, Bagas Sanjaya, Kalle Valo,
	Johannes Berg, Jakub Kicinski, Linux Kernel Mailing List,
	Linux Atheros 11K

> On 25.07.23 11:17, Manikanta Pubbisetty wrote:
> 
> FWIW, Bagas Sanjaya just forwarded the report and the reporter is not
> CCed afaics (bugzilla privacy policy does not allow this, which
> complicates things a lot :-/ ). You have to use bugzilla to reach the
> reporter: https://bugzilla.kernel.org/show_bug.cgi?id=217536

Hi Manikanta,

I just wanted to report that this is likely related to QCN9074 when the host
system only has 1 MSI-X vector available for the modem and/or related to a
product named "WPEQ-405AX".

I have two different hosts running the exact same kernel, same QCN9074
firmware (WLAN.HK.2.7.0.1-01744-QCAHKSWPL_SILICONZ-1), etc. The only
differences are that the one which does not work is running on a slightly
older Intel SBC, with the older one leveraging mPCIe instead of PCIe and
only having one MSI-X vector.

I tried backing out the threaded NAPI commit and, as mentioned, everything
begins working again on the host with 1 MSI-X vector. I have also seen some
other oddities with the system only working with 1 MSI-X vector, such as
the modem not working when I boot with hpet=disabled. I am guessing it is
not related, but mentioning it just in case.

The only other thing I'll mention is that the CE desc errors are *only* seen
after upping the link (via `ip link set wlp1s0 up`). After this point, doing
something as simple as reading the temperature of the modem fails and the
kernel log starts printing the errors described above. Prior to that, however,
no error messages are seen.

I'm happy to be of service to test any changes you might suggest. Thanks
for the threaded NAPI work, by the way - it definitely provides a boost!

Regards,
Tyler

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Fwd: ath11k: QCN9074: ce desc not available for wmi command
  2023-09-03 23:30         ` Tyler Stachecki
@ 2023-09-04  9:47           ` Manikanta Pubbisetty
  2023-10-26 23:06             ` Tyler Stachecki
  0 siblings, 1 reply; 9+ messages in thread
From: Manikanta Pubbisetty @ 2023-09-04  9:47 UTC (permalink / raw)
  To: Tyler Stachecki
  Cc: Linux regressions mailing list, Bagas Sanjaya, Kalle Valo,
	Johannes Berg, Jakub Kicinski, Linux Kernel Mailing List,
	Linux Atheros 11K

On 9/4/2023 5:00 AM, Tyler Stachecki wrote:
>> On 25.07.23 11:17, Manikanta Pubbisetty wrote:
>>
>> FWIW, Bagas Sanjaya just forwarded the report and the reporter is not
>> CCed afaics (bugzilla privacy policy does not allow this, which
>> complicates things a lot :-/ ). You have to use bugzilla to reach the
>> reporter: https://bugzilla.kernel.org/show_bug.cgi?id=217536
> 
> Hi Manikanta,
> 
> I just wanted to report that this is likely related to QCN9074 when the host
> system only has 1 MSI-X vector available for the modem and/or related to a
> product named "WPEQ-405AX".
> 
> I have two different hosts running the exact same kernel, same QCN9074
> firmware (WLAN.HK.2.7.0.1-01744-QCAHKSWPL_SILICONZ-1), etc. The only
> differences are that the one which does not work is running on a slightly
> older Intel SBC, with the older one leveraging mPCIe instead of PCIe and
> only having one MSI-X vector.

Yes, you are right. This seems to be a problem with some hardware having 
QCN9074. We have tried to reproduce this problem in QC on different 
hardware but could not reproduce it even once. Not even with one MSI vector.

> 
> I tried backing out the threaded NAPI commit and, as mentioned, everything
> begins working again on the host with 1 MSI-X vector. I have also seen some
> other oddities with the system only working with 1 MSI-X vector, such as
> the modem not working when I boot with hpet=disabled. I am guessing it is
> not related, but mentioning it just in case.
> 
> The only other thing I'll mention is that the CE desc errors are *only* seen
> after upping the link (via `ip link set wlp1s0 up`). After this point, doing
> something as simple as reading the temperature of the modem fails and the
> kernel log starts printing the errors described above. Prior to that, however,
> no error messages are seen.
>

True, I had worked with the reporter for quite some time. To me at 
first, it seemed like a problem with improper configuration of the MSI 
DATA in the QCN9074 hardware. I'm investigating further.

> I'm happy to be of service to test any changes you might suggest. Thanks
> for the threaded NAPI work, by the way - it definitely provides a boost!
> 

Sure, Thanks a lot.

Manikanta

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Fwd: ath11k: QCN9074: ce desc not available for wmi command
  2023-09-04  9:47           ` Manikanta Pubbisetty
@ 2023-10-26 23:06             ` Tyler Stachecki
  0 siblings, 0 replies; 9+ messages in thread
From: Tyler Stachecki @ 2023-10-26 23:06 UTC (permalink / raw)
  To: Manikanta Pubbisetty
  Cc: Linux regressions mailing list, Bagas Sanjaya, Kalle Valo,
	Johannes Berg, Jakub Kicinski, Linux Kernel Mailing List,
	Linux Atheros 11K

On Mon, Sep 4, 2023 at 5:47 AM Manikanta Pubbisetty
<quic_mpubbise@quicinc.com> wrote:
> > Hi Manikanta,
> >
> > I just wanted to report that this is likely related to QCN9074 when the host
> > system only has 1 MSI-X vector available for the modem and/or related to a
> > product named "WPEQ-405AX".
> >
> > I have two different hosts running the exact same kernel, same QCN9074
> > firmware (WLAN.HK.2.7.0.1-01744-QCAHKSWPL_SILICONZ-1), etc. The only
> > differences are that the one which does not work is running on a slightly
> > older Intel SBC, with the older one leveraging mPCIe instead of PCIe and
> > only having one MSI-X vector.
>
> Yes, you are right. This seems to be a problem with some hardware having
> QCN9074. We have tried to reproduce this problem in QC on different
> hardware but could not reproduce it even once. Not even with one MSI vector.

Just as a heads up, this "Sparklan WPEQ-405AX" version of QCN9074 may
be adding to some of the confusion here and so there may be two
problems. As mentioned previously, CE desc errors stopped after
reverting the threaded NAPI patch. However, there's something odd
about this modem - it does not work with the board-2.bin that Kalle
provides as the OP noted.

Upon request, the vendor of this modem provides a board.bin for
WPEQ-405AX compatible with a copy of amss/m3 which appears to be
WLAN.HK.2.4.0.1-01838-QCAHKSWPL_SILICONZ-1 (based on checksums). I
found out the hard way that the ABI of the BDFs changed for
WLAN.HK.2.5.0 firmwares and beyond, making things non-backwards
compatible with the board.bin that the vendor provides.. which is
unfortunate as the vendor would not supply a BDF built against 2.5.0+
when requested.

The board-2.bin that Kalle allows the modem start, but it fails to
associate or really do anything useful beyond announcing a SSID in AP
mode.

Anyways: I think it's mostly just an issue with the IRQ affinity --
maybe the threaded NAPI patch is changing it somehow...

Cheers,
Tyler

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2023-10-26 23:06 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-06-15  7:07 Fwd: ath11k: QCN9074: ce desc not available for wmi command Bagas Sanjaya
2023-06-26 12:49 ` Linux regression tracking (Thorsten Leemhuis)
2023-07-11  6:57   ` Kalle Valo
2023-07-25  9:17   ` Manikanta Pubbisetty
2023-07-26  9:21     ` Linux regression tracking (Thorsten Leemhuis)
2023-07-26  9:49       ` Manikanta Pubbisetty
2023-09-03 23:30         ` Tyler Stachecki
2023-09-04  9:47           ` Manikanta Pubbisetty
2023-10-26 23:06             ` Tyler Stachecki

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).