All of lore.kernel.org
 help / color / mirror / Atom feed
From: Karol Herbst <kherbst-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
To: Marc MERLIN <marc_nouveau-xnduUnryOU1AfugRpC6u6w@public.gmane.org>
Cc: nouveau <nouveau-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org>
Subject: Re: 5.9.11 still hanging 2mn at each boot and looping on nvidia-gpu 0000:01:00.3: PME# enabled (Quadro RTX 4000 Mobile)
Date: Mon, 4 Jan 2021 14:28:37 +0100	[thread overview]
Message-ID: <CACO55tsdG37YKv7FV2er4hRnXk9vmwMbPuPptA+=ZtziWXC2+g@mail.gmail.com> (raw)
In-Reply-To: <20210104114955.GM32533-xnduUnryOU1AfugRpC6u6w@public.gmane.org>

mhh, that PCI config stuff should really not happen all the time, but
it also doesn't appear to. The other thing I really don't know is, how
well the runpm works with tools like TLP if there isn't only an audio
device, but also the USB stuff and all the subdevices have to be
turned off all the time in order for the GPU to stay powered down.

The firmware stuff is also just a functional problem, so you won't get
display offloading, but it shouldn't drain your battery as long as
nothing is connected. I'd check with "grep .
/sys/bus/pci/devices/*/power/runtime_status" if all subdevices of the
GPU are powered down, and check which one gets enabled regularly or
something.

On Mon, Jan 4, 2021 at 12:50 PM Marc MERLIN <marc_nouveau-xnduUnryOU1AfugRpC6u6w@public.gmane.org> wrote:
>
> On Tue, Dec 29, 2020 at 09:47:50AM -0800, Marc MERLIN wrote:
> > > Of course now that I read your email a bit more carefully, it seems
> > > your issue is with the "saving config space" messages. I'm not sure
> > > I've seen those before. Perhaps you have some sort of debug enabled.
> > > I'd find where in the kernel they are being produced, and what the
> > > conditions for it are. But the failure to load firmware isn't great --
> > > not 100% sure if it impacts runpm or not.
> >
> > Yes, I have 'nouveau.debug=disp=trace'
> > Someone on this list asked me to add this a few months back.
> >
> > > I just double-checked, TU10x accel came in via
> > > afa3b96b058d87c2c44d1c83dadb2ba6998d03ce, which was first in v5.6.
> > > Initial TU10x support came in v5.0. So that doesn't line up with your
> > > timeline.
> >
> > You know, I said 5.5, maybe it was 5.6 now, it's been a little while
> > since those issues started.
> >
> > Now we know I was missing the required firmware, it's a good place to
> > start, so I'll start there, thank you very much for the pointers.
>
> Sorry for the delay. I rebooted and everything worked great.
> No hang at boot.
> As for the PME loop I've been seeing, it hasn't happened so far.
>
> I can't comment on whether firmware should be required for the kernel to
> boot properly, but if it's at all possible, please try to make the
> driver fall back or shut down if the firmware is absent as opposed to
> hanging the boot 2mn.
>
> Also some drivers give a better clue that their firmware is missing
> and where to get it from. Adding a printk to help users could be a good
> idea.
>
> Below is the boot with firmware present.
>
> Thanks for your help
> Marc
>
> sauron:~$ grep nouveau /var/log/dmesg
> [   11.016605] nouveau: detected PR support, will not use DSM
> [   11.025191] nouveau 0000:01:00.0: runtime IRQ mapping not provided by arch
> [   11.071823] nouveau 0000:01:00.0: enabling device (0000 -> 0003)
> [   11.111588] nouveau 0000:01:00.0: NVIDIA TU104 (164000a1)
> [   11.203598] nouveau 0000:01:00.0: bios: version 90.04.4d.00.2c
> [   11.203921] nouveau 0000:01:00.0: pmu: firmware unavailable
> [   11.204229] nouveau 0000:01:00.0: enabling bus mastering
> [   11.204543] nouveau 0000:01:00.0: fb: 8192 MiB GDDR6
> [   11.215524] nouveau 0000:01:00.0: DRM: VRAM: 8192 MiB
> [   11.215525] nouveau 0000:01:00.0: DRM: GART: 536870912 MiB
> [   11.215527] nouveau 0000:01:00.0: DRM: BIT table 'A' not found
> [   11.215527] nouveau 0000:01:00.0: DRM: BIT table 'L' not found
> [   11.215528] nouveau 0000:01:00.0: DRM: TMDS table version 2.0
> [   11.215529] nouveau 0000:01:00.0: DRM: DCB version 4.1
> [   11.215530] nouveau 0000:01:00.0: DRM: DCB outp 00: 02800f66 04600020
> [   11.215531] nouveau 0000:01:00.0: DRM: DCB outp 01: 02011f52 00020010
> [   11.215532] nouveau 0000:01:00.0: DRM: DCB outp 02: 01022f36 04600010
> [   11.215532] nouveau 0000:01:00.0: DRM: DCB outp 03: 04033f76 04600010
> [   11.215533] nouveau 0000:01:00.0: DRM: DCB outp 04: 04044f86 04600020
> [   11.215533] nouveau 0000:01:00.0: DRM: DCB conn 00: 00020047
> [   11.215534] nouveau 0000:01:00.0: DRM: DCB conn 01: 00010161
> [   11.215534] nouveau 0000:01:00.0: DRM: DCB conn 02: 00001248
> [   11.215535] nouveau 0000:01:00.0: DRM: DCB conn 03: 01000348
> [   11.215535] nouveau 0000:01:00.0: DRM: DCB conn 04: 02000471
> [   11.216166] nouveau 0000:01:00.0: DRM: MM: using COPY for buffer copies
> [   11.526753] nouveau 0000:01:00.0: DRM: unknown connector type 48
> [   11.527077] nouveau 0000:01:00.0: DRM: unknown connector type 48
> [   11.552051] nouveau 0000:01:00.0: [drm] Cannot find any crtc or sizes
> [   11.554239] nouveau 0000:01:00.0: [drm] Cannot find any crtc or sizes
> [   11.555822] nouveau 0000:01:00.0: [drm] Cannot find any crtc or sizes
> [   11.556054] [drm] Initialized nouveau 1.3.1 20120801 for 0000:01:00.0 on minor 1
> [   11.556060] nouveau 0000:01:00.0: DRM: Disabling PCI power management to avoid bug
> [   18.887229] nouveau 0000:01:00.0: saving config space at offset 0x0 (reading 0x1eb610de)
> [   18.887231] nouveau 0000:01:00.0: saving config space at offset 0x4 (reading 0x100407)
> [   18.887233] nouveau 0000:01:00.0: saving config space at offset 0x8 (reading 0x30000a1)
> [   18.887235] nouveau 0000:01:00.0: saving config space at offset 0xc (reading 0x800000)
> [   18.887237] nouveau 0000:01:00.0: saving config space at offset 0x10 (reading 0xcd000000)
> [   18.887239] nouveau 0000:01:00.0: saving config space at offset 0x14 (reading 0xa000000c)
> [   18.887241] nouveau 0000:01:00.0: saving config space at offset 0x18 (reading 0x0)
> [   18.887243] nouveau 0000:01:00.0: saving config space at offset 0x1c (reading 0xb000000c)
> [   18.887245] nouveau 0000:01:00.0: saving config space at offset 0x20 (reading 0x0)
> [   18.887247] nouveau 0000:01:00.0: saving config space at offset 0x24 (reading 0x2001)
> [   18.887249] nouveau 0000:01:00.0: saving config space at offset 0x28 (reading 0x0)
> [   18.887251] nouveau 0000:01:00.0: saving config space at offset 0x2c (reading 0x229b17aa)
> [   18.887253] nouveau 0000:01:00.0: saving config space at offset 0x30 (reading 0xfff80000)
> [   18.887255] nouveau 0000:01:00.0: saving config space at offset 0x34 (reading 0x60)
> [   18.887257] nouveau 0000:01:00.0: saving config space at offset 0x38 (reading 0x0)
> [   18.887259] nouveau 0000:01:00.0: saving config space at offset 0x3c (reading 0x1ff)
> [   18.887311] nouveau 0000:01:00.0: power state changed by ACPI to D3cold
> [   42.094494] nouveau 0000:01:00.0: power state changed by ACPI to D0
> [   42.094663] nouveau 0000:01:00.0: restoring config space at offset 0x3c (was 0x100, writing 0x1ff)
> [   42.094679] nouveau 0000:01:00.0: restoring config space at offset 0x30 (was 0x0, writing 0xfff80000)
> [   42.094699] nouveau 0000:01:00.0: restoring config space at offset 0x24 (was 0x1, writing 0x2001)
> [   42.094721] nouveau 0000:01:00.0: restoring config space at offset 0x1c (was 0xc, writing 0xb000000c)
> [   42.094738] nouveau 0000:01:00.0: restoring config space at offset 0x14 (was 0xc, writing 0xa000000c)
> [   42.094769] nouveau 0000:01:00.0: restoring config space at offset 0x10 (was 0x0, writing 0xcd000000)
> [   42.094792] nouveau 0000:01:00.0: restoring config space at offset 0x4 (was 0x100000, writing 0x100407)
> [   42.538785] snd_hda_intel 0000:01:00.1: bound 0000:01:00.0 (ops nv50_audio_component_bind_ops [nouveau])
>
> --
> "A mouse is a device used to point at the xterm you want to type in" - A.S.R.
>
> Home page: http://marc.merlins.org/                       | PGP 7F55D5F27AAF9D08
> _______________________________________________
> Nouveau mailing list
> Nouveau-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org
> https://lists.freedesktop.org/mailman/listinfo/nouveau
>

  parent reply	other threads:[~2021-01-04 13:28 UTC|newest]

Thread overview: 79+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2019-10-04 12:39 [PATCH v2 0/2] PCI: Add missing link delays Mika Westerberg
2019-10-04 12:39 ` [PATCH v2 1/2] PCI: Introduce pcie_wait_for_link_delay() Mika Westerberg
2020-08-08 20:22   ` Marc MERLIN
2020-08-08 20:23     ` Marc MERLIN
2020-08-09 16:31     ` Marc MERLIN
2020-09-06 18:18     ` pcieport 0000:00:01.0: PME: Spurious native interrupt (nvidia with nouveau and thunderbolt on thinkpad P73) Marc MERLIN
2020-09-06 18:18       ` Marc MERLIN
2020-09-06 18:26       ` Matthias Andree
2020-09-07 19:14       ` [Nouveau] " Karol Herbst
2020-09-07 19:14         ` Karol Herbst
2020-09-07 20:58         ` [Nouveau] " Marc MERLIN
2020-09-07 20:58           ` Marc MERLIN
2020-09-07 23:51           ` [Nouveau] " Karol Herbst
2020-09-07 23:51             ` Karol Herbst
2020-09-08  0:29             ` [Nouveau] " Marc MERLIN
2020-05-29 18:03               ` 5.5 kernel: using nouveau or something else just long enough to turn off Quadro RTX 4000 Mobile for hybrid graphics? Marc MERLIN
     [not found]                 ` <20200529180315.GA18804-xnduUnryOU1AfugRpC6u6w@public.gmane.org>
2020-05-29 18:53                   ` Ilia Mirkin
     [not found]                     ` <CAKb7Uvhw2EYo1RR-=NGgLO3CU9QTRWchcAw1injffybZbJ-zOA-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2020-05-29 19:46                       ` Marc MERLIN
     [not found]                         ` <20200529194605.GB18804-xnduUnryOU1AfugRpC6u6w@public.gmane.org>
2020-05-30 17:32                           ` Karol Herbst
2023-04-19  6:49                         ` [Nouveau] 6.1 still cannot get display on Thinkpad P73Quadro " Marc MERLIN
2023-04-21  5:46                           ` [Nouveau] 6.2 still cannot get hdmi display out on Thinkpad P73 Quadro RTX 4000 Mobile/TU104 Marc MERLIN
     [not found]                       ` <CACO55tsvY0t_z986VVoYCvxuBASdZ+rQcDtZ_dAtQR60NLmQQw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2020-05-31 18:31                         ` 5.5 kernel: using nouveau or something else just long enough to turn off Quadro RTX 4000 Mobile for hybrid graphics? Marc MERLIN
2020-12-26 11:12                 ` 5.9.11 still hanging 2mn at each boot and looping on nvidia-gpu 0000:01:00.3: PME# enabled (Quadro RTX 4000 Mobile) Marc MERLIN
2020-12-26 11:12                   ` Marc MERLIN
2020-12-27 18:28                   ` [Nouveau] " Ilia Mirkin
2020-12-27 18:28                     ` Ilia Mirkin
2021-01-27 21:33                   ` Bjorn Helgaas
2021-01-27 21:33                     ` Bjorn Helgaas
2021-01-28 20:59                     ` Bjorn Helgaas
2021-01-28 20:59                       ` [Nouveau] " Bjorn Helgaas
2021-01-29  0:56                     ` Marc MERLIN
2021-01-29  0:56                       ` [Nouveau] " Marc MERLIN
2021-01-29 21:20                       ` Bjorn Helgaas
2021-01-29 21:20                         ` [Nouveau] " Bjorn Helgaas
2021-01-30  2:04                         ` Marc MERLIN
2021-01-30  2:04                           ` [Nouveau] " Marc MERLIN
2021-05-05 21:42                           ` [Nouveau] 5.12.1 0010:nvkm_falcon_v1_wait_for_halt+0x8f/0xb9 [nouveau] Marc MERLIN
2021-05-06 14:50                             ` Bjorn Helgaas
2021-05-25  3:13                               ` Ben Skeggs
2020-12-29 15:51                 ` 5.9.11 still hanging 2mn at each boot and looping on nvidia-gpu 0000:01:00.3: PME# enabled (Quadro RTX 4000 Mobile) Marc MERLIN
2020-12-29 15:51                   ` Marc MERLIN
2020-12-29 16:33                   ` Ilia Mirkin
2020-12-29 16:33                     ` Ilia Mirkin
     [not found]                     ` <CAKb7UviFP_YVxC4PO7MDNnw6NDrD=3BCGF37umwAfaimjbX9Pw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2020-12-29 17:47                       ` Marc MERLIN
     [not found]                         ` <20201229174750.GI23389-xnduUnryOU1AfugRpC6u6w@public.gmane.org>
2021-01-04 11:49                           ` Marc MERLIN
     [not found]                             ` <20210104114955.GM32533-xnduUnryOU1AfugRpC6u6w@public.gmane.org>
2021-01-04 13:28                               ` Karol Herbst [this message]
     [not found]                                 ` <CACO55tsdG37YKv7FV2er4hRnXk9vmwMbPuPptA+=ZtziWXC2+g-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2021-01-07 11:49                                   ` Marc MERLIN
2020-12-30 12:16                       ` ael
2020-09-13 20:15               ` [Nouveau] pcieport 0000:00:01.0: PME: Spurious native interrupt (nvidia with nouveau and thunderbolt on thinkpad P73) Marc MERLIN
2020-09-13 20:15                 ` Marc MERLIN
     [not found]                 ` <20200913201545.GL2622-xnduUnryOU1AfugRpC6u6w@public.gmane.org>
2020-09-19 23:18                   ` Marc MERLIN
2019-10-04 12:39 ` [PATCH v2 2/2] PCI: Add missing link delays required by the PCIe spec Mika Westerberg
2019-10-26 14:19   ` Bjorn Helgaas
2019-10-28 11:28     ` Mika Westerberg
2019-10-28 13:42       ` Bjorn Helgaas
2019-10-28 18:06         ` Mika Westerberg
2019-10-28 20:16           ` Bjorn Helgaas
2019-10-29 11:15             ` Mika Westerberg
2019-10-29 20:27               ` Bjorn Helgaas
2019-10-30 11:15                 ` Mika Westerberg
2019-10-31 22:31                   ` Bjorn Helgaas
2019-11-01 11:19                     ` Mika Westerberg
2019-11-05  0:00                       ` Bjorn Helgaas
2019-11-05  9:54                         ` Mika Westerberg
2019-11-05 12:58                           ` Mika Westerberg
2019-11-05 20:01                             ` Bjorn Helgaas
2019-11-06 13:31                               ` Mika Westerberg
2019-11-05 15:00                           ` Bjorn Helgaas
2019-11-05 15:28                             ` Mika Westerberg
2019-11-05 16:10                               ` Bjorn Helgaas
2019-11-06 13:29                                 ` Mika Westerberg
2019-10-29 20:54   ` Bjorn Helgaas
2019-10-30 11:33     ` Mika Westerberg
2019-10-04 12:57 ` [PATCH v2 0/2] PCI: Add missing link delays Matthias Andree
2019-10-04 13:06   ` Mika Westerberg
2019-10-05  7:34     ` Matthias Andree
2019-10-07  9:32       ` Mika Westerberg
2019-10-07 15:15         ` Matthias Andree
2019-10-08  9:05           ` Mika Westerberg

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='CACO55tsdG37YKv7FV2er4hRnXk9vmwMbPuPptA+=ZtziWXC2+g@mail.gmail.com' \
    --to=kherbst-h+wxahxf7alqt0dzr+alfa@public.gmane.org \
    --cc=marc_nouveau-xnduUnryOU1AfugRpC6u6w@public.gmane.org \
    --cc=nouveau-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.