dri-devel.lists.freedesktop.org archive mirror
From: Karol Herbst <kherbst@redhat.com>
To: LKML <linux-kernel@vger.kernel.org>
Cc: Linux PM <linux-pm@vger.kernel.org>,
	Linux PCI <linux-pci@vger.kernel.org>,
	Mika Westerberg <mika.westerberg@intel.com>,
	"Rafael J . Wysocki" <rjw@rjwysocki.net>,
	dri-devel <dri-devel@lists.freedesktop.org>,
	nouveau <nouveau@lists.freedesktop.org>,
	Bjorn Helgaas <bhelgaas@google.com>
Subject: Re: [PATCH v4] pci: prevent putting nvidia GPUs into lower device states on certain intel bridges
Date: Thu, 14 Nov 2019 20:17:30 +0100
Message-ID: <CACO55tuMvHtPSHmU_G_0f5P6O3Ao0OqVMDPvaaRCYrMSd29NMQ@mail.gmail.com>
Message-ID: <20191114191730.ITVFLQ4t3lkQrp3qgO3tmaYt6k84gSDq4YL_1C1_ehk@z>
In-Reply-To: <20191017121901.13699-1-kherbst@redhat.com>

ping on the patch.

I wasn't able to verify this issue on any other bridge controller, so
it really might be only this one.

On Thu, Oct 17, 2019 at 2:19 PM Karol Herbst <kherbst@redhat.com> wrote:
>
> Fixes state transitions of Nvidia Pascal GPUs from D3cold into higher device
> states.
>
> v2: convert to pci_dev quirk
>     put a proper technical explanation of the issue as an in-code comment
> v3: disable it only for certain combinations of intel and nvidia hardware
> v4: simplify quirk by setting flag on the GPU itself
>
> Signed-off-by: Karol Herbst <kherbst@redhat.com>
> Cc: Bjorn Helgaas <bhelgaas@google.com>
> Cc: Lyude Paul <lyude@redhat.com>
> Cc: Rafael J. Wysocki <rjw@rjwysocki.net>
> Cc: Mika Westerberg <mika.westerberg@intel.com>
> Cc: linux-pci@vger.kernel.org
> Cc: linux-pm@vger.kernel.org
> Cc: dri-devel@lists.freedesktop.org
> Cc: nouveau@lists.freedesktop.org
> ---
>  drivers/pci/pci.c    |  7 ++++++
>  drivers/pci/quirks.c | 53 ++++++++++++++++++++++++++++++++++++++++++++
>  include/linux/pci.h  |  1 +
>  3 files changed, 61 insertions(+)
>
> diff --git a/drivers/pci/pci.c b/drivers/pci/pci.c
> index b97d9e10c9cc..02e71e0bcdd7 100644
> --- a/drivers/pci/pci.c
> +++ b/drivers/pci/pci.c
> @@ -850,6 +850,13 @@ static int pci_raw_set_power_state(struct pci_dev *dev, pci_power_t state)
>            || (state == PCI_D2 && !dev->d2_support))
>                 return -EIO;
>
> +       /*
> +        * check if we have a bad combination of bridge controller and nvidia
> +        * GPU, see quirk_broken_nv_runpm for more info
> +        */
> +       if (state != PCI_D0 && dev->broken_nv_runpm)
> +               return 0;
> +
>         pci_read_config_word(dev, dev->pm_cap + PCI_PM_CTRL, &pmcsr);
>
>         /*
> diff --git a/drivers/pci/quirks.c b/drivers/pci/quirks.c
> index 44c4ae1abd00..0006c9e37b6f 100644
> --- a/drivers/pci/quirks.c
> +++ b/drivers/pci/quirks.c
> @@ -5268,3 +5268,56 @@ static void quirk_reset_lenovo_thinkpad_p50_nvgpu(struct pci_dev *pdev)
>  DECLARE_PCI_FIXUP_CLASS_FINAL(PCI_VENDOR_ID_NVIDIA, 0x13b1,
>                               PCI_CLASS_DISPLAY_VGA, 8,
>                               quirk_reset_lenovo_thinkpad_p50_nvgpu);
> +
> +/*
> + * Some Intel PCIe bridges cause devices to disappear from the PCIe bus after
> + * being put into D3cold, if they were first put into a non-D0 PCI PM device
> + * state.
> + *
> + * This leads to several issues which manifest differently but share the same
> + * root cause:
> + *  - AML code execution hits an infinite loop (as the code waits on device
> + *    memory to change).
> + *  - kernel crashes, as all PCI reads return -1, which most code isn't able
> + *    to handle well enough.
> + *  - sudden shutdowns, as the kernel identifies an unrecoverable error after
> + *    userspace tries to access the GPU.
> + *
> + * In all cases dmesg will contain at least one line like this:
> + * 'nouveau 0000:01:00.0: Refused to change power state, currently in D3'
> + * followed by a lot of nouveau timeouts.
> + *
> + * ACPI code writes bit 0x80 to the undocumented PCI register 0x248 of the
> + * PCIe bridge controller in order to power down the GPU.
> + * Nonetheless, there are other code paths inside the ACPI firmware which use
> + * other registers, which seem to work fine:
> + *  - 0xbc bit 0x20 (publicly available documentation claims 'reserved')
> + *  - 0xb0 bit 0x10 (link disable)
> + * Changing the conditions inside the firmware by poking into the relevant
> + * addresses does resolve the issue, but it seemed to be ACPI private memory
> + * and not any device accessible memory at all, so there is no portable way of
> + * changing the conditions.
> + *
> + * The only systems where this behavior can be seen are hybrid graphics laptops
> + * with a secondary Nvidia Pascal GPU. It is unclear whether this issue only
> + * occurs with the combination of the listed Intel PCIe bridge controller and
> + * the mentioned GPUs, or whether it is purely a hw bug in the bridge controller.
> + *
> + * But because this issue was NOT seen on laptops with an Nvidia Pascal GPU
> + * and an Intel Coffee Lake SoC, there is a higher chance of there being a bug
> + * in the bridge controller rather than in the GPU.
> + *
> + * This issue could not be reproduced on non-laptop systems.
> + */
> +
> +static void quirk_broken_nv_runpm(struct pci_dev *dev)
> +{
> +       struct pci_dev *bridge = pci_upstream_bridge(dev);
> +
> +       if (bridge->vendor == PCI_VENDOR_ID_INTEL &&
> +           bridge->device == 0x1901)
> +               dev->broken_nv_runpm = 1;
> +}
> +DECLARE_PCI_FIXUP_CLASS_FINAL(PCI_VENDOR_ID_NVIDIA, PCI_ANY_ID,
> +                             PCI_BASE_CLASS_DISPLAY, 16,
> +                             quirk_broken_nv_runpm);
> diff --git a/include/linux/pci.h b/include/linux/pci.h
> index ac8a6c4e1792..903a0b3a39ec 100644
> --- a/include/linux/pci.h
> +++ b/include/linux/pci.h
> @@ -416,6 +416,7 @@ struct pci_dev {
>         unsigned int    __aer_firmware_first_valid:1;
>         unsigned int    __aer_firmware_first:1;
>         unsigned int    broken_intx_masking:1;  /* INTx masking can't be used */
> +       unsigned int    broken_nv_runpm:1;      /* some combinations of intel bridge controller and nvidia GPUs break rtd3 */
>         unsigned int    io_window_1k:1;         /* Intel bridge 1K I/O windows */
>         unsigned int    irq_managed:1;
>         unsigned int    has_secondary_link:1;
> --
> 2.21.0
>
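
For readers who want to follow the control flow without a kernel tree at
hand, here is a minimal user-space sketch of what the quoted patch does. It
is not the kernel code: struct model_dev, model_quirk() and
model_set_power_state() are hypothetical stand-ins for struct pci_dev,
quirk_broken_nv_runpm() and pci_raw_set_power_state(). Only the vendor IDs
and the 0x1901 bridge ID come from real hardware and the patch. The sketch
also checks the upstream bridge for NULL, which the quoted quirk does not
do; as far as I can tell pci_upstream_bridge() returns NULL for a device
that sits directly on a root bus, so that check is an assumption about
what a robust version would want.

```c
#include <stdbool.h>
#include <stddef.h>

#define VENDOR_INTEL   0x8086
#define VENDOR_NVIDIA  0x10de
#define BRIDGE_DEVICE  0x1901   /* bridge device ID matched by the patch */

/* Simplified PCI PM device states; D3cold is not modelled here. */
enum power_state { D0, D1, D2, D3HOT };

/* Hypothetical stand-in for the few struct pci_dev fields that matter. */
struct model_dev {
    unsigned short vendor;
    unsigned short device;
    struct model_dev *upstream;      /* NULL when there is no upstream bridge */
    enum power_state current_state;
    bool broken_nv_runpm;            /* mirrors the new pci_dev flag */
};

/* Mirrors quirk_broken_nv_runpm(): flag the GPU, not the bridge. */
void model_quirk(struct model_dev *dev)
{
    struct model_dev *bridge = dev->upstream;

    /* The NULL check is an addition of this sketch (see lead-in). */
    if (bridge && bridge->vendor == VENDOR_INTEL &&
        bridge->device == BRIDGE_DEVICE)
        dev->broken_nv_runpm = true;
}

/*
 * Mirrors the early return added to pci_raw_set_power_state(): for a
 * flagged device, any request for a non-D0 state reports success but
 * leaves the device state untouched.
 */
int model_set_power_state(struct model_dev *dev, enum power_state state)
{
    if (state != D0 && dev->broken_nv_runpm)
        return 0;

    dev->current_state = state;
    return 0;
}
```

Running model_quirk() on a GPU whose upstream is an Intel 0x1901 bridge and
then asking for D3HOT leaves current_state at D0, while the same request on
an unflagged device changes the state. Setting the flag on the GPU rather
than on the bridge is the v4 simplification mentioned in the changelog.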
