From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id EFFF8C433F5 for ; Thu, 12 May 2022 19:15:48 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1357937AbiELTPs (ORCPT ); Thu, 12 May 2022 15:15:48 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:34216 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1350207AbiELTPq (ORCPT ); Thu, 12 May 2022 15:15:46 -0400 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id A5533E9A for ; Thu, 12 May 2022 12:15:41 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1652382940; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=GsFPpBjvOchGxGH2M6tCpUwhyUvV0P5EUx9hZuUOXzU=; b=eV0WkW1KBEHU04pUn6rx4moHJ444QV9qLS89iiYuXYjIPPXKfHcp8RzxjqfZ9MwZCynRp2 6jSBaFd3r++EaLWptbp8aIrbftVO2rsyfRUIjsGAGllYkfCT0JR2eNrj0dTS8dylxYIXGX CJIp9s4nH+6GmBiIoezW4nq8PD2TBok= Received: from mail-ed1-f70.google.com (mail-ed1-f70.google.com [209.85.208.70]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-50-Ae6cWiKEOHSitkO5nDJnag-1; Thu, 12 May 2022 15:15:39 -0400 X-MC-Unique: Ae6cWiKEOHSitkO5nDJnag-1 Received: by mail-ed1-f70.google.com with SMTP id r26-20020a50aada000000b00425afa72622so3572281edc.19 for ; Thu, 12 May 2022 12:15:39 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:message-id:date:mime-version:user-agent:subject :content-language:to:cc:references:from:in-reply-to :content-transfer-encoding; bh=GsFPpBjvOchGxGH2M6tCpUwhyUvV0P5EUx9hZuUOXzU=; b=3jRSWwPvz0uXkzcvAZkO4tu9OUXkbFH0y9Pnoge1ZxR1iIjbOIvRFLz1+LNE2WKSGQ 92k1qZnhtgpRJdLjKiy+I8j372fBEn3TGkNOkqfiCcNne2jej2BVc72R3b6VDE/DKE2e jGkdo4yPlThN8mhgrWRJjfljGiLa8Q6TPikL09ksXJ8a3WkTVBflmxM5TjoYZfNftitV RNeZ3SgzqHHeIf9C+dDE31L8UB9PBJoeGwIhp6NK8uuRtP69nkgo5upTjTdIm8ugZTe2 KC74ZfyHEo+Sf0meSbz9Px/59nbodVnya6rlsiFdZ3svjBVYBprzMtqlItfTnr0/ymHD gXYw== X-Gm-Message-State: AOAM533ckgnHECGo2GbP81CMmCzuD3C8UGKmuy55XLALLa6Isnr23/kr DS3pnypaVPH/6QbhFu+2xU3IMmz9dqNYjM9/TEBoQr5lJfRZoyXjmXxAZKR/X8gStwCwFj7XJrb UoTM5lT3u50LVLMNqpiRWmw== X-Received: by 2002:a17:907:7f8a:b0:6f4:d25a:7fb6 with SMTP id qk10-20020a1709077f8a00b006f4d25a7fb6mr1280746ejc.586.1652382937987; Thu, 12 May 2022 12:15:37 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxWWuJ77/g8wNH5jCXisGSvRMlZKofBgT4P2xBEPWE3rP76jSaWcycc3GWd4/jsuExS0ViFjA== X-Received: by 2002:a17:907:7f8a:b0:6f4:d25a:7fb6 with SMTP id qk10-20020a1709077f8a00b006f4d25a7fb6mr1280718ejc.586.1652382937636; Thu, 12 May 2022 12:15:37 -0700 (PDT) Received: from ?IPV6:2001:1c00:c1e:bf00:d69d:5353:dba5:ee81? (2001-1c00-0c1e-bf00-d69d-5353-dba5-ee81.cable.dynamic.v6.ziggo.nl. [2001:1c00:c1e:bf00:d69d:5353:dba5:ee81]) by smtp.gmail.com with ESMTPSA id gk8-20020a17090790c800b006f3ef214db3sm54293ejb.25.2022.05.12.12.15.36 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Thu, 12 May 2022 12:15:37 -0700 (PDT) Message-ID: Date: Thu, 12 May 2022 21:15:36 +0200 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.8.0 Subject: Re: [PATCH v7 1/1] x86/PCI: Ignore E820 reservations for bridge windows on newer systems Content-Language: en-US To: Bjorn Helgaas Cc: "Rafael J . Wysocki" , Mika Westerberg , =?UTF-8?Q?Krzysztof_Wilczy=c5=84ski?= , Bjorn Helgaas , Myron Stowe , Juha-Pekka Heikkila , Thomas Gleixner , Ingo Molnar , Borislav Petkov , "H . Peter Anvin" , =?UTF-8?Q?Benoit_Gr=c3=a9goire?= , Hui Wang , linux-acpi@vger.kernel.org, linux-pci@vger.kernel.org, x86@kernel.org, linux-kernel@vger.kernel.org References: <20220509193629.GA611897@bhelgaas> From: Hans de Goede In-Reply-To: <20220509193629.GA611897@bhelgaas> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-acpi@vger.kernel.org Hi, On 5/9/22 21:36, Bjorn Helgaas wrote: > On Mon, May 09, 2022 at 07:33:27PM +0200, Hans de Goede wrote: >> Hi Bjorn, >> >> On 5/7/22 17:31, Bjorn Helgaas wrote: >>> On Sat, May 07, 2022 at 12:09:03PM +0200, Hans de Goede wrote: >>>> Hi Bjorn, >>>> >>>> On 5/6/22 18:51, Bjorn Helgaas wrote: >>>>> On Thu, May 05, 2022 at 05:20:16PM +0200, Hans de Goede wrote: >>>>>> Some BIOS-es contain bugs where they add addresses which are already >>>>>> used in some other manner to the PCI host bridge window returned by >>>>>> the ACPI _CRS method. To avoid this Linux by default excludes >>>>>> E820 reservations when allocating addresses since 2010, see: >>>>>> commit 4dc2287c1805 ("x86: avoid E820 regions when allocating address >>>>>> space"). >>>>>> >>>>>> Recently (2019) some systems have shown-up with E820 reservations which >>>>>> cover the entire _CRS returned PCI bridge memory window, causing all >>>>>> attempts to assign memory to PCI BARs which have not been setup by the >>>>>> BIOS to fail. For example here are the relevant dmesg bits from a >>>>>> Lenovo IdeaPad 3 15IIL 81WE: >>>>>> >>>>>> [mem 0x000000004bc50000-0x00000000cfffffff] reserved >>>>>> pci_bus 0000:00: root bus resource [mem 0x65400000-0xbfffffff window] >>>>>> >>>>>> The ACPI specifications appear to allow this new behavior: >>>>>> >>>>>> The relationship between E820 and ACPI _CRS is not really very clear. >>>>>> ACPI v6.3, sec 15, table 15-374, says AddressRangeReserved means: >>>>>> >>>>>> This range of addresses is in use or reserved by the system and is >>>>>> not to be included in the allocatable memory pool of the operating >>>>>> system's memory manager. >>>>>> >>>>>> and it may be used when: >>>>>> >>>>>> The address range is in use by a memory-mapped system device. >>>>>> >>>>>> Furthermore, sec 15.2 says: >>>>>> >>>>>> Address ranges defined for baseboard memory-mapped I/O devices, such >>>>>> as APICs, are returned as reserved. >>>>>> >>>>>> A PCI host bridge qualifies as a baseboard memory-mapped I/O device, >>>>>> and its apertures are in use and certainly should not be included in >>>>>> the general allocatable pool, so the fact that some BIOS-es reports >>>>>> the PCI aperture as "reserved" in E820 doesn't seem like a BIOS bug. >>>>>> >>>>>> So it seems that the excluding of E820 reserved addresses is a mistake. >>>>>> >>>>>> Ideally Linux would fully stop excluding E820 reserved addresses, >>>>>> but then various old systems will regress. >>>>>> Instead keep the old behavior for old systems, while ignoring >>>>>> the E820 reservations for any systems from now on. >>>>>> >>>>>> Old systems are defined here as BIOS year < 2018, this was chosen to >>>>>> make sure that pci_use_e820 will not be set on the currently affected >>>>>> systems, the oldest known one is from 2019. >>>>>> >>>>>> Testing has shown that some newer systems also have a bad _CRS return. >>>>>> The pci_crs_quirks DMI table is used to keep excluding E820 reservations >>>>>> from the bridge window on these systems. >>>>>> >>>>>> Also add pci=no_e820 and pci=use_e820 options to allow overriding >>>>>> the BIOS year + DMI matching logic. >>>>>> >>>>>> BugLink: https://bugzilla.kernel.org/show_bug.cgi?id=206459 >>>>>> BugLink: https://bugzilla.redhat.com/show_bug.cgi?id=1868899 >>>>>> BugLink: https://bugzilla.redhat.com/show_bug.cgi?id=1871793 >>>>>> BugLink: https://bugs.launchpad.net/bugs/1878279 >>>>>> BugLink: https://bugs.launchpad.net/bugs/1931715 >>>>>> BugLink: https://bugs.launchpad.net/bugs/1932069 >>>>>> BugLink: https://bugs.launchpad.net/bugs/1921649 >>>>>> Cc: Benoit Grégoire >>>>>> Cc: Hui Wang >>>>>> Signed-off-by: Hans de Goede >>>>> >>>>>> + * Ideally Linux would fully stop using E820 reservations, but then >>>>>> + * various old systems will regress. Instead keep the old behavior for >>>>>> + * old systems + known to be broken newer systems in pci_crs_quirks. >>>>>> + */ >>>>>> + if (year >= 0 && year < 2018) >>>>>> + pci_use_e820 = true; >>>>> >>>>> How did you pick 2018? Prior to this patch, we used E820 reservations >>>>> for all machines. This patch would change that for 2019-2022 >>>>> machines, so there's a risk of breaking some of them. >>>> >>>> Correct. I picked 2018 because the first devices where using E820 >>>> reservations are causing issues (i2c controller not getting resources >>>> leading to non working touchpad / thunderbolt hotplug issues) have >>>> BIOS dates starting in 2019. I added a year margin, so we could make >>>> this 2019. >>>> >>>>> I'm hesitant about changing the behavior for machines already in the >>>>> field because if they were tested at all with Linux, it was without >>>>> this patch. So I would lean toward preserving the current behavior >>>>> for BIOS year < 2023. >>>> >>>> I see, I presume the idea is to then use DMI to disable E820 clipping >>>> on current devices where this is known to cause problems ? >>>> >>>> So for v8 I would: >>>> >>>> 1. Change the cut-off check to < 2023 >>>> 2. Drop the DMI quirks I added for models which are known to need E820 >>>> clipping hit by the < 2018 check >>>> 3. Add DMI quirks for models for which it is known that we must _not_ >>>> do E820 clipping >>>> >>>> Is this the direction you want to go / does that sound right? >>> >>> Yes, I think that's what we should do. All the machines in the field >>> will be unaffected, except that we add quirks for known problems. >> >> I've been working on this today. I've mostly been going through >> the all the existing bugs about this, to make a list of DMI matches >> for devices on which we should _not_ do e820 clipping to fix th >> kernel being unable to assign BARs there. >> >> I've found an interesting pattern there, all affected devices >> are Lenovo devices with "IIL" in there device name, e.g. : >> "IdeaPad 3 15IIL05". I've looked up all Lenovo devices which >> have "IIL" as part of their DMI_PRODUCT_VERSION string here: >> https://github.com/linuxhw/DMI/ >> >> And then looked them up at https://linux-hardware.org/ and checked >> their dmesg to see if they have the e820 problem other ideapads >> have. I've gone through approx. half the list now and all >> except one model seem to have the e820 problem. >> >> So it looks like we might be able to match all problem models >> with a single DMI match. > > That sounds reasonable. I assume that if we skip the clipping for > every platform that matches "IIL", we can also add exceptions for the > inevitable "IIL" platforms that do need the clip? Yes we can add a more specific match higher up in the pci_crs_quirks[] array and then use a callback which returns non 0 to make dmi_check_system() abort checking the rest of the array. > E.g., specific > entries at the end of the list that override the previous generic > match? > >> So the problem seems to be limited to one specific device >> series / range and this is making me have second thoughts >> about doing a date based cut-off at all. Trying to switch >> over any models which are new in 2023 is fine, the problem >> with a DMI BIOS date approach though is that as soon as some >> new management-engine CVE comes out we will also see BIOS >> updates with a year of 2023 for many existing models, of >> up to 3-4 years old at least; and chances are that some of >> those older models getting BIOS updates will be bitten by >> this change. > > That's a good point and sounds fairly painful when that happens, > but I don't see a nice way out of this. > >> So as said I'm having second thoughts about the date based >> approach. Bjorn, what do you think of just using DMI quirks >> to disable e820 clipping on known problematic models and >> otherwise keeping things as is ? > > I think we need a long-term strategy that can be clearly expressed > in a sentence or two and is consistent with the ACPI and PCI specs, > and I don't think the current strategy is it. Clipping with E820 > regions happened to work for some machines, but there's no reason to > think it will work in general. Ok, so what I'm reading between the lines here is that despite the concerns which I've voiced you want to continue with disabling e820 clipping by default for machines with a DMI_BIOS_DATE year of 2023 or newer. Which is fine by me I just wanted to get my concerns out there. I'm almost done prepping a v8 now. I had to do some other stuff and spend a lot of time checking dmesg output for all the Lenovo *IIL* models. For v8 I've also added a quirk for the Acer model you pointed me to in another email. Regards, Hans