From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-11.4 required=3.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, INCLUDES_PATCH,MAILING_LIST_MULTI,NICE_REPLY_A,SIGNED_OFF_BY,SPF_HELO_NONE, SPF_PASS,URIBL_BLOCKED,USER_AGENT_SANE_1 autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 1DD2BC4363A for ; Tue, 20 Oct 2020 13:34:21 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id A930122247 for ; Tue, 20 Oct 2020 13:34:20 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=pass (2048-bit key) header.d=nvidia.com header.i=@nvidia.com header.b="biLwzaOQ" Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2407490AbgJTNeU (ORCPT ); Tue, 20 Oct 2020 09:34:20 -0400 Received: from hqnvemgate24.nvidia.com ([216.228.121.143]:14091 "EHLO hqnvemgate24.nvidia.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2407488AbgJTNeU (ORCPT ); Tue, 20 Oct 2020 09:34:20 -0400 Received: from hqmail.nvidia.com (Not Verified[216.228.121.13]) by hqnvemgate24.nvidia.com (using TLS: TLSv1.2, AES256-SHA) id ; Tue, 20 Oct 2020 06:32:46 -0700 Received: from [10.40.203.32] (10.124.1.5) by HQMAIL107.nvidia.com (172.20.187.13) with Microsoft SMTP Server (TLS) id 15.0.1473.3; Tue, 20 Oct 2020 13:34:10 +0000 Subject: Re: [PATCH] PCI: dwc: Use ATU regions to map memory regions To: Lorenzo Pieralisi CC: , , , , , , , , , , , References: <20201005121351.32516-1-vidyas@nvidia.com> <20201020132037.GB25784@e121166-lin.cambridge.arm.com> From: Vidya Sagar Message-ID: <85e0d8f8-9dfe-b71b-f039-96eaf8f4c350@nvidia.com> Date: Tue, 20 Oct 2020 19:03:59 +0530 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.2.1 MIME-Version: 1.0 In-Reply-To: <20201020132037.GB25784@e121166-lin.cambridge.arm.com> Content-Type: text/plain; charset="utf-8"; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.124.1.5] X-ClientProxiedBy: HQMAIL101.nvidia.com (172.20.187.10) To HQMAIL107.nvidia.com (172.20.187.13) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nvidia.com; s=n1; t=1603200766; bh=XbjLOg79CmuDWc3lI/dtwF7N4cCKVsmRoy588r0C9b8=; h=Subject:To:CC:References:From:Message-ID:Date:User-Agent: MIME-Version:In-Reply-To:Content-Type:Content-Language: Content-Transfer-Encoding:X-Originating-IP:X-ClientProxiedBy; b=biLwzaOQjMtXV08V8H5XnF6vEDAAtr7jkdqr+pSW/7FbTq3wiy87JK7X10ICJlPTM h5/VzUaI5fuou8D5HzpVNWYh8pj9Z0fPRjJV3LDCsZnHJjkbxKWcNQ1EBYNw8f2/lY pAK/zkH4F1Cs6zKR8TZng+n4fBOlVE68L5ODHNUsE5ud0GHLApzKfLPLbB64AkX8iH bR1C1pTC3j/MMBR7HAuqeeXcxWyQ5coS2ocnBUVckXudGJOcdJ4Pf1PLcsjWDkMKWD cilnlxHhYQzhhZ7clEX5kwLE0br/VhZ/z91HMH1A88LIQcqPwnRJtMZL8gk0PHcj8j 6s26MW2ErP1EA== Precedence: bulk List-ID: X-Mailing-List: linux-pci@vger.kernel.org On 10/20/2020 6:50 PM, Lorenzo Pieralisi wrote: > External email: Use caution opening links or attachments > > > On Mon, Oct 19, 2020 at 11:21:54AM +0530, Vidya Sagar wrote: >> Hi Lorenzo, Rob, Gustavo, >> Could you please review this change? > > Next cycle - we are in the middle of the merge window and I am not > queueing any more patches. Thanks for the update. FWIW, PCIe is broken on Tegra194 with Rob's patches (which got accepted already) and without the current patch. Thanks, Vidya Sagar > > Thanks, > Lorenzo > >> Thanks, >> Vidya Sagar >> >> On 10/5/2020 5:43 PM, Vidya Sagar wrote: >>> Use ATU region-3 and region-0 to setup mapping for prefetchable and >>> non-prefetchable memory regions respectively only if their respective CPU >>> and bus addresses are different. >>> >>> Signed-off-by: Vidya Sagar >>> --- >>> .../pci/controller/dwc/pcie-designware-host.c | 44 ++++++++++++++++--- >>> drivers/pci/controller/dwc/pcie-designware.c | 12 ++--- >>> drivers/pci/controller/dwc/pcie-designware.h | 4 +- >>> 3 files changed, 48 insertions(+), 12 deletions(-) >>> >>> diff --git a/drivers/pci/controller/dwc/pcie-designware-host.c b/drivers/pci/controller/dwc/pcie-designware-host.c >>> index 317ff512f8df..cefde8e813e9 100644 >>> --- a/drivers/pci/controller/dwc/pcie-designware-host.c >>> +++ b/drivers/pci/controller/dwc/pcie-designware-host.c >>> @@ -515,9 +515,40 @@ static struct pci_ops dw_pcie_ops = { >>> .write = pci_generic_config_write, >>> }; >>> +static void dw_pcie_setup_mem_atu(struct pcie_port *pp, >>> + struct resource_entry *win) >>> +{ >>> + struct dw_pcie *pci = to_dw_pcie_from_pp(pp); >>> + >>> + if (win->res->flags & IORESOURCE_PREFETCH && pci->num_viewport >= 4 && >>> + win->offset) { >>> + dw_pcie_prog_outbound_atu(pci, >>> + PCIE_ATU_REGION_INDEX3, >>> + PCIE_ATU_TYPE_MEM, >>> + win->res->start, >>> + win->res->start - win->offset, >>> + resource_size(win->res)); >>> + } else if (win->res->flags & IORESOURCE_PREFETCH && >>> + pci->num_viewport < 4) { >>> + dev_warn(pci->dev, >>> + "Insufficient ATU regions to map Prefetchable memory\n"); >>> + } else if (win->offset) { >>> + if (upper_32_bits(resource_size(win->res))) >>> + dev_warn(pci->dev, >>> + "Memory resource size exceeds max for 32 bits\n"); >>> + dw_pcie_prog_outbound_atu(pci, >>> + PCIE_ATU_REGION_INDEX0, >>> + PCIE_ATU_TYPE_MEM, >>> + win->res->start, >>> + win->res->start - win->offset, >>> + resource_size(win->res)); >>> + } >>> +} >>> + >>> void dw_pcie_setup_rc(struct pcie_port *pp) >>> { >>> u32 val, ctrl, num_ctrls; >>> + struct resource_entry *win; >>> struct dw_pcie *pci = to_dw_pcie_from_pp(pp); >>> /* >>> @@ -572,13 +603,14 @@ void dw_pcie_setup_rc(struct pcie_port *pp) >>> * ATU, so we should not program the ATU here. >>> */ >>> if (pp->bridge->child_ops == &dw_child_pcie_ops) { >>> - struct resource_entry *entry = >>> - resource_list_first_type(&pp->bridge->windows, IORESOURCE_MEM); >>> + resource_list_for_each_entry(win, &pp->bridge->windows) { >>> + switch (resource_type(win->res)) { >>> + case IORESOURCE_MEM: >>> + dw_pcie_setup_mem_atu(pp, win); >>> + break; >>> + } >>> + } >>> - dw_pcie_prog_outbound_atu(pci, PCIE_ATU_REGION_INDEX0, >>> - PCIE_ATU_TYPE_MEM, entry->res->start, >>> - entry->res->start - entry->offset, >>> - resource_size(entry->res)); >>> if (pci->num_viewport > 2) >>> dw_pcie_prog_outbound_atu(pci, PCIE_ATU_REGION_INDEX2, >>> PCIE_ATU_TYPE_IO, pp->io_base, >>> diff --git a/drivers/pci/controller/dwc/pcie-designware.c b/drivers/pci/controller/dwc/pcie-designware.c >>> index 3c1f17c78241..6033689abb15 100644 >>> --- a/drivers/pci/controller/dwc/pcie-designware.c >>> +++ b/drivers/pci/controller/dwc/pcie-designware.c >>> @@ -227,7 +227,7 @@ static void dw_pcie_writel_ob_unroll(struct dw_pcie *pci, u32 index, u32 reg, >>> static void dw_pcie_prog_outbound_atu_unroll(struct dw_pcie *pci, u8 func_no, >>> int index, int type, >>> u64 cpu_addr, u64 pci_addr, >>> - u32 size) >>> + u64 size) >>> { >>> u32 retries, val; >>> u64 limit_addr = cpu_addr + size - 1; >>> @@ -244,8 +244,10 @@ static void dw_pcie_prog_outbound_atu_unroll(struct dw_pcie *pci, u8 func_no, >>> lower_32_bits(pci_addr)); >>> dw_pcie_writel_ob_unroll(pci, index, PCIE_ATU_UNR_UPPER_TARGET, >>> upper_32_bits(pci_addr)); >>> - dw_pcie_writel_ob_unroll(pci, index, PCIE_ATU_UNR_REGION_CTRL1, >>> - type | PCIE_ATU_FUNC_NUM(func_no)); >>> + val = type | PCIE_ATU_FUNC_NUM(func_no); >>> + val = upper_32_bits(size - 1) ? >>> + val | PCIE_ATU_INCREASE_REGION_SIZE : val; >>> + dw_pcie_writel_ob_unroll(pci, index, PCIE_ATU_UNR_REGION_CTRL1, val); >>> dw_pcie_writel_ob_unroll(pci, index, PCIE_ATU_UNR_REGION_CTRL2, >>> PCIE_ATU_ENABLE); >>> @@ -266,7 +268,7 @@ static void dw_pcie_prog_outbound_atu_unroll(struct dw_pcie *pci, u8 func_no, >>> static void __dw_pcie_prog_outbound_atu(struct dw_pcie *pci, u8 func_no, >>> int index, int type, u64 cpu_addr, >>> - u64 pci_addr, u32 size) >>> + u64 pci_addr, u64 size) >>> { >>> u32 retries, val; >>> @@ -310,7 +312,7 @@ static void __dw_pcie_prog_outbound_atu(struct dw_pcie *pci, u8 func_no, >>> } >>> void dw_pcie_prog_outbound_atu(struct dw_pcie *pci, int index, int type, >>> - u64 cpu_addr, u64 pci_addr, u32 size) >>> + u64 cpu_addr, u64 pci_addr, u64 size) >>> { >>> __dw_pcie_prog_outbound_atu(pci, 0, index, type, >>> cpu_addr, pci_addr, size); >>> diff --git a/drivers/pci/controller/dwc/pcie-designware.h b/drivers/pci/controller/dwc/pcie-designware.h >>> index 97c7063b9e89..b81a1813cf9e 100644 >>> --- a/drivers/pci/controller/dwc/pcie-designware.h >>> +++ b/drivers/pci/controller/dwc/pcie-designware.h >>> @@ -80,10 +80,12 @@ >>> #define PCIE_ATU_VIEWPORT 0x900 >>> #define PCIE_ATU_REGION_INBOUND BIT(31) >>> #define PCIE_ATU_REGION_OUTBOUND 0 >>> +#define PCIE_ATU_REGION_INDEX3 0x3 >>> #define PCIE_ATU_REGION_INDEX2 0x2 >>> #define PCIE_ATU_REGION_INDEX1 0x1 >>> #define PCIE_ATU_REGION_INDEX0 0x0 >>> #define PCIE_ATU_CR1 0x904 >>> +#define PCIE_ATU_INCREASE_REGION_SIZE BIT(13) >>> #define PCIE_ATU_TYPE_MEM 0x0 >>> #define PCIE_ATU_TYPE_IO 0x2 >>> #define PCIE_ATU_TYPE_CFG0 0x4 >>> @@ -295,7 +297,7 @@ void dw_pcie_upconfig_setup(struct dw_pcie *pci); >>> int dw_pcie_wait_for_link(struct dw_pcie *pci); >>> void dw_pcie_prog_outbound_atu(struct dw_pcie *pci, int index, >>> int type, u64 cpu_addr, u64 pci_addr, >>> - u32 size); >>> + u64 size); >>> void dw_pcie_prog_ep_outbound_atu(struct dw_pcie *pci, u8 func_no, int index, >>> int type, u64 cpu_addr, u64 pci_addr, >>> u32 size); >>>