From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-7.1 required=3.0 tests=DKIMWL_WL_HIGH,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_PATCH, MAILING_LIST_MULTI,SIGNED_OFF_BY,SPF_PASS autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 0612EC10F0E for ; Mon, 15 Apr 2019 18:15:01 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id C25BC2073F for ; Mon, 15 Apr 2019 18:15:00 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=pass (2048-bit key) header.d=nvidia.com header.i=@nvidia.com header.b="nLCj02Nj" Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727691AbfDOSPA (ORCPT ); Mon, 15 Apr 2019 14:15:00 -0400 Received: from hqemgate14.nvidia.com ([216.228.121.143]:5464 "EHLO hqemgate14.nvidia.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726182AbfDOSPA (ORCPT ); Mon, 15 Apr 2019 14:15:00 -0400 Received: from hqpgpgate101.nvidia.com (Not Verified[216.228.121.13]) by hqemgate14.nvidia.com (using TLS: TLSv1.2, DES-CBC3-SHA) id ; Mon, 15 Apr 2019 11:15:03 -0700 Received: from hqmail.nvidia.com ([172.20.161.6]) by hqpgpgate101.nvidia.com (PGP Universal service); Mon, 15 Apr 2019 11:14:57 -0700 X-PGP-Universal: processed; by hqpgpgate101.nvidia.com on Mon, 15 Apr 2019 11:14:57 -0700 Received: from [10.24.70.150] (10.124.1.5) by HQMAIL101.nvidia.com (172.20.187.10) with Microsoft SMTP Server (TLS) id 15.0.1473.3; Mon, 15 Apr 2019 18:14:47 +0000 Subject: Re: [PATCH 02/30] PCI: tegra: Fix PCIe host power up sequence To: Thierry Reding CC: , , , , , , , , References: <20190411170355.6882-1-mmaddireddy@nvidia.com> <20190411170355.6882-3-mmaddireddy@nvidia.com> <20190415110136.GC29254@ulmo> <63ed1e7e-6630-bdba-3acd-c4372e83b259@nvidia.com> <20190415143019.GE29254@ulmo> X-Nvconfidentiality: public From: Manikanta Maddireddy Message-ID: <33f945e1-ca97-b8ab-4687-62c5a0e4f45c@nvidia.com> Date: Mon, 15 Apr 2019 23:44:16 +0530 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.6.1 MIME-Version: 1.0 In-Reply-To: <20190415143019.GE29254@ulmo> X-Originating-IP: [10.124.1.5] X-ClientProxiedBy: HQMAIL105.nvidia.com (172.20.187.12) To HQMAIL101.nvidia.com (172.20.187.10) Content-Type: text/plain; charset="windows-1252" Content-Transfer-Encoding: quoted-printable Content-Language: en-US DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nvidia.com; s=n1; t=1555352103; bh=kNfUEbKjmDluCbgFvipIbKiMIhjGf95yidzDEPe4nsY=; h=X-PGP-Universal:Subject:To:CC:References:X-Nvconfidentiality:From: Message-ID:Date:User-Agent:MIME-Version:In-Reply-To: X-Originating-IP:X-ClientProxiedBy:Content-Type: Content-Transfer-Encoding:Content-Language; b=nLCj02Njtu6FFio3BQiGo8mN+K8f7Ka0E3FKViQcMATK76Xp43VGEp2OVkmRQ5fSn LVJSEtKJ9cv9kUZBbryXpCGHgiPwFPuCyqnPOBH2Zz5vSmso4Wk0ZhZSekSc0ReHwZ PkX6+/t6XfSYWBhb46tc4f78KhVLRbqAHNtTrjw5SqcsxWELG5M3OS4lGq/pPM+LKk EbBWAryYDTM5QT7CCdU1/diWoF5owzrUM8PL5CslRyeXt5zYTVcDTKTfiaVc7M5Jj7 phsBVWMy2gEwgnkjghukQOc+vt18LeAE1bL6vJ6qwrLMjMdJ3TNj2MtOCRMEvLw1H5 8wJp2n04T8Nig== Sender: linux-pci-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-pci@vger.kernel.org On 15-Apr-19 8:00 PM, Thierry Reding wrote: > On Mon, Apr 15, 2019 at 07:41:21PM +0530, Manikanta Maddireddy wrote: >> On 15-Apr-19 4:31 PM, Thierry Reding wrote: >>> On Thu, Apr 11, 2019 at 10:33:27PM +0530, Manikanta Maddireddy wrote: >>>> PCIe host power up sequence involves programming AFI(AXI to FPCI bridg= e) >>>> registers first and then PCIe registers. Otherwise AFI register settin= gs >>>> may not latch to PCIe IP. >>>> >>>> PCIe root port starts LTSSM as soon as PCIe xrst is deasserted. >>>> So deassert PCIe xrst after programming PCIe registers. >>>> >>>> Modify PCIe power up sequence as follows, >>>> - Power ungate PCIe partition >>>> - Enable AFI clock >>>> - Deassert AFI reset >>>> - Program AFI registers >>>> - Enable PCIe clock >>>> - Deassert PCIe reset >>>> - Program PCIe registers >>>> - Deassert PCIe xrst to start LTSSM >>>> >>>> Signed-off-by: Manikanta Maddireddy >>>> --- >>>> drivers/pci/controller/pci-tegra.c | 73 ++++++++++++++++++-----------= - >>>> 1 file changed, 43 insertions(+), 30 deletions(-) >>>> >>>> diff --git a/drivers/pci/controller/pci-tegra.c b/drivers/pci/controll= er/pci-tegra.c >>>> index f4f53d092e00..0bf270bcea34 100644 >>>> --- a/drivers/pci/controller/pci-tegra.c >>>> +++ b/drivers/pci/controller/pci-tegra.c >>>> @@ -966,9 +966,6 @@ static int tegra_pcie_enable_controller(struct teg= ra_pcie *pcie) >>>> } >>>> } >>>> =20 >>>> - /* take the PCIe interface module out of reset */ >>>> - reset_control_deassert(pcie->pcie_xrst); >>>> - >>>> /* finally enable PCIe */ >>>> value =3D afi_readl(pcie, AFI_CONFIGURATION); >>>> value |=3D AFI_CONFIGURATION_EN_FPCI; >>>> @@ -997,8 +994,6 @@ static void tegra_pcie_disable_controller(struct t= egra_pcie *pcie) >>>> { >>>> int err; >>>> =20 >>>> - reset_control_assert(pcie->pcie_xrst); >>>> - >>>> if (pcie->soc->program_uphy) { >>>> err =3D tegra_pcie_phy_power_off(pcie); >>>> if (err < 0) >>>> @@ -1014,13 +1009,11 @@ static void tegra_pcie_power_off(struct tegra_= pcie *pcie) >>>> int err; >>>> =20 >>>> reset_control_assert(pcie->afi_rst); >>>> - reset_control_assert(pcie->pex_rst); >>>> =20 >>>> clk_disable_unprepare(pcie->pll_e); >>>> if (soc->has_cml_clk) >>>> clk_disable_unprepare(pcie->cml_clk); >>>> clk_disable_unprepare(pcie->afi_clk); >>>> - clk_disable_unprepare(pcie->pex_clk); >>>> =20 >>>> if (!dev->pm_domain) >>>> tegra_powergate_power_off(TEGRA_POWERGATE_PCIE); >>>> @@ -1036,58 +1029,59 @@ static int tegra_pcie_power_on(struct tegra_pc= ie *pcie) >>>> const struct tegra_pcie_soc *soc =3D pcie->soc; >>>> int err; >>>> =20 >>>> - reset_control_assert(pcie->pcie_xrst); >>>> - reset_control_assert(pcie->afi_rst); >>>> - reset_control_assert(pcie->pex_rst); >>>> - >>>> - if (!dev->pm_domain) >>>> - tegra_powergate_power_off(TEGRA_POWERGATE_PCIE); >>>> - >>> This code was in place to ensure that PCIe was in a known good state >>> before following the power up sequence below. You mentioned elsewhere >>> that there's a regression on Cardhu after applying this series, so >>> perhaps Cardhu relies on the above hunk? >> No, Tegra30 and Tegra20 has legacy PHY which are dependent on PEX clk an= d rst. >> PHY power on is done in tegra_pcie_enable_controller(), but in this patc= h I am >> enabling PEX clk and rst after tegra_pcie_enable_controller(). This caus= ed >> regression on Cardhu. >> I realized that sanity test is failing on Cardhu after publishing this s= eries, I will >> fix the issue in V2. >> I believe you are talking about the bootloader(uboot) which can enable >> PCIe partition and reset. To bring the PCIe into good state then we >> have to take care of clocks as well. AFAIK clock_disable() is not added >> because it maintains the refcount and any mismatch in the count >> will thrown warning. I downstream kernel I see pmc driver itself taking >> care of initial state and there after maintaining the state with refcoun= t. >> Since bootloader may or may not enable PCIe, Can we get the state fixed >> in pmc driver instead of fixing it in PCIe driver? > I don't think we can do that. The PMC driver only knows about which > clocks and resets need to be controlled as part of which power partition > if we use power domains. We don't do that on all platforms, so there is > not enough information. > > Even if we had that information, we would still not be able to force the > clock off because of the reference counting. > > Generally, though, the clock don't matter all that much for getting the > hardware into a good state. All we really care about is that it is put > into reset so that when we take it out again we start from scratch. > > If we can verify that we don't need this anymore, I'm fine with taking > it out, though. Perhaps do it in a separate patch to make it easier to > revert if it turns out to be necessary on some platform after all. > > Thierry In one of the Nvidia internal bugs, HW team confirmed that clock matter bec= ause PCIe IP has non re-settable flops. They asked me to use following sequence, Power up case: =A0- Ungate PCIe partition =A0- Enable clock =A0- Deassert reset Power down case: =A0- Assert reset =A0- Disable clock =A0- Power gate PCIe partition. I agree that this clean up should be done in new patch. So when I make new = patch set for it, I will make sure that above sequence is followed to ensure that PCIe IP is = in good state. > >>>> /* enable regulators */ >>>> err =3D regulator_bulk_enable(pcie->num_supplies, pcie->supplies); >>>> if (err < 0) >>>> dev_err(dev, "failed to enable regulators: %d\n", err); >>>> =20 >>>> - if (dev->pm_domain) { >>>> - err =3D clk_prepare_enable(pcie->pex_clk); >>>> + if (!dev->pm_domain) { >>>> + err =3D tegra_powergate_power_on(TEGRA_POWERGATE_PCIE); >>>> if (err) { >>>> - dev_err(dev, "failed to enable PEX clock: %d\n", err); >>>> - return err; >>>> + dev_err(dev, "power ungate failed: %d\n", err); >>>> + goto regulator_disable; >>>> } >>>> - reset_control_deassert(pcie->pex_rst); >>>> - } else { >>>> - err =3D tegra_powergate_sequence_power_up(TEGRA_POWERGATE_PCIE, >>>> - pcie->pex_clk, >>>> - pcie->pex_rst); >>>> + err =3D tegra_powergate_remove_clamping(TEGRA_POWERGATE_PCIE); >>>> if (err) { >>>> - dev_err(dev, "powerup sequence failed: %d\n", err); >>>> - return err; >>>> + dev_err(dev, "remove clamp failed: %d\n", err); >>>> + goto powergate; >>>> } >>>> } >>>> =20 >>>> - reset_control_deassert(pcie->afi_rst); >>>> - >>>> err =3D clk_prepare_enable(pcie->afi_clk); >>>> if (err < 0) { >>>> dev_err(dev, "failed to enable AFI clock: %d\n", err); >>>> - return err; >>>> + goto powergate; >>>> } >>>> =20 >>>> if (soc->has_cml_clk) { >>>> err =3D clk_prepare_enable(pcie->cml_clk); >>>> if (err < 0) { >>>> dev_err(dev, "failed to enable CML clock: %d\n", err); >>>> - return err; >>>> + goto afi_clk_disable; >>>> } >>>> } >>>> =20 >>>> err =3D clk_prepare_enable(pcie->pll_e); >>>> if (err < 0) { >>>> dev_err(dev, "failed to enable PLLE clock: %d\n", err); >>>> - return err; >>>> + goto cml_clk_disable; >>>> } >>>> =20 >>>> + reset_control_deassert(pcie->afi_rst); >>>> + >>>> return 0; >>>> + >>>> +cml_clk_disable: >>>> + if (soc->has_cml_clk) >>>> + clk_disable_unprepare(pcie->cml_clk); >>>> +afi_clk_disable: >>>> + clk_disable_unprepare(pcie->afi_clk); >>>> +powergate: >>>> + if (!dev->pm_domain) >>>> + tegra_powergate_power_off(TEGRA_POWERGATE_PCIE); >>>> +regulator_disable: >>>> + regulator_bulk_disable(pcie->num_supplies, pcie->supplies); >>>> + return err; >>>> } >>>> =20 >>>> static int tegra_pcie_clocks_get(struct tegra_pcie *pcie) >>>> @@ -2108,7 +2102,12 @@ static void tegra_pcie_enable_ports(struct tegr= a_pcie *pcie) >>>> port->index, port->lanes); >>>> =20 >>>> tegra_pcie_port_enable(port); >>>> + } >>>> + >>>> + /* Start LTSSM from Tegra side */ >>>> + reset_control_deassert(pcie->pcie_xrst); >>>> =20 >>>> + list_for_each_entry_safe(port, tmp, &pcie->ports, list) { >>>> if (tegra_pcie_port_check_link(port)) >>>> continue; >>>> =20 >>>> @@ -2123,6 +2122,8 @@ static void tegra_pcie_disable_ports(struct tegr= a_pcie *pcie) >>>> { >>>> struct tegra_pcie_port *port, *tmp; >>>> =20 >>>> + reset_control_assert(pcie->pcie_xrst); >>>> + >>>> list_for_each_entry_safe(port, tmp, &pcie->ports, list) >>>> tegra_pcie_port_disable(port); >>>> } >>>> @@ -2472,6 +2473,9 @@ static int __maybe_unused tegra_pcie_pm_suspend(= struct device *dev) >>>> =20 >>>> tegra_pcie_disable_ports(pcie); >>>> =20 >>>> + reset_control_assert(pcie->pex_rst); >>>> + clk_disable_unprepare(pcie->pex_clk); >>>> + >>>> if (IS_ENABLED(CONFIG_PCI_MSI)) >>>> tegra_pcie_disable_msi(pcie); >>>> =20 >>>> @@ -2501,10 +2505,19 @@ static int __maybe_unused tegra_pcie_pm_resume= (struct device *dev) >>>> if (IS_ENABLED(CONFIG_PCI_MSI)) >>>> tegra_pcie_enable_msi(pcie); >>>> =20 >>>> + err =3D clk_prepare_enable(pcie->pex_clk); >>>> + if (err) { >>>> + dev_err(dev, "failed to enable PEX clock: %d\n", err); >>>> + goto disable_controller; >>>> + } >>>> + reset_control_deassert(pcie->pex_rst); >>>> + >>>> tegra_pcie_enable_ports(pcie); >>>> =20 >>>> return 0; >>>> =20 >>>> +disable_controller: >>>> + tegra_pcie_disable_controller(pcie); >>>> poweroff: >>>> tegra_pcie_power_off(pcie); >>>> =20 >>> There's quite a bit going on in this patch in general and I find it har= d >>> to review because not all the changes seem related to what you describe= d >>> in the commit message. >>> >>> Can you perhaps try to split out the error cleanup changes into a >>> separate patch where it makes sense? It seems to me like at least for >>> tegra_pcie_power_on() we're currently missing all of the cleanup code. >>> You could make that a preparatory patch that goes before this particula= r >>> patch, which will hopefully make this patch easier to review. >>> >>> Thierry >> Okay, I will prepare new patch for error handling and restrict this patc= h only for >> sequence correction