linux-renesas-soc.vger.kernel.org archive mirror
From: Heiner Kallweit <hkallweit1@gmail.com>
To: Geert Uytterhoeven <geert@linux-m68k.org>,
	Florian Fainelli <f.fainelli@gmail.com>
Cc: David Miller <davem@davemloft.net>,
	netdev <netdev@vger.kernel.org>,
	Yoshihiro Shimoda <yoshihiro.shimoda.uh@renesas.com>,
	Andrew Lunn <andrew@lunn.ch>,
	Russell King <linux@armlinux.org.uk>,
	Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	Linux-Renesas <linux-renesas-soc@vger.kernel.org>
Subject: Re: [PATCH net] net: phy: Avoid multiple suspends
Date: Wed, 11 Mar 2020 22:22:27 +0100
Message-ID: <ca2abe1a-a9ed-23c9-ceaa-b0042be49be9@gmail.com>
In-Reply-To: <CAMuHMdUMM0Q6W7A0mVgSf7XmF8yROZb3uzHPU1ETbMAfvTtfow@mail.gmail.com>

On 11.03.2020 10:17, Geert Uytterhoeven wrote:
> On Tue, Mar 10, 2020 at 5:47 PM Florian Fainelli <f.fainelli@gmail.com> wrote:
>>
>> On 3/10/20 7:16 AM, Geert Uytterhoeven wrote:
>>> Hi Florian, David,
>>>
>>> On Mon, Feb 24, 2020 at 5:59 AM David Miller <davem@davemloft.net> wrote:
>>>> From: Florian Fainelli <f.fainelli@gmail.com>
>>>> Date: Thu, 20 Feb 2020 15:34:53 -0800
>>>>
>>>>> It is currently possible for a PHY device to be suspended as part of a
>>>>> network device driver's suspend call while it is still being attached to
>>>>> that net_device, either via phy_suspend() or implicitly via phy_stop().
>>>>>
>>>>> Later on, when the MDIO bus controller gets suspended, we would attempt
>>>>> to suspend the PHY again because it is still attached to a network
>>>>> device.
>>>>>
>>>>> This is both a waste of time and creates an opportunity for improper
>>>>> clock/power management bugs to creep in.
>>>>>
>>>>> Fixes: 803dd9c77ac3 ("net: phy: avoid suspending twice a PHY")
>>>>> Signed-off-by: Florian Fainelli <f.fainelli@gmail.com>
>>>>
>>>> Applied, and queued up for -stable, thanks Florian.
>>>
>>> This patch causes a regression on r8a73a4/ape6evm and sh73a0/kzm9g.
>>> After resume from s2ram, Ethernet no longer works:
>>>
>>>         PM: suspend exit
>>>         nfs: server aaa.bbb.ccc.ddd not responding, still trying
>>>         ...
>>>
>>> Reverting commit 503ba7c6961034ff ("net: phy: Avoid multiple suspends")
>>> fixes the issue.
>>>
>>> On both boards, an SMSC LAN9220 is connected to a power-managed local
>>> bus.
>>>
>>> I added some debug code to check when the clock driving the local bus
>>> is stopped and started, but I see no difference before/after.  Hence I
>>> suspect the Ethernet chip is no longer reinitialized after resume.
>>
>> Can you provide a complete log?
> 
> With some debug info:
> 
>     SDHI0 Vcc: disabling
>     PM: suspend entry (deep)
>     Filesystems sync: 0.002 seconds
>     Freezing user space processes ... (elapsed 0.001 seconds) done.
>     OOM killer disabled.
>     Freezing remaining freezable tasks ... (elapsed 0.001 seconds) done.
>     PM: ==== a3sp/ee120000.sd: stop
>     PM: ==== a3sp/ee100000.sd: stop
>     smsc911x 8000000.ethernet: smsc911x_suspend:2577
>     smsc911x 8000000.ethernet: smsc911x_suspend:2579 running
>     smsc911x 8000000.ethernet: smsc911x_suspend:2584
>     PM: ==== a3sp/ee200000.mmc: stop
>     PM: ==== c4/fec10000.bus: stop
>     PM: ==== a3sp/e6c40000.serial: stop
>     PM: ==== c5/e61f0000.thermal: stop
>     PM: ==== c4/e61c0200.interrupt-controller: stop
>     PM: == a3sp: power off
>     rmobile_pd_power_down: a3sp
>     Disabling non-boot CPUs ...
>     PM: ==== c4/e61c0200.interrupt-controller: start
>     PM: ==== c5/e61f0000.thermal: start
>     PM: ==== a3sp/e6c40000.serial: start
>     PM: ==== c4/fec10000.bus: start
>     PM: ==== a3sp/ee200000.mmc: start
>     smsc911x 8000000.ethernet: smsc911x_resume:2606
>     smsc911x 8000000.ethernet: smsc911x_resume:2625 running
>     PM: ==== a3sp/ee100000.sd: start
>     OOM killer enabled.
>     Restarting tasks ... done.
>     PM: ==== a3sp/ee120000.sd: start
>     PM: suspend exit
>     nfs: server aaa.bbb.ccc.ddd not responding, still trying
>     ...
> 
> But no difference between the good and the bad case, except for the nfs
> failures.
> 
>> Do you use the Generic PHY driver or a
>> specialized one?
> 
> CONFIG_FIXED_PHY=y
> CONFIG_SMSC_PHY=y
> 
> Just the smsc,lan9115 node, cf. arch/arm/boot/dts/r8a73a4-ape6evm.dts
> 
>> Do you have a way to dump the registers at the time of
>> failure and see if BMCR.PDOWN is still set somehow?
> 
> Added a hook into "nfs: server not responding", which prints:
> 
>     MII_BMCR = 0x1900
> 
> i.e. BMCR_PDOWN = 0x0800 is still set.
> 
>> Does the following help:
>>
>> diff --git a/drivers/net/ethernet/smsc/smsc911x.c
>> b/drivers/net/ethernet/smsc/smsc911x.c
>> index 49a6a9167af4..df17190c76c0 100644
>> --- a/drivers/net/ethernet/smsc/smsc911x.c
>> +++ b/drivers/net/ethernet/smsc/smsc911x.c
>> @@ -2618,6 +2618,7 @@ static int smsc911x_resume(struct device *dev)
>>         if (netif_running(ndev)) {
>>                 netif_device_attach(ndev);
>>                 netif_start_queue(ndev);
>> +               phy_resume(dev->phydev);
>>         }
>>
> 
> Yes, it does, after s/dev->/ndev->/.
> Thanks!
> 

This seems to be a workaround, and we may have the same issue with
other drivers too. Could you please test the following alternative?
It tackles the issue that mdio_bus_phy_may_suspend() is called in both
suspend and resume, and the two calls may return different values.

With this patch we call mdio_bus_phy_may_suspend() only when
suspending, and let the phy_device store whether it was suspended
by MDIO bus PM.
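
To illustrate the difference, here is a minimal user-space sketch, not
kernel code. Everything prefixed toy_ is hypothetical, and
toy_may_suspend() is an assumed simplification of
mdio_bus_phy_may_suspend() that only looks at the "suspended" flag.
The starting BMCR value 0x1100 is also an assumption (the 0x1900 from
the report with BMCR_PDOWN cleared).

```c
#include <stdbool.h>

#define BMCR_PDOWN 0x0800u /* power-down bit, value as quoted above */

/* Toy stand-in for struct phy_device: only what this sketch needs. */
struct toy_phy {
	unsigned int bmcr;
	bool suspended;
	bool suspended_by_mdio_bus;
};

/* Assumed simplification: may suspend only if not already suspended. */
static bool toy_may_suspend(const struct toy_phy *phy)
{
	return !phy->suspended;
}

static void toy_phy_suspend(struct toy_phy *phy)
{
	phy->bmcr |= BMCR_PDOWN;
	phy->suspended = true;
}

static void toy_phy_resume(struct toy_phy *phy)
{
	phy->bmcr &= ~BMCR_PDOWN;
	phy->suspended = false;
}

/* Current scheme: the same predicate gates both suspend and resume. */
static void toy_old_suspend(struct toy_phy *phy)
{
	if (toy_may_suspend(phy))
		toy_phy_suspend(phy);
}

static void toy_old_resume(struct toy_phy *phy)
{
	/* Returns false here, because the PHY is suspended by now. */
	if (toy_may_suspend(phy))
		toy_phy_resume(phy);
}

/* Proposed scheme: remember at suspend time who suspended the PHY. */
static void toy_new_suspend(struct toy_phy *phy)
{
	if (!toy_may_suspend(phy))
		return;
	phy->suspended_by_mdio_bus = true;
	toy_phy_suspend(phy);
}

static void toy_new_resume(struct toy_phy *phy)
{
	if (!phy->suspended_by_mdio_bus)
		return;
	phy->suspended_by_mdio_bus = false;
	toy_phy_resume(phy);
}

/* Run one suspend/resume cycle and return BMCR afterwards. */
unsigned int toy_cycle(bool use_flag)
{
	struct toy_phy phy = { .bmcr = 0x1100u };

	if (use_flag) {
		toy_new_suspend(&phy);
		toy_new_resume(&phy);
	} else {
		toy_old_suspend(&phy);
		toy_old_resume(&phy);
	}
	return phy.bmcr;
}
```

Under these assumptions, toy_cycle(false) leaves BMCR_PDOWN set, which
is consistent with the 0x1900 reported above, while toy_cycle(true)
clears it again.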

diff --git a/drivers/net/phy/phy_device.c b/drivers/net/phy/phy_device.c
index 32a5ceddc..6d6c6a178 100644
--- a/drivers/net/phy/phy_device.c
+++ b/drivers/net/phy/phy_device.c
@@ -286,6 +286,8 @@ static int mdio_bus_phy_suspend(struct device *dev)
 	if (!mdio_bus_phy_may_suspend(phydev))
 		return 0;
 
+	phydev->suspended_by_mdio_bus = 1;
+
 	return phy_suspend(phydev);
 }
 
@@ -294,9 +296,11 @@ static int mdio_bus_phy_resume(struct device *dev)
 	struct phy_device *phydev = to_phy_device(dev);
 	int ret;
 
-	if (!mdio_bus_phy_may_suspend(phydev))
+	if (!phydev->suspended_by_mdio_bus)
 		goto no_resume;
 
+	phydev->suspended_by_mdio_bus = 0;
+
 	ret = phy_resume(phydev);
 	if (ret < 0)
 		return ret;
diff --git a/include/linux/phy.h b/include/linux/phy.h
index 8b299476b..118de9f5b 100644
--- a/include/linux/phy.h
+++ b/include/linux/phy.h
@@ -357,6 +357,7 @@ struct macsec_ops;
  * is_gigabit_capable: Set to true if PHY supports 1000Mbps
  * has_fixups: Set to true if this phy has fixups/quirks.
  * suspended: Set to true if this phy has been suspended successfully.
+ * suspended_by_mdio_bus: Set to true if this phy was suspended by MDIO bus.
  * sysfs_links: Internal boolean tracking sysfs symbolic links setup/removal.
  * loopback_enabled: Set true if this phy has been loopbacked successfully.
  * state: state of the PHY for management purposes
@@ -396,6 +397,7 @@ struct phy_device {
 	unsigned is_gigabit_capable:1;
 	unsigned has_fixups:1;
 	unsigned suspended:1;
+	unsigned suspended_by_mdio_bus:1;
 	unsigned sysfs_links:1;
 	unsigned loopback_enabled:1;
 
-- 
2.25.1






