From: Geert Uytterhoeven <geert@linux-m68k.org>
To: Bart Van Assche <bvanassche@acm.org>
Cc: Hans de Goede <hdegoede@redhat.com>,
"Martin K . Petersen" <martin.petersen@oracle.com>,
scsi <linux-scsi@vger.kernel.org>,
Damien Le Moal <damien.lemoal@opensource.wdc.com>,
Hannes Reinecke <hare@suse.de>,
gzhqyz@gmail.com, "James E.J. Bottomley" <jejb@linux.ibm.com>,
Linux-Renesas <linux-renesas-soc@vger.kernel.org>
Subject: Re: [PATCH] scsi: sd: Revert "Rework asynchronous resume support"
Date: Fri, 26 Aug 2022 09:54:40 +0200 [thread overview]
Message-ID: <CAMuHMdVYqceaamrYXK9bFW1mQ+m7HMUA-jhW_+co3b-GB6fAeA@mail.gmail.com> (raw)
In-Reply-To: <026ad7cc-5be9-e90b-8c95-0649caf68779@acm.org>
Hi Bart,
On Tue, Aug 23, 2022 at 8:10 PM Bart Van Assche <bvanassche@acm.org> wrote:
> On 8/22/22 23:41, Geert Uytterhoeven wrote:
> > A lock-up (magic sysrq does not work) during s2idle.
> > I tried bisecting it yesterday, but failed.
> > On v6.0-rc1 (and rc2) it happens ca. 25% of the time, but the closer
> > I get to v5.19, the less likely it is to happen. Apparently 100
> > successful s2idle cycles was not enough to declare a kernel good...
> >
> > Freezing ...
> > Filesystems sync: 0.001 seconds
> > Freezing user space processes ... (elapsed 0.001 seconds) done.
> > OOM killer disabled.
> > Freezing remaining freezable tasks ... (elapsed 0.001 seconds) done.
> > sd 0:0:0:0: [sda] Synchronizing SCSI cache
> > sd 0:0:0:0: [sda] Stopping disk
> >
> > ---> hangs here if it happens
> >
> > ravb e6800000.ethernet eth0: Link is Down
> > sd 0:0:0:0: [sda] Starting disk
> > Micrel KSZ9031 Gigabit PHY e6800000.ethernet-ffffffff:00: attached
> > PHY driver (mii_bus:phy_addr=e6800000.ethernet-ffffffff:00, irq=186)
> > ata1: link resume succeeded after 1 retries
> > ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
> > ata1.00: configured for UDMA/133
> > OOM killer enabled.
> > Restarting tasks ... done.
> > random: crng reseeded on system resumption
> > PM: suspend exit
> > ravb e6800000.ethernet eth0: Link is Up - 1Gbps/Full - flow control off
>
> I'm not sure that is enough information to find the root cause. How
Sorry for not making it clear I didn't expect this to be enough
information.
> about enabling the tp_printk boot option and to enable tracing for
> suspend/resume operations, e.g. as follows?
>
> cd /sys/kernel/tracing &&
> echo 256 > /sys/kernel/tracing/buffer_size_kb &&
> echo nop > current_tracer &&
> echo > trace &&
> echo 1 > events/power/device_pm_callback_start/enable &&
> echo 1 > events/power/device_pm_callback_end/enable &&
> echo 1 > events/power/suspend_resume/enable &&
> echo 1 > tracing_on
Thanks, that generates lots of output (362 KiB/cycle)!
Unfortunately it also has an impact on the probability of lock-ups.
Combined with 'scsi: sd: Revert "Rework asynchronous resume support"',
s2idle now works almost always.
I did manage to trigger the lock-up once with tracing enabled:
device_pm_callback_end: gpio_rcar e6055400.gpio, err=0
device_pm_callback_start: gpio_rcar e6055800.gpio, parent: soc,
noirq power domain [suspend]
device_pm_callback_end: gpio_rcar e6055800.gpio, err=0
device_pm_callback_start: renesas-cpg-mssr
e6150000.clock-controller, parent: soc, noirq driver [suspend]
device_pm_callback_end: renesas-cpg-mssr e6150000.clock-controller, err=0
device_pm_callback_start: sh-pfc e6060000.pinctrl, parent: soc,
noirq driver [suspend]
device_pm_callback_end: sh-pfc e6060000.pinctrl, err=0
suspend_resume: dpm_suspend_noirq[2] end
suspend_resume: machine_suspend[1] begin
suspend_resume: timekeeping_freeze[5] begin
---> hang
suspend_resume: timekeeping_freeze[0] end
suspend_resume: machine_suspend[1] end
suspend_resume: dpm_resume_noirq[16] begin
device_pm_callback_start: sh-pfc e6060000.pinctrl, parent: soc,
noirq driver [resume]
device_pm_callback_end: sh-pfc e6060000.pinctrl, err=0
device_pm_callback_start: renesas-cpg-mssr
e6150000.clock-controller, parent: soc, noirq driver [resume]
device_pm_callback_end: renesas-cpg-mssr e6150000.clock-controller, err=0
device_pm_callback_start: gpio_rcar e6055800.gpio, parent: soc,
noirq power domain [resume]
Oops, timers...
At least it's not related to SCSI ;-)
Gr{oetje,eeting}s,
Geert
--
Geert Uytterhoeven -- There's lots of Linux beyond ia32 -- geert@linux-m68k.org
In personal conversations with technical people, I call myself a hacker. But
when I'm talking to journalists I just say "programmer" or something like that.
-- Linus Torvalds
prev parent reply other threads:[~2022-08-26 7:54 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-08-16 17:26 [PATCH] scsi: sd: Revert "Rework asynchronous resume support" Bart Van Assche
2022-08-16 18:00 ` John Garry
2022-08-16 18:06 ` Bart Van Assche
2022-08-17 8:20 ` John Garry
2022-08-17 20:06 ` Vlastimil Babka
2022-08-22 20:53 ` Vlastimil Babka
2022-08-20 15:37 ` Hans de Goede
2022-08-21 9:16 ` Geert Uytterhoeven
2022-08-22 2:52 ` Bart Van Assche
2022-08-23 6:41 ` Geert Uytterhoeven
2022-08-23 18:10 ` Bart Van Assche
2022-08-26 7:54 ` Geert Uytterhoeven [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=CAMuHMdVYqceaamrYXK9bFW1mQ+m7HMUA-jhW_+co3b-GB6fAeA@mail.gmail.com \
--to=geert@linux-m68k.org \
--cc=bvanassche@acm.org \
--cc=damien.lemoal@opensource.wdc.com \
--cc=gzhqyz@gmail.com \
--cc=hare@suse.de \
--cc=hdegoede@redhat.com \
--cc=jejb@linux.ibm.com \
--cc=linux-renesas-soc@vger.kernel.org \
--cc=linux-scsi@vger.kernel.org \
--cc=martin.petersen@oracle.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.