From: Tony Lindgren <tony@atomide.com> To: Kevin Hilman <khilman@linaro.org> Cc: Roger Quadros <rogerq@ti.com>, "Menon, Nishanth" <nm@ti.com>, Grygorii Strashko <grygorii.strashko@ti.com>, "Shilimkar, Santosh" <santosh.shilimkar@ti.com>, Paul Walmsley <paul@pwsan.com>, Taras Kondratiuk <taras.kondratiuk@linaro.org>, "linux-omap@vger.kernel.org" <linux-omap@vger.kernel.org>, Linux ARM Kernel Mailing List <linux-arm-kernel@lists.infradead.org>, "Kristo, Tero" <t-kristo@ti.com>, Paul Burton <paul.burton@imgtec.com>, Daniel Lezcano <daniel.lezcano@linaro.org>, "Rafael J. Wysocki" <rafael.j.wysocki@intel.com> Subject: Re: omap4-panda-es boot issues with v3.15-rc4 Date: Thu, 8 May 2014 11:40:56 -0700 [thread overview] Message-ID: <20140508184055.GC2198@atomide.com> (raw) In-Reply-To: <20140508165558.GB2198@atomide.com> Added few cpuidle people to Cc on this regression. * Tony Lindgren <tony@atomide.com> [140508 09:57]: > * Kevin Hilman <khilman@linaro.org> [140508 08:40]: > > On Thu, May 8, 2014 at 8:31 AM, Kevin Hilman <khilman@linaro.org> wrote: > > > Roger Quadros <rogerq@ti.com> writes: > > > > > >> Hi, > > >> > > >> Nishant pointed me to a booting issue with omap4-panda-es on linux-next but I'm observing > > >> similar issues, although less frequent, with v3.15-rc4 as well. > > >> > > >> Configuration: > > >> > > >> - kernel v3.15-rc4 or linux-next (20140507) > > >> - multi_v7_defconfig with LEDS_TRIGGER_HEARTBEAT and LEDS_GPIO enabled > > >> - u-boot/master 173d294b94cf > > >> > > >> Observations: > > >> > > >> - Out of 10 boots a few may not succeed and hang midway without any warnings. Heartbeat LED stops. > > >> e.g. http://www.hastebin.com/ebumojegoq.vhdl > > >> > > >> - Hang more noticeable on linux-next (20140507) than on v3.15-rc4 > > > > > > I've beeen noticing the same thing for awhile with my boot tests. For > > > me, next-20140508 is failing most of the time now. > > > > > >> - Hang more noticeable with USB_EHCI_HCD enabled but hang observed even without USB_EHCI_HCD. > > >> Maybe related to when high speed interrupts occur in the boot process. > > >> > > >> - On successful boots following warning is seen > > >> [ 4.010375] gic_timer_retrigger: lost localtimer interrupt > > >> > > >> - On successful boots heartbeat LED stops blinking after boot process and left idle. LED can remain stuck in > > >> ON state as well. It does blink again when doing activity on console. > > >> > > >> Workaround: > > >> > > >> - Disabling CPU_IDLE or even just disabling C3 (MPU OSWR) seems to fix all the above issues. > > >> > > >> I don't really know what exactly is the issue but it seems to be specific to OMAP4, GIC, MPU OSWR. > > > > > > I can confirm that disabling CONFIG_CPU_IDLE seems to make the problem > > > go away. Hmm.... > > > > Another finger pointing in the same direction: omap2plus_defconfig + > > CONFIG_CPU_IDLE=y also fails to boot rather consistently in today's > > -next. > > Booting today's next with multi_v7_defconfig (so cpuidle enabled) on > omap4 sdp seems to boot reliably. And it's not producing these: > > gic_timer_retrigger: lost localtimer interrupt Still seeing the above, looks like the lost localtimer interrupt above is a separate issue.. > while panda is producing those errors like Roger mentioned. > > It seems that the USB networking is the main difference between > omap4 sdp and panda? ..but I think I found the cause for recent hangs on panda, just a wild guess based on looking at the recent cpuidle patches after v3.14. Looks like reverting 0b89e9aa2856 (cpuidle: delay enabling interrupts until all coupled CPUs leave idle) makes booting work reliably again on panda. Can you guys confirm, so far no issues here after few boot tests, but it might be too early to tell. Regards, Tony
WARNING: multiple messages have this Message-ID (diff)
From: tony@atomide.com (Tony Lindgren) To: linux-arm-kernel@lists.infradead.org Subject: omap4-panda-es boot issues with v3.15-rc4 Date: Thu, 8 May 2014 11:40:56 -0700 [thread overview] Message-ID: <20140508184055.GC2198@atomide.com> (raw) In-Reply-To: <20140508165558.GB2198@atomide.com> Added few cpuidle people to Cc on this regression. * Tony Lindgren <tony@atomide.com> [140508 09:57]: > * Kevin Hilman <khilman@linaro.org> [140508 08:40]: > > On Thu, May 8, 2014 at 8:31 AM, Kevin Hilman <khilman@linaro.org> wrote: > > > Roger Quadros <rogerq@ti.com> writes: > > > > > >> Hi, > > >> > > >> Nishant pointed me to a booting issue with omap4-panda-es on linux-next but I'm observing > > >> similar issues, although less frequent, with v3.15-rc4 as well. > > >> > > >> Configuration: > > >> > > >> - kernel v3.15-rc4 or linux-next (20140507) > > >> - multi_v7_defconfig with LEDS_TRIGGER_HEARTBEAT and LEDS_GPIO enabled > > >> - u-boot/master 173d294b94cf > > >> > > >> Observations: > > >> > > >> - Out of 10 boots a few may not succeed and hang midway without any warnings. Heartbeat LED stops. > > >> e.g. http://www.hastebin.com/ebumojegoq.vhdl > > >> > > >> - Hang more noticeable on linux-next (20140507) than on v3.15-rc4 > > > > > > I've beeen noticing the same thing for awhile with my boot tests. For > > > me, next-20140508 is failing most of the time now. > > > > > >> - Hang more noticeable with USB_EHCI_HCD enabled but hang observed even without USB_EHCI_HCD. > > >> Maybe related to when high speed interrupts occur in the boot process. > > >> > > >> - On successful boots following warning is seen > > >> [ 4.010375] gic_timer_retrigger: lost localtimer interrupt > > >> > > >> - On successful boots heartbeat LED stops blinking after boot process and left idle. LED can remain stuck in > > >> ON state as well. It does blink again when doing activity on console. > > >> > > >> Workaround: > > >> > > >> - Disabling CPU_IDLE or even just disabling C3 (MPU OSWR) seems to fix all the above issues. > > >> > > >> I don't really know what exactly is the issue but it seems to be specific to OMAP4, GIC, MPU OSWR. > > > > > > I can confirm that disabling CONFIG_CPU_IDLE seems to make the problem > > > go away. Hmm.... > > > > Another finger pointing in the same direction: omap2plus_defconfig + > > CONFIG_CPU_IDLE=y also fails to boot rather consistently in today's > > -next. > > Booting today's next with multi_v7_defconfig (so cpuidle enabled) on > omap4 sdp seems to boot reliably. And it's not producing these: > > gic_timer_retrigger: lost localtimer interrupt Still seeing the above, looks like the lost localtimer interrupt above is a separate issue.. > while panda is producing those errors like Roger mentioned. > > It seems that the USB networking is the main difference between > omap4 sdp and panda? ..but I think I found the cause for recent hangs on panda, just a wild guess based on looking at the recent cpuidle patches after v3.14. Looks like reverting 0b89e9aa2856 (cpuidle: delay enabling interrupts until all coupled CPUs leave idle) makes booting work reliably again on panda. Can you guys confirm, so far no issues here after few boot tests, but it might be too early to tell. Regards, Tony
next prev parent reply other threads:[~2014-05-08 18:41 UTC|newest] Thread overview: 36+ messages / expand[flat|nested] mbox.gz Atom feed top 2014-05-08 12:53 omap4-panda-es boot issues with v3.15-rc4 Roger Quadros 2014-05-08 12:53 ` Roger Quadros 2014-05-08 15:31 ` Kevin Hilman 2014-05-08 15:31 ` Kevin Hilman 2014-05-08 15:40 ` Kevin Hilman 2014-05-08 15:40 ` Kevin Hilman 2014-05-08 16:55 ` Tony Lindgren 2014-05-08 16:55 ` Tony Lindgren 2014-05-08 18:40 ` Tony Lindgren [this message] 2014-05-08 18:40 ` Tony Lindgren 2014-05-08 22:15 ` Kevin Hilman 2014-05-08 22:15 ` Kevin Hilman 2014-05-09 8:23 ` Roger Quadros 2014-05-09 8:23 ` Roger Quadros 2014-05-09 23:45 ` Kevin Hilman 2014-05-09 23:45 ` Kevin Hilman 2014-05-11 15:55 ` Tony Lindgren 2014-05-11 15:55 ` Tony Lindgren 2014-05-12 21:40 ` Santosh Shilimkar 2014-05-12 21:40 ` Santosh Shilimkar 2014-05-12 22:07 ` Tony Lindgren 2014-05-12 22:07 ` Tony Lindgren 2014-05-13 8:10 ` Roger Quadros 2014-05-13 8:10 ` Roger Quadros 2014-05-13 14:19 ` Santosh Shilimkar 2014-05-13 14:19 ` Santosh Shilimkar 2014-05-12 23:56 ` Kevin Hilman 2014-05-12 23:56 ` Kevin Hilman 2014-05-09 8:20 ` Roger Quadros 2014-05-09 8:20 ` Roger Quadros 2014-05-08 17:12 ` Grygorii Strashko 2014-05-08 17:12 ` Grygorii Strashko 2014-05-09 8:30 ` Roger Quadros 2014-05-09 8:30 ` Roger Quadros 2014-05-09 12:33 ` Nishanth Menon 2014-05-09 12:33 ` Nishanth Menon
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=20140508184055.GC2198@atomide.com \ --to=tony@atomide.com \ --cc=daniel.lezcano@linaro.org \ --cc=grygorii.strashko@ti.com \ --cc=khilman@linaro.org \ --cc=linux-arm-kernel@lists.infradead.org \ --cc=linux-omap@vger.kernel.org \ --cc=nm@ti.com \ --cc=paul.burton@imgtec.com \ --cc=paul@pwsan.com \ --cc=rafael.j.wysocki@intel.com \ --cc=rogerq@ti.com \ --cc=santosh.shilimkar@ti.com \ --cc=t-kristo@ti.com \ --cc=taras.kondratiuk@linaro.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.