linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Kernel crash at boot time or reboot on ARM64/Hikey
@ 2017-03-15 21:40 Daniel Lezcano
       [not found] ` <CA+55aFx=TkLsZKZ5v-EQzhcvd+Cw22ZJsysh4JdnXFw3Bdz9aA@mail.gmail.com>
  0 siblings, 1 reply; 5+ messages in thread
From: Daniel Lezcano @ 2017-03-15 21:40 UTC (permalink / raw)
  To: kirill.shutemov, mhocko, torvalds
  Cc: linux-kernel, amit.kucheria, vincent.guittot, guodong.xu


Hi Kirill,

I'm facing an issue with my ARM64/Hikey board. The kernel crashes after booting
when running services. With the linaro toolchain, the same happens but when I
reboot the system.

It is hard to see a clear error, as randomly the console shows an infinite
dump, a couple of lines saying "Invalid address".

I have been asking Vincent to test if its board was crashing and it appears the
same happens.

After git bisecting, I found this commit is the offending one.

c2febafc67734a62196c1b9dfba926412d4077ba is the first bad commit
commit c2febafc67734a62196c1b9dfba926412d4077ba
Author: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
Date:   Thu Mar 9 17:24:07 2017 +0300

    mm: convert generic code to 5-level paging

    Convert all non-architecture-specific code to 5-level paging.

    It's mostly mechanical adding handling one more page table level in
    places where we deal with pud_t.

    Signed-off-by: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
    Acked-by: Michal Hocko <mhocko@suse.com>
    Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

:040000 040000 492678235575b71f88062912e3d2a8f0983a7ab1 1e17bdef1b1ecae4261dd431bc4be67107dfdd5b M      drivers
:040000 040000 13e1ad3cf4523bab0e62efe9afa846c53b2738fc 183b6407eea0f34e5132dc67aaae1d8e3e23e555 M      fs
:040000 040000 1f90d08acc5850318753d5958f615e3b2a484ec7 dcdc7cce85ddbc5b5ef68fcfc77bb2809b4e1115 M      include
:040000 040000 9b3446e04a8d7939962c2df710f6113eea696a50 f6e627b0d378f0a00d2987fdd0c7b215306e6e3c M      lib
:040000 040000 77b15108634671b2286fcefcd3009ea96ba41543 6fc93cb7cb991c2c26b50ce7762ba469e5d4f367 M      mm

Was this issue already spotted ?

Thanks in advance

  -- Daniel

-- 

 <http://www.linaro.org/> Linaro.org │ Open source software for ARM SoCs

Follow Linaro:  <http://www.facebook.com/pages/Linaro> Facebook |
<http://twitter.com/#!/linaroorg> Twitter |
<http://www.linaro.org/linaro-blog/> Blog

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Kernel crash at boot time or reboot on ARM64/Hikey
       [not found] ` <CA+55aFx=TkLsZKZ5v-EQzhcvd+Cw22ZJsysh4JdnXFw3Bdz9aA@mail.gmail.com>
@ 2017-03-15 21:45   ` Daniel Lezcano
  2017-03-16 13:33   ` Daniel Lezcano
  1 sibling, 0 replies; 5+ messages in thread
From: Daniel Lezcano @ 2017-03-15 21:45 UTC (permalink / raw)
  To: Linus Torvalds
  Cc: Vincent Guittot, guodong.xu, mhocko, amit.kucheria, linux-kernel,
	kirill.shutemov

On Wed, Mar 15, 2017 at 02:42:50PM -0700, Linus Torvalds wrote:
> Should be fixed in current Git already..

Ok, thanks.

  -- Daniel

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Kernel crash at boot time or reboot on ARM64/Hikey
       [not found] ` <CA+55aFx=TkLsZKZ5v-EQzhcvd+Cw22ZJsysh4JdnXFw3Bdz9aA@mail.gmail.com>
  2017-03-15 21:45   ` Daniel Lezcano
@ 2017-03-16 13:33   ` Daniel Lezcano
  2017-03-16 14:25     ` Kirill A. Shutemov
       [not found]     ` <CA+55aFxo266BOXVOJu_x9MJgH2HOMOg12-opqibX+0fPz2vCoA@mail.gmail.com>
  1 sibling, 2 replies; 5+ messages in thread
From: Daniel Lezcano @ 2017-03-16 13:33 UTC (permalink / raw)
  To: Linus Torvalds
  Cc: Vincent Guittot, guodong.xu, mhocko, amit.kucheria, linux-kernel,
	kirill.shutemov

On Wed, Mar 15, 2017 at 02:42:50PM -0700, Linus Torvalds wrote:
> Should be fixed in current Git already..
> 

The v4.11-rc2 still has the issue.

At appears at boot time or when rebooting and spits a lot of traces:

[ ... ]

** 2467 printk messages dropped ** [   11.218861] 8540: ffff8000223e8550 ffff0000081a7634 ffff8000223e8560 ffff0000081a7634
** 2465 printk messages dropped ** [   11.229744] b980: ffff8000223fb990 ffff0000081a7634 ffff8000223fb9a0 ffff0000081a7634
** 2469 printk messages dropped ** [   11.240638] ee40: ffff80002240ee50 ffff0000081a7634 ffff80002240ee60 ffff0000081a7634
** 2464 printk messages dropped ** [   11.251519] 2260: ffff800022422270 ffff0000081a7634 ffff800022422280 ffff0000081a7634
** 2471 printk messages dropped ** [   11.262418] 5760: ffff800022435770 ffff0000081a7634 ffff800022435780 ffff0000081a7634
** 2468 printk messages dropped ** [   11.273302] 8c00: ffff800022448c10 ffff0000081a7634 ffff800022448c20 ffff0000081a7634
** 2467 printk messages dropped ** [   11.284187] c080: ffff80002245c090 ffff0000081a7634 ffff80002245c0a0 ffff0000081a7634
** 2466 printk messages dropped ** [   11.295069] f4e0: ffff80002246f4f0 ffff0000081a7634 ffff80002246f500 ffff0000081a7634
** 2468 printk messages dropped ** [   11.305967] 2980: ffff800022482990 ffff0000081a7634 ffff8000224829a0 ffff0000081a7634
** 2465 printk messages dropped ** [   11.316851] 5dc0: ffff800022495dd0 ffff0000081a7634 ffff800022495de0 ffff0000081a7634
** 2469 printk messages dropped ** [   11.327742] 9280: ffff8000224a9290 ffff0000081a7634 ffff8000224a92a0 ffff0000081a7634
** 2469 printk messages dropped ** [   11.338625] c740: ffff8000224bc750 ffff0000081a7634 ffff8000224bc760 ffff0000081a7634
** 2471 printk messages dropped ** [   11.349521] fc40: ffff8000224cfc50 ffff0000081a7634 ffff8000224cfc60 ffff0000081a7634

[ ... ]

>      Linus
> 
> On Mar 15, 2017 14:40, "Daniel Lezcano" <daniel.lezcano@linaro.org> wrote:
> 
> >
> > Hi Kirill,
> >
> > I'm facing an issue with my ARM64/Hikey board. The kernel crashes after
> > booting
> > when running services. With the linaro toolchain, the same happens but
> > when I
> > reboot the system.
> >
> > It is hard to see a clear error, as randomly the console shows an infinite
> > dump, a couple of lines saying "Invalid address".
> >
> > I have been asking Vincent to test if its board was crashing and it
> > appears the
> > same happens.
> >
> > After git bisecting, I found this commit is the offending one.
> >
> > c2febafc67734a62196c1b9dfba926412d4077ba is the first bad commit
> > commit c2febafc67734a62196c1b9dfba926412d4077ba
> > Author: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
> > Date:   Thu Mar 9 17:24:07 2017 +0300
> >
> >     mm: convert generic code to 5-level paging
> >
> >     Convert all non-architecture-specific code to 5-level paging.
> >
> >     It's mostly mechanical adding handling one more page table level in
> >     places where we deal with pud_t.
> >
> >     Signed-off-by: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
> >     Acked-by: Michal Hocko <mhocko@suse.com>
> >     Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
> >
> > :040000 040000 492678235575b71f88062912e3d2a8f0983a7ab1
> > 1e17bdef1b1ecae4261dd431bc4be67107dfdd5b M      drivers
> > :040000 040000 13e1ad3cf4523bab0e62efe9afa846c53b2738fc
> > 183b6407eea0f34e5132dc67aaae1d8e3e23e555 M      fs
> > :040000 040000 1f90d08acc5850318753d5958f615e3b2a484ec7
> > dcdc7cce85ddbc5b5ef68fcfc77bb2809b4e1115 M      include
> > :040000 040000 9b3446e04a8d7939962c2df710f6113eea696a50
> > f6e627b0d378f0a00d2987fdd0c7b215306e6e3c M      lib
> > :040000 040000 77b15108634671b2286fcefcd3009ea96ba41543
> > 6fc93cb7cb991c2c26b50ce7762ba469e5d4f367 M      mm
> >
> > Was this issue already spotted ?
> >
> > Thanks in advance
> >
> >   -- Daniel
> >
> > --
> >
> >  <http://www.linaro.org/> Linaro.org │ Open source software for ARM SoCs
> >
> > Follow Linaro:  <http://www.facebook.com/pages/Linaro> Facebook |
> > <http://twitter.com/#!/linaroorg> Twitter |
> > <http://www.linaro.org/linaro-blog/> Blog
> >

-- 

 <http://www.linaro.org/> Linaro.org │ Open source software for ARM SoCs

Follow Linaro:  <http://www.facebook.com/pages/Linaro> Facebook |
<http://twitter.com/#!/linaroorg> Twitter |
<http://www.linaro.org/linaro-blog/> Blog

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Kernel crash at boot time or reboot on ARM64/Hikey
  2017-03-16 13:33   ` Daniel Lezcano
@ 2017-03-16 14:25     ` Kirill A. Shutemov
       [not found]     ` <CA+55aFxo266BOXVOJu_x9MJgH2HOMOg12-opqibX+0fPz2vCoA@mail.gmail.com>
  1 sibling, 0 replies; 5+ messages in thread
From: Kirill A. Shutemov @ 2017-03-16 14:25 UTC (permalink / raw)
  To: Daniel Lezcano
  Cc: Linus Torvalds, Vincent Guittot, guodong.xu, mhocko,
	amit.kucheria, linux-kernel, kirill.shutemov

On Thu, Mar 16, 2017 at 02:33:40PM +0100, Daniel Lezcano wrote:
> On Wed, Mar 15, 2017 at 02:42:50PM -0700, Linus Torvalds wrote:
> > Should be fixed in current Git already..
> > 
> 
> The v4.11-rc2 still has the issue.

Have you tried actual master? There's fix after -rc2.

-- 
 Kirill A. Shutemov

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Kernel crash at boot time or reboot on ARM64/Hikey
       [not found]     ` <CA+55aFxo266BOXVOJu_x9MJgH2HOMOg12-opqibX+0fPz2vCoA@mail.gmail.com>
@ 2017-03-16 14:47       ` Daniel Lezcano
  0 siblings, 0 replies; 5+ messages in thread
From: Daniel Lezcano @ 2017-03-16 14:47 UTC (permalink / raw)
  To: Linus Torvalds
  Cc: guodong.xu, Michal Hocko, linux-kernel, amit.kucheria,
	kirill.shutemov, Vincent Guittot

On Thu, Mar 16, 2017 at 07:10:24AM -0700, Linus Torvalds wrote:
> On Mar 16, 2017 6:33 AM, "Daniel Lezcano" <daniel.lezcano@linaro.org> wrote:
> 
> On Wed, Mar 15, 2017 at 02:42:50PM -0700, Linus Torvalds wrote:
> > Should be fixed in current Git already..
> >
> 
> The v4.11-rc2 still has the issue.
> 
> 
> Yes. The fix is in git, after rc2.

Ok, confirmed from my side. The issue is fixed on HEAD.

Thanks!

  -- Daniel

-- 

 <http://www.linaro.org/> Linaro.org │ Open source software for ARM SoCs

Follow Linaro:  <http://www.facebook.com/pages/Linaro> Facebook |
<http://twitter.com/#!/linaroorg> Twitter |
<http://www.linaro.org/linaro-blog/> Blog

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2017-03-18 17:01 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-03-15 21:40 Kernel crash at boot time or reboot on ARM64/Hikey Daniel Lezcano
     [not found] ` <CA+55aFx=TkLsZKZ5v-EQzhcvd+Cw22ZJsysh4JdnXFw3Bdz9aA@mail.gmail.com>
2017-03-15 21:45   ` Daniel Lezcano
2017-03-16 13:33   ` Daniel Lezcano
2017-03-16 14:25     ` Kirill A. Shutemov
     [not found]     ` <CA+55aFxo266BOXVOJu_x9MJgH2HOMOg12-opqibX+0fPz2vCoA@mail.gmail.com>
2017-03-16 14:47       ` Daniel Lezcano

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).