All of lore.kernel.org
 help / color / mirror / Atom feed
* exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig
@ 2014-09-10 18:20 ` Tyler Baker
  0 siblings, 0 replies; 14+ messages in thread
From: Tyler Baker @ 2014-09-10 18:20 UTC (permalink / raw)
  To: linux-samsung-soc; +Cc: linux-arm-kernel

Exynos5420-based Arndale octa boards have recently started failing boot
tests due to imprecise external aborts.  This only appears to happen
when using exynos_defconfig and boots fine with multi_v7_defconfig.  The
issue seems to be intermittent, so is not reliably reproducable and
difficult to bisect.  Here are a few boot logs from recent
mainline/linux-next kernels that are failing:

mainline/v3.17-rc3-165-g56c2285:
    http://armv7.com/scheduler/job/5525/log_file#L_59_276

mainline/v3.17-rc3-176-g2b12164:
    http://armv7.com/scheduler/job/5595/log_file#L_59_278

mainline/v3.17-rc4-131-gd030671:
    http://armv7.com/scheduler/job/5770/log_file#L_193_277

mainline/v3.17-rc4-146-g619df5d:
    http://armv7.com/scheduler/job/6014/log_file#L_124_0

next/20140908:
    http://armv7.com/scheduler/job/5815/log_file#L_59_276

next/20140910:
    http://armv7.com/scheduler/job/6161/log_file#L_59_278

The ramdisk is used is a very minimal open embedded based filesystem,
which can be found here:

http://images.armcloud.us/lava/common/linaro-image-minimal-initramfs-genericarmv7a.cpio.gz.u-boot

Please let me know if any more information is needed to help
reproduce. If/when a root cause is determined, I am more than willing
to assist in testing any fixes provided.

Thanks,

-- 
Tyler Baker
Tech Lead, LAVA
Linaro.org | Open source software for ARM SoCs
Follow Linaro: http://www.facebook.com/pages/Linaro
http://twitter.com/#!/linaroorg - http://www.linaro.org/linaro-blog

^ permalink raw reply	[flat|nested] 14+ messages in thread

* exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig
@ 2014-09-10 18:20 ` Tyler Baker
  0 siblings, 0 replies; 14+ messages in thread
From: Tyler Baker @ 2014-09-10 18:20 UTC (permalink / raw)
  To: linux-arm-kernel

Exynos5420-based Arndale octa boards have recently started failing boot
tests due to imprecise external aborts.  This only appears to happen
when using exynos_defconfig and boots fine with multi_v7_defconfig.  The
issue seems to be intermittent, so is not reliably reproducable and
difficult to bisect.  Here are a few boot logs from recent
mainline/linux-next kernels that are failing:

mainline/v3.17-rc3-165-g56c2285:
    http://armv7.com/scheduler/job/5525/log_file#L_59_276

mainline/v3.17-rc3-176-g2b12164:
    http://armv7.com/scheduler/job/5595/log_file#L_59_278

mainline/v3.17-rc4-131-gd030671:
    http://armv7.com/scheduler/job/5770/log_file#L_193_277

mainline/v3.17-rc4-146-g619df5d:
    http://armv7.com/scheduler/job/6014/log_file#L_124_0

next/20140908:
    http://armv7.com/scheduler/job/5815/log_file#L_59_276

next/20140910:
    http://armv7.com/scheduler/job/6161/log_file#L_59_278

The ramdisk is used is a very minimal open embedded based filesystem,
which can be found here:

http://images.armcloud.us/lava/common/linaro-image-minimal-initramfs-genericarmv7a.cpio.gz.u-boot

Please let me know if any more information is needed to help
reproduce. If/when a root cause is determined, I am more than willing
to assist in testing any fixes provided.

Thanks,

-- 
Tyler Baker
Tech Lead, LAVA
Linaro.org | Open source software for ARM SoCs
Follow Linaro: http://www.facebook.com/pages/Linaro
http://twitter.com/#!/linaroorg - http://www.linaro.org/linaro-blog

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig
  2014-09-10 18:20 ` Tyler Baker
@ 2014-09-10 18:46   ` Kevin Hilman
  -1 siblings, 0 replies; 14+ messages in thread
From: Kevin Hilman @ 2014-09-10 18:46 UTC (permalink / raw)
  To: Tyler Baker; +Cc: linux-samsung-soc, linux-arm-kernel

Tyler Baker <tyler.baker@linaro.org> writes:

> Exynos5420-based Arndale octa boards have recently started failing boot
> tests due to imprecise external aborts.  This only appears to happen
> when using exynos_defconfig and boots fine with multi_v7_defconfig.  The
> issue seems to be intermittent, so is not reliably reproducable and
> difficult to bisect.  Here are a few boot logs from recent
> mainline/linux-next kernels that are failing:

FYI, I'm seeing the same periodic aborts.  For example, here's my boot
of next-20140910:
http://images.armcloud.us/kernel-ci/next/next-20140910/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html

However, my userspace is much simpler and doesn't seem to cause a panic,
so my boot tests report passing. (I should fixup my scripts so these
imprecise aborts are reported as a FAIL.)

I'm glad you pointed out that it happens only with exynos_defconfig and
not multi_v7_defconfig because I noticed that too.  I haven't had the
time to track it any further than that, so maybe the exynos folks can
help track it down from here.

Thanks for reporting this,

Kevin

^ permalink raw reply	[flat|nested] 14+ messages in thread

* exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig
@ 2014-09-10 18:46   ` Kevin Hilman
  0 siblings, 0 replies; 14+ messages in thread
From: Kevin Hilman @ 2014-09-10 18:46 UTC (permalink / raw)
  To: linux-arm-kernel

Tyler Baker <tyler.baker@linaro.org> writes:

> Exynos5420-based Arndale octa boards have recently started failing boot
> tests due to imprecise external aborts.  This only appears to happen
> when using exynos_defconfig and boots fine with multi_v7_defconfig.  The
> issue seems to be intermittent, so is not reliably reproducable and
> difficult to bisect.  Here are a few boot logs from recent
> mainline/linux-next kernels that are failing:

FYI, I'm seeing the same periodic aborts.  For example, here's my boot
of next-20140910:
http://images.armcloud.us/kernel-ci/next/next-20140910/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html

However, my userspace is much simpler and doesn't seem to cause a panic,
so my boot tests report passing. (I should fixup my scripts so these
imprecise aborts are reported as a FAIL.)

I'm glad you pointed out that it happens only with exynos_defconfig and
not multi_v7_defconfig because I noticed that too.  I haven't had the
time to track it any further than that, so maybe the exynos folks can
help track it down from here.

Thanks for reporting this,

Kevin

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig
  2014-09-10 18:46   ` Kevin Hilman
@ 2014-09-13 12:59     ` Thomas Abraham
  -1 siblings, 0 replies; 14+ messages in thread
From: Thomas Abraham @ 2014-09-13 12:59 UTC (permalink / raw)
  To: Kevin Hilman; +Cc: Tyler Baker, linux-samsung-soc, linux-arm-kernel

On Thu, Sep 11, 2014 at 12:16 AM, Kevin Hilman <khilman@kernel.org> wrote:
> Tyler Baker <tyler.baker@linaro.org> writes:
>
>> Exynos5420-based Arndale octa boards have recently started failing boot
>> tests due to imprecise external aborts.  This only appears to happen
>> when using exynos_defconfig and boots fine with multi_v7_defconfig.  The
>> issue seems to be intermittent, so is not reliably reproducable and
>> difficult to bisect.  Here are a few boot logs from recent
>> mainline/linux-next kernels that are failing:
>
> FYI, I'm seeing the same periodic aborts.  For example, here's my boot
> of next-20140910:
> http://images.armcloud.us/kernel-ci/next/next-20140910/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
>
> However, my userspace is much simpler and doesn't seem to cause a panic,
> so my boot tests report passing. (I should fixup my scripts so these
> imprecise aborts are reported as a FAIL.)
>
> I'm glad you pointed out that it happens only with exynos_defconfig and
> not multi_v7_defconfig because I noticed that too.  I haven't had the
> time to track it any further than that, so maybe the exynos folks can
> help track it down from here.
>
> Thanks for reporting this,
>
> Kevin

Hi Tyler, Kevin,

>From the bootlog you have shared,

[    1.060016] CPU4: failed to come online
[    2.070031] CPU5: failed to come online
[    3.080049] CPU6: failed to come online
[    4.090066] CPU7: failed to come online
[    4.090099] Brought up 4 CPUs
[    4.090109] SMP: Total of 4 processors activated.
[    4.090119] CPU: WARNING: CPU(s) started in wrong/inconsistent
modes (primary CPU mode 0x13)
[    4.090128] CPU: This may indicate a broken bootloader or firmware.

Would it be possible to set max cpus to 1, disable switcher and try
again. I don't have a arndale octa board but I have tested mainline
kernel with smdk5420 board. It boots all eight CPUs, switcher works
fine and there are no imprecise aborts seen.

Thanks,
Thomas.

> --
> To unsubscribe from this list: send the line "unsubscribe linux-samsung-soc" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 14+ messages in thread

* exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig
@ 2014-09-13 12:59     ` Thomas Abraham
  0 siblings, 0 replies; 14+ messages in thread
From: Thomas Abraham @ 2014-09-13 12:59 UTC (permalink / raw)
  To: linux-arm-kernel

On Thu, Sep 11, 2014 at 12:16 AM, Kevin Hilman <khilman@kernel.org> wrote:
> Tyler Baker <tyler.baker@linaro.org> writes:
>
>> Exynos5420-based Arndale octa boards have recently started failing boot
>> tests due to imprecise external aborts.  This only appears to happen
>> when using exynos_defconfig and boots fine with multi_v7_defconfig.  The
>> issue seems to be intermittent, so is not reliably reproducable and
>> difficult to bisect.  Here are a few boot logs from recent
>> mainline/linux-next kernels that are failing:
>
> FYI, I'm seeing the same periodic aborts.  For example, here's my boot
> of next-20140910:
> http://images.armcloud.us/kernel-ci/next/next-20140910/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
>
> However, my userspace is much simpler and doesn't seem to cause a panic,
> so my boot tests report passing. (I should fixup my scripts so these
> imprecise aborts are reported as a FAIL.)
>
> I'm glad you pointed out that it happens only with exynos_defconfig and
> not multi_v7_defconfig because I noticed that too.  I haven't had the
> time to track it any further than that, so maybe the exynos folks can
> help track it down from here.
>
> Thanks for reporting this,
>
> Kevin

Hi Tyler, Kevin,

>From the bootlog you have shared,

[    1.060016] CPU4: failed to come online
[    2.070031] CPU5: failed to come online
[    3.080049] CPU6: failed to come online
[    4.090066] CPU7: failed to come online
[    4.090099] Brought up 4 CPUs
[    4.090109] SMP: Total of 4 processors activated.
[    4.090119] CPU: WARNING: CPU(s) started in wrong/inconsistent
modes (primary CPU mode 0x13)
[    4.090128] CPU: This may indicate a broken bootloader or firmware.

Would it be possible to set max cpus to 1, disable switcher and try
again. I don't have a arndale octa board but I have tested mainline
kernel with smdk5420 board. It boots all eight CPUs, switcher works
fine and there are no imprecise aborts seen.

Thanks,
Thomas.

> --
> To unsubscribe from this list: send the line "unsubscribe linux-samsung-soc" in
> the body of a message to majordomo at vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig
  2014-09-13 12:59     ` Thomas Abraham
@ 2014-09-18  0:39       ` Kevin Hilman
  -1 siblings, 0 replies; 14+ messages in thread
From: Kevin Hilman @ 2014-09-18  0:39 UTC (permalink / raw)
  To: Thomas Abraham; +Cc: Tyler Baker, linux-samsung-soc, linux-arm-kernel

Thomas Abraham <ta.omasab@gmail.com> writes:

> On Thu, Sep 11, 2014 at 12:16 AM, Kevin Hilman <khilman@kernel.org> wrote:
>> Tyler Baker <tyler.baker@linaro.org> writes:
>>
>>> Exynos5420-based Arndale octa boards have recently started failing boot
>>> tests due to imprecise external aborts.  This only appears to happen
>>> when using exynos_defconfig and boots fine with multi_v7_defconfig.  The
>>> issue seems to be intermittent, so is not reliably reproducable and
>>> difficult to bisect.  Here are a few boot logs from recent
>>> mainline/linux-next kernels that are failing:
>>
>> FYI, I'm seeing the same periodic aborts.  For example, here's my boot
>> of next-20140910:
>> http://images.armcloud.us/kernel-ci/next/next-20140910/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
>>
>> However, my userspace is much simpler and doesn't seem to cause a panic,
>> so my boot tests report passing. (I should fixup my scripts so these
>> imprecise aborts are reported as a FAIL.)
>>
>> I'm glad you pointed out that it happens only with exynos_defconfig and
>> not multi_v7_defconfig because I noticed that too.  I haven't had the
>> time to track it any further than that, so maybe the exynos folks can
>> help track it down from here.
>>
>> Thanks for reporting this,
>>
>> Kevin
>
> Hi Tyler, Kevin,
>
> From the bootlog you have shared,
>
> [    1.060016] CPU4: failed to come online
> [    2.070031] CPU5: failed to come online
> [    3.080049] CPU6: failed to come online
> [    4.090066] CPU7: failed to come online
> [    4.090099] Brought up 4 CPUs
> [    4.090109] SMP: Total of 4 processors activated.
> [    4.090119] CPU: WARNING: CPU(s) started in wrong/inconsistent
> modes (primary CPU mode 0x13)
> [    4.090128] CPU: This may indicate a broken bootloader or firmware.
>
> Would it be possible to set max cpus to 1, disable switcher and try
> again. I don't have a arndale octa board but I have tested mainline
> kernel with smdk5420 board. It boots all eight CPUs, switcher works
> fine and there are no imprecise aborts seen.

Sorry for the delay, I'm travelling this week.

FWIW, the same CPU boot failures you hilight above are happening on
multi_v7_defconfig[1] which is not getting the imprecise abort.  This is
only happening on exynos_defconfig[2], so I'm curious why you think the
switcher or NR_CPUS might be the issues.

Anyways, I narrowed this down a bit and discovered it's
CONFIG_EXYNOS5420_MCPM=y that's the root cause.  If I use
exynos_defconfig and then disable that option, I don't get any more
imprecise aborts.

Kevin

[1] http://images.armcloud.us/kernel-ci/mainline/v3.17-rc5-25-g8ba4caf/arm-multi_v7_defconfig/boot-exynos5420-arndale-octa.html
[2] http://images.armcloud.us/kernel-ci/mainline/v3.17-rc5-25-g8ba4caf/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html

^ permalink raw reply	[flat|nested] 14+ messages in thread

* exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig
@ 2014-09-18  0:39       ` Kevin Hilman
  0 siblings, 0 replies; 14+ messages in thread
From: Kevin Hilman @ 2014-09-18  0:39 UTC (permalink / raw)
  To: linux-arm-kernel

Thomas Abraham <ta.omasab@gmail.com> writes:

> On Thu, Sep 11, 2014 at 12:16 AM, Kevin Hilman <khilman@kernel.org> wrote:
>> Tyler Baker <tyler.baker@linaro.org> writes:
>>
>>> Exynos5420-based Arndale octa boards have recently started failing boot
>>> tests due to imprecise external aborts.  This only appears to happen
>>> when using exynos_defconfig and boots fine with multi_v7_defconfig.  The
>>> issue seems to be intermittent, so is not reliably reproducable and
>>> difficult to bisect.  Here are a few boot logs from recent
>>> mainline/linux-next kernels that are failing:
>>
>> FYI, I'm seeing the same periodic aborts.  For example, here's my boot
>> of next-20140910:
>> http://images.armcloud.us/kernel-ci/next/next-20140910/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
>>
>> However, my userspace is much simpler and doesn't seem to cause a panic,
>> so my boot tests report passing. (I should fixup my scripts so these
>> imprecise aborts are reported as a FAIL.)
>>
>> I'm glad you pointed out that it happens only with exynos_defconfig and
>> not multi_v7_defconfig because I noticed that too.  I haven't had the
>> time to track it any further than that, so maybe the exynos folks can
>> help track it down from here.
>>
>> Thanks for reporting this,
>>
>> Kevin
>
> Hi Tyler, Kevin,
>
> From the bootlog you have shared,
>
> [    1.060016] CPU4: failed to come online
> [    2.070031] CPU5: failed to come online
> [    3.080049] CPU6: failed to come online
> [    4.090066] CPU7: failed to come online
> [    4.090099] Brought up 4 CPUs
> [    4.090109] SMP: Total of 4 processors activated.
> [    4.090119] CPU: WARNING: CPU(s) started in wrong/inconsistent
> modes (primary CPU mode 0x13)
> [    4.090128] CPU: This may indicate a broken bootloader or firmware.
>
> Would it be possible to set max cpus to 1, disable switcher and try
> again. I don't have a arndale octa board but I have tested mainline
> kernel with smdk5420 board. It boots all eight CPUs, switcher works
> fine and there are no imprecise aborts seen.

Sorry for the delay, I'm travelling this week.

FWIW, the same CPU boot failures you hilight above are happening on
multi_v7_defconfig[1] which is not getting the imprecise abort.  This is
only happening on exynos_defconfig[2], so I'm curious why you think the
switcher or NR_CPUS might be the issues.

Anyways, I narrowed this down a bit and discovered it's
CONFIG_EXYNOS5420_MCPM=y that's the root cause.  If I use
exynos_defconfig and then disable that option, I don't get any more
imprecise aborts.

Kevin

[1] http://images.armcloud.us/kernel-ci/mainline/v3.17-rc5-25-g8ba4caf/arm-multi_v7_defconfig/boot-exynos5420-arndale-octa.html
[2] http://images.armcloud.us/kernel-ci/mainline/v3.17-rc5-25-g8ba4caf/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig
  2014-09-18  0:39       ` Kevin Hilman
@ 2014-09-30 18:17         ` Bartlomiej Zolnierkiewicz
  -1 siblings, 0 replies; 14+ messages in thread
From: Bartlomiej Zolnierkiewicz @ 2014-09-30 18:17 UTC (permalink / raw)
  To: Kevin Hilman
  Cc: Thomas Abraham, Tyler Baker, linux-samsung-soc, linux-arm-kernel,
	Sachin Kamat, Kukjin Kim


Hi,

On Wednesday, September 17, 2014 05:39:29 PM Kevin Hilman wrote:
> Thomas Abraham <ta.omasab@gmail.com> writes:
> 
> > On Thu, Sep 11, 2014 at 12:16 AM, Kevin Hilman <khilman@kernel.org> wrote:
> >> Tyler Baker <tyler.baker@linaro.org> writes:
> >>
> >>> Exynos5420-based Arndale octa boards have recently started failing boot
> >>> tests due to imprecise external aborts.  This only appears to happen
> >>> when using exynos_defconfig and boots fine with multi_v7_defconfig.  The
> >>> issue seems to be intermittent, so is not reliably reproducable and
> >>> difficult to bisect.  Here are a few boot logs from recent
> >>> mainline/linux-next kernels that are failing:
> >>
> >> FYI, I'm seeing the same periodic aborts.  For example, here's my boot
> >> of next-20140910:
> >> http://images.armcloud.us/kernel-ci/next/next-20140910/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
> >>
> >> However, my userspace is much simpler and doesn't seem to cause a panic,
> >> so my boot tests report passing. (I should fixup my scripts so these
> >> imprecise aborts are reported as a FAIL.)
> >>
> >> I'm glad you pointed out that it happens only with exynos_defconfig and
> >> not multi_v7_defconfig because I noticed that too.  I haven't had the
> >> time to track it any further than that, so maybe the exynos folks can
> >> help track it down from here.
> >>
> >> Thanks for reporting this,
> >>
> >> Kevin
> >
> > Hi Tyler, Kevin,
> >
> > From the bootlog you have shared,
> >
> > [    1.060016] CPU4: failed to come online
> > [    2.070031] CPU5: failed to come online
> > [    3.080049] CPU6: failed to come online
> > [    4.090066] CPU7: failed to come online
> > [    4.090099] Brought up 4 CPUs
> > [    4.090109] SMP: Total of 4 processors activated.
> > [    4.090119] CPU: WARNING: CPU(s) started in wrong/inconsistent
> > modes (primary CPU mode 0x13)
> > [    4.090128] CPU: This may indicate a broken bootloader or firmware.
> >
> > Would it be possible to set max cpus to 1, disable switcher and try

Which "max cpus" do you mean?  For NR_CPUS the lower limit is 2.

> > again. I don't have a arndale octa board but I have tested mainline
> > kernel with smdk5420 board. It boots all eight CPUs, switcher works
> > fine and there are no imprecise aborts seen.
> 
> Sorry for the delay, I'm travelling this week.
> 
> FWIW, the same CPU boot failures you hilight above are happening on
> multi_v7_defconfig[1] which is not getting the imprecise abort.  This is
> only happening on exynos_defconfig[2], so I'm curious why you think the
> switcher or NR_CPUS might be the issues.
> 
> Anyways, I narrowed this down a bit and discovered it's
> CONFIG_EXYNOS5420_MCPM=y that's the root cause.  If I use
> exynos_defconfig and then disable that option, I don't get any more
> imprecise aborts.

I have exactly the same issue with Exynos5420 Arndale Octa.

CONFIG_EXYNOS5420_MCPM was enabled by commit fc3791f3a95d
("ARM: exynos_defconfig: Update exynos_defconfig") which
get merged into v3.17-rc1.  It seems that this part of
the patch should be reverted for the final v3.17 kernel.

> Kevin
> 
> [1] http://images.armcloud.us/kernel-ci/mainline/v3.17-rc5-25-g8ba4caf/arm-multi_v7_defconfig/boot-exynos5420-arndale-octa.html
> [2] http://images.armcloud.us/kernel-ci/mainline/v3.17-rc5-25-g8ba4caf/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html

Best regards,
--
Bartlomiej Zolnierkiewicz
Samsung R&D Institute Poland
Samsung Electronics

^ permalink raw reply	[flat|nested] 14+ messages in thread

* exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig
@ 2014-09-30 18:17         ` Bartlomiej Zolnierkiewicz
  0 siblings, 0 replies; 14+ messages in thread
From: Bartlomiej Zolnierkiewicz @ 2014-09-30 18:17 UTC (permalink / raw)
  To: linux-arm-kernel


Hi,

On Wednesday, September 17, 2014 05:39:29 PM Kevin Hilman wrote:
> Thomas Abraham <ta.omasab@gmail.com> writes:
> 
> > On Thu, Sep 11, 2014 at 12:16 AM, Kevin Hilman <khilman@kernel.org> wrote:
> >> Tyler Baker <tyler.baker@linaro.org> writes:
> >>
> >>> Exynos5420-based Arndale octa boards have recently started failing boot
> >>> tests due to imprecise external aborts.  This only appears to happen
> >>> when using exynos_defconfig and boots fine with multi_v7_defconfig.  The
> >>> issue seems to be intermittent, so is not reliably reproducable and
> >>> difficult to bisect.  Here are a few boot logs from recent
> >>> mainline/linux-next kernels that are failing:
> >>
> >> FYI, I'm seeing the same periodic aborts.  For example, here's my boot
> >> of next-20140910:
> >> http://images.armcloud.us/kernel-ci/next/next-20140910/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
> >>
> >> However, my userspace is much simpler and doesn't seem to cause a panic,
> >> so my boot tests report passing. (I should fixup my scripts so these
> >> imprecise aborts are reported as a FAIL.)
> >>
> >> I'm glad you pointed out that it happens only with exynos_defconfig and
> >> not multi_v7_defconfig because I noticed that too.  I haven't had the
> >> time to track it any further than that, so maybe the exynos folks can
> >> help track it down from here.
> >>
> >> Thanks for reporting this,
> >>
> >> Kevin
> >
> > Hi Tyler, Kevin,
> >
> > From the bootlog you have shared,
> >
> > [    1.060016] CPU4: failed to come online
> > [    2.070031] CPU5: failed to come online
> > [    3.080049] CPU6: failed to come online
> > [    4.090066] CPU7: failed to come online
> > [    4.090099] Brought up 4 CPUs
> > [    4.090109] SMP: Total of 4 processors activated.
> > [    4.090119] CPU: WARNING: CPU(s) started in wrong/inconsistent
> > modes (primary CPU mode 0x13)
> > [    4.090128] CPU: This may indicate a broken bootloader or firmware.
> >
> > Would it be possible to set max cpus to 1, disable switcher and try

Which "max cpus" do you mean?  For NR_CPUS the lower limit is 2.

> > again. I don't have a arndale octa board but I have tested mainline
> > kernel with smdk5420 board. It boots all eight CPUs, switcher works
> > fine and there are no imprecise aborts seen.
> 
> Sorry for the delay, I'm travelling this week.
> 
> FWIW, the same CPU boot failures you hilight above are happening on
> multi_v7_defconfig[1] which is not getting the imprecise abort.  This is
> only happening on exynos_defconfig[2], so I'm curious why you think the
> switcher or NR_CPUS might be the issues.
> 
> Anyways, I narrowed this down a bit and discovered it's
> CONFIG_EXYNOS5420_MCPM=y that's the root cause.  If I use
> exynos_defconfig and then disable that option, I don't get any more
> imprecise aborts.

I have exactly the same issue with Exynos5420 Arndale Octa.

CONFIG_EXYNOS5420_MCPM was enabled by commit fc3791f3a95d
("ARM: exynos_defconfig: Update exynos_defconfig") which
get merged into v3.17-rc1.  It seems that this part of
the patch should be reverted for the final v3.17 kernel.

> Kevin
> 
> [1] http://images.armcloud.us/kernel-ci/mainline/v3.17-rc5-25-g8ba4caf/arm-multi_v7_defconfig/boot-exynos5420-arndale-octa.html
> [2] http://images.armcloud.us/kernel-ci/mainline/v3.17-rc5-25-g8ba4caf/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html

Best regards,
--
Bartlomiej Zolnierkiewicz
Samsung R&D Institute Poland
Samsung Electronics

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig
  2014-09-30 18:17         ` Bartlomiej Zolnierkiewicz
@ 2014-10-06  7:55           ` Thomas Abraham
  -1 siblings, 0 replies; 14+ messages in thread
From: Thomas Abraham @ 2014-10-06  7:55 UTC (permalink / raw)
  To: Bartlomiej Zolnierkiewicz
  Cc: Kevin Hilman, Tyler Baker, linux-samsung-soc, linux-arm-kernel,
	Sachin Kamat, Kukjin Kim

Hi Bartlomiej,

On Tue, Sep 30, 2014 at 11:47 PM, Bartlomiej Zolnierkiewicz
<b.zolnierkie@samsung.com> wrote:
>
> Hi,
>
> On Wednesday, September 17, 2014 05:39:29 PM Kevin Hilman wrote:
>> Thomas Abraham <ta.omasab@gmail.com> writes:
>>
>> > On Thu, Sep 11, 2014 at 12:16 AM, Kevin Hilman <khilman@kernel.org> wrote:
>> >> Tyler Baker <tyler.baker@linaro.org> writes:
>> >>
>> >>> Exynos5420-based Arndale octa boards have recently started failing boot
>> >>> tests due to imprecise external aborts.  This only appears to happen
>> >>> when using exynos_defconfig and boots fine with multi_v7_defconfig.  The
>> >>> issue seems to be intermittent, so is not reliably reproducable and
>> >>> difficult to bisect.  Here are a few boot logs from recent
>> >>> mainline/linux-next kernels that are failing:
>> >>
>> >> FYI, I'm seeing the same periodic aborts.  For example, here's my boot
>> >> of next-20140910:
>> >> http://images.armcloud.us/kernel-ci/next/next-20140910/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
>> >>
>> >> However, my userspace is much simpler and doesn't seem to cause a panic,
>> >> so my boot tests report passing. (I should fixup my scripts so these
>> >> imprecise aborts are reported as a FAIL.)
>> >>
>> >> I'm glad you pointed out that it happens only with exynos_defconfig and
>> >> not multi_v7_defconfig because I noticed that too.  I haven't had the
>> >> time to track it any further than that, so maybe the exynos folks can
>> >> help track it down from here.
>> >>
>> >> Thanks for reporting this,
>> >>
>> >> Kevin
>> >
>> > Hi Tyler, Kevin,
>> >
>> > From the bootlog you have shared,
>> >
>> > [    1.060016] CPU4: failed to come online
>> > [    2.070031] CPU5: failed to come online
>> > [    3.080049] CPU6: failed to come online
>> > [    4.090066] CPU7: failed to come online
>> > [    4.090099] Brought up 4 CPUs
>> > [    4.090109] SMP: Total of 4 processors activated.
>> > [    4.090119] CPU: WARNING: CPU(s) started in wrong/inconsistent
>> > modes (primary CPU mode 0x13)
>> > [    4.090128] CPU: This may indicate a broken bootloader or firmware.
>> >
>> > Would it be possible to set max cpus to 1, disable switcher and try
>
> Which "max cpus" do you mean?  For NR_CPUS the lower limit is 2.

The "maxcpus=1" kernel parameter could be used here. The reason for
trying with this option was to understand the probable location of
this imprecise abort (mcpm init or secondary cpu boot).

Thanks,
Thomas.

>
>> > again. I don't have a arndale octa board but I have tested mainline
>> > kernel with smdk5420 board. It boots all eight CPUs, switcher works
>> > fine and there are no imprecise aborts seen.
>>
>> Sorry for the delay, I'm travelling this week.
>>
>> FWIW, the same CPU boot failures you hilight above are happening on
>> multi_v7_defconfig[1] which is not getting the imprecise abort.  This is
>> only happening on exynos_defconfig[2], so I'm curious why you think the
>> switcher or NR_CPUS might be the issues.
>>
>> Anyways, I narrowed this down a bit and discovered it's
>> CONFIG_EXYNOS5420_MCPM=y that's the root cause.  If I use
>> exynos_defconfig and then disable that option, I don't get any more
>> imprecise aborts.
>
> I have exactly the same issue with Exynos5420 Arndale Octa.
>
> CONFIG_EXYNOS5420_MCPM was enabled by commit fc3791f3a95d
> ("ARM: exynos_defconfig: Update exynos_defconfig") which
> get merged into v3.17-rc1.  It seems that this part of
> the patch should be reverted for the final v3.17 kernel.
>
>> Kevin
>>
>> [1] http://images.armcloud.us/kernel-ci/mainline/v3.17-rc5-25-g8ba4caf/arm-multi_v7_defconfig/boot-exynos5420-arndale-octa.html
>> [2] http://images.armcloud.us/kernel-ci/mainline/v3.17-rc5-25-g8ba4caf/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
>
> Best regards,
> --
> Bartlomiej Zolnierkiewicz
> Samsung R&D Institute Poland
> Samsung Electronics
>

^ permalink raw reply	[flat|nested] 14+ messages in thread

* exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig
@ 2014-10-06  7:55           ` Thomas Abraham
  0 siblings, 0 replies; 14+ messages in thread
From: Thomas Abraham @ 2014-10-06  7:55 UTC (permalink / raw)
  To: linux-arm-kernel

Hi Bartlomiej,

On Tue, Sep 30, 2014 at 11:47 PM, Bartlomiej Zolnierkiewicz
<b.zolnierkie@samsung.com> wrote:
>
> Hi,
>
> On Wednesday, September 17, 2014 05:39:29 PM Kevin Hilman wrote:
>> Thomas Abraham <ta.omasab@gmail.com> writes:
>>
>> > On Thu, Sep 11, 2014 at 12:16 AM, Kevin Hilman <khilman@kernel.org> wrote:
>> >> Tyler Baker <tyler.baker@linaro.org> writes:
>> >>
>> >>> Exynos5420-based Arndale octa boards have recently started failing boot
>> >>> tests due to imprecise external aborts.  This only appears to happen
>> >>> when using exynos_defconfig and boots fine with multi_v7_defconfig.  The
>> >>> issue seems to be intermittent, so is not reliably reproducable and
>> >>> difficult to bisect.  Here are a few boot logs from recent
>> >>> mainline/linux-next kernels that are failing:
>> >>
>> >> FYI, I'm seeing the same periodic aborts.  For example, here's my boot
>> >> of next-20140910:
>> >> http://images.armcloud.us/kernel-ci/next/next-20140910/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
>> >>
>> >> However, my userspace is much simpler and doesn't seem to cause a panic,
>> >> so my boot tests report passing. (I should fixup my scripts so these
>> >> imprecise aborts are reported as a FAIL.)
>> >>
>> >> I'm glad you pointed out that it happens only with exynos_defconfig and
>> >> not multi_v7_defconfig because I noticed that too.  I haven't had the
>> >> time to track it any further than that, so maybe the exynos folks can
>> >> help track it down from here.
>> >>
>> >> Thanks for reporting this,
>> >>
>> >> Kevin
>> >
>> > Hi Tyler, Kevin,
>> >
>> > From the bootlog you have shared,
>> >
>> > [    1.060016] CPU4: failed to come online
>> > [    2.070031] CPU5: failed to come online
>> > [    3.080049] CPU6: failed to come online
>> > [    4.090066] CPU7: failed to come online
>> > [    4.090099] Brought up 4 CPUs
>> > [    4.090109] SMP: Total of 4 processors activated.
>> > [    4.090119] CPU: WARNING: CPU(s) started in wrong/inconsistent
>> > modes (primary CPU mode 0x13)
>> > [    4.090128] CPU: This may indicate a broken bootloader or firmware.
>> >
>> > Would it be possible to set max cpus to 1, disable switcher and try
>
> Which "max cpus" do you mean?  For NR_CPUS the lower limit is 2.

The "maxcpus=1" kernel parameter could be used here. The reason for
trying with this option was to understand the probable location of
this imprecise abort (mcpm init or secondary cpu boot).

Thanks,
Thomas.

>
>> > again. I don't have a arndale octa board but I have tested mainline
>> > kernel with smdk5420 board. It boots all eight CPUs, switcher works
>> > fine and there are no imprecise aborts seen.
>>
>> Sorry for the delay, I'm travelling this week.
>>
>> FWIW, the same CPU boot failures you hilight above are happening on
>> multi_v7_defconfig[1] which is not getting the imprecise abort.  This is
>> only happening on exynos_defconfig[2], so I'm curious why you think the
>> switcher or NR_CPUS might be the issues.
>>
>> Anyways, I narrowed this down a bit and discovered it's
>> CONFIG_EXYNOS5420_MCPM=y that's the root cause.  If I use
>> exynos_defconfig and then disable that option, I don't get any more
>> imprecise aborts.
>
> I have exactly the same issue with Exynos5420 Arndale Octa.
>
> CONFIG_EXYNOS5420_MCPM was enabled by commit fc3791f3a95d
> ("ARM: exynos_defconfig: Update exynos_defconfig") which
> get merged into v3.17-rc1.  It seems that this part of
> the patch should be reverted for the final v3.17 kernel.
>
>> Kevin
>>
>> [1] http://images.armcloud.us/kernel-ci/mainline/v3.17-rc5-25-g8ba4caf/arm-multi_v7_defconfig/boot-exynos5420-arndale-octa.html
>> [2] http://images.armcloud.us/kernel-ci/mainline/v3.17-rc5-25-g8ba4caf/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
>
> Best regards,
> --
> Bartlomiej Zolnierkiewicz
> Samsung R&D Institute Poland
> Samsung Electronics
>

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig
  2014-09-18  0:39       ` Kevin Hilman
@ 2014-10-31 17:41         ` Kevin Hilman
  -1 siblings, 0 replies; 14+ messages in thread
From: Kevin Hilman @ 2014-10-31 17:41 UTC (permalink / raw)
  To: Kevin Hilman
  Cc: Thomas Abraham, Tyler Baker, linux-samsung-soc, linux-arm-kernel

On Wed, Sep 17, 2014 at 5:39 PM, Kevin Hilman <khilman@kernel.org> wrote:
> Thomas Abraham <ta.omasab@gmail.com> writes:
>
>> On Thu, Sep 11, 2014 at 12:16 AM, Kevin Hilman <khilman@kernel.org> wrote:
>>> Tyler Baker <tyler.baker@linaro.org> writes:
>>>
>>>> Exynos5420-based Arndale octa boards have recently started failing boot
>>>> tests due to imprecise external aborts.  This only appears to happen
>>>> when using exynos_defconfig and boots fine with multi_v7_defconfig.  The
>>>> issue seems to be intermittent, so is not reliably reproducable and
>>>> difficult to bisect.  Here are a few boot logs from recent
>>>> mainline/linux-next kernels that are failing:
>>>
>>> FYI, I'm seeing the same periodic aborts.  For example, here's my boot
>>> of next-20140910:
>>> http://images.armcloud.us/kernel-ci/next/next-20140910/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
>>>
>>> However, my userspace is much simpler and doesn't seem to cause a panic,
>>> so my boot tests report passing. (I should fixup my scripts so these
>>> imprecise aborts are reported as a FAIL.)
>>>
>>> I'm glad you pointed out that it happens only with exynos_defconfig and
>>> not multi_v7_defconfig because I noticed that too.  I haven't had the
>>> time to track it any further than that, so maybe the exynos folks can
>>> help track it down from here.
>>>
>>> Thanks for reporting this,
>>>
>>> Kevin
>>
>> Hi Tyler, Kevin,
>>
>> From the bootlog you have shared,
>>
>> [    1.060016] CPU4: failed to come online
>> [    2.070031] CPU5: failed to come online
>> [    3.080049] CPU6: failed to come online
>> [    4.090066] CPU7: failed to come online
>> [    4.090099] Brought up 4 CPUs
>> [    4.090109] SMP: Total of 4 processors activated.
>> [    4.090119] CPU: WARNING: CPU(s) started in wrong/inconsistent
>> modes (primary CPU mode 0x13)
>> [    4.090128] CPU: This may indicate a broken bootloader or firmware.
>>
>> Would it be possible to set max cpus to 1, disable switcher and try
>> again. I don't have a arndale octa board but I have tested mainline
>> kernel with smdk5420 board. It boots all eight CPUs, switcher works
>> fine and there are no imprecise aborts seen.
>
> Sorry for the delay, I'm travelling this week.
>
> FWIW, the same CPU boot failures you hilight above are happening on
> multi_v7_defconfig[1] which is not getting the imprecise abort.  This is
> only happening on exynos_defconfig[2], so I'm curious why you think the
> switcher or NR_CPUS might be the issues.
>
> Anyways, I narrowed this down a bit and discovered it's
> CONFIG_EXYNOS5420_MCPM=y that's the root cause.  If I use
> exynos_defconfig and then disable that option, I don't get any more
> imprecise aborts.

These imprecise aborts are still happening, and preventing running
full userspace.

I'm going to send a patch to disable this CONFIG_EXYNOS5420_MCPM until
someone can figure out what's going on.

Kevin

^ permalink raw reply	[flat|nested] 14+ messages in thread

* exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig
@ 2014-10-31 17:41         ` Kevin Hilman
  0 siblings, 0 replies; 14+ messages in thread
From: Kevin Hilman @ 2014-10-31 17:41 UTC (permalink / raw)
  To: linux-arm-kernel

On Wed, Sep 17, 2014 at 5:39 PM, Kevin Hilman <khilman@kernel.org> wrote:
> Thomas Abraham <ta.omasab@gmail.com> writes:
>
>> On Thu, Sep 11, 2014 at 12:16 AM, Kevin Hilman <khilman@kernel.org> wrote:
>>> Tyler Baker <tyler.baker@linaro.org> writes:
>>>
>>>> Exynos5420-based Arndale octa boards have recently started failing boot
>>>> tests due to imprecise external aborts.  This only appears to happen
>>>> when using exynos_defconfig and boots fine with multi_v7_defconfig.  The
>>>> issue seems to be intermittent, so is not reliably reproducable and
>>>> difficult to bisect.  Here are a few boot logs from recent
>>>> mainline/linux-next kernels that are failing:
>>>
>>> FYI, I'm seeing the same periodic aborts.  For example, here's my boot
>>> of next-20140910:
>>> http://images.armcloud.us/kernel-ci/next/next-20140910/arm-exynos_defconfig/boot-exynos5420-arndale-octa.html
>>>
>>> However, my userspace is much simpler and doesn't seem to cause a panic,
>>> so my boot tests report passing. (I should fixup my scripts so these
>>> imprecise aborts are reported as a FAIL.)
>>>
>>> I'm glad you pointed out that it happens only with exynos_defconfig and
>>> not multi_v7_defconfig because I noticed that too.  I haven't had the
>>> time to track it any further than that, so maybe the exynos folks can
>>> help track it down from here.
>>>
>>> Thanks for reporting this,
>>>
>>> Kevin
>>
>> Hi Tyler, Kevin,
>>
>> From the bootlog you have shared,
>>
>> [    1.060016] CPU4: failed to come online
>> [    2.070031] CPU5: failed to come online
>> [    3.080049] CPU6: failed to come online
>> [    4.090066] CPU7: failed to come online
>> [    4.090099] Brought up 4 CPUs
>> [    4.090109] SMP: Total of 4 processors activated.
>> [    4.090119] CPU: WARNING: CPU(s) started in wrong/inconsistent
>> modes (primary CPU mode 0x13)
>> [    4.090128] CPU: This may indicate a broken bootloader or firmware.
>>
>> Would it be possible to set max cpus to 1, disable switcher and try
>> again. I don't have a arndale octa board but I have tested mainline
>> kernel with smdk5420 board. It boots all eight CPUs, switcher works
>> fine and there are no imprecise aborts seen.
>
> Sorry for the delay, I'm travelling this week.
>
> FWIW, the same CPU boot failures you hilight above are happening on
> multi_v7_defconfig[1] which is not getting the imprecise abort.  This is
> only happening on exynos_defconfig[2], so I'm curious why you think the
> switcher or NR_CPUS might be the issues.
>
> Anyways, I narrowed this down a bit and discovered it's
> CONFIG_EXYNOS5420_MCPM=y that's the root cause.  If I use
> exynos_defconfig and then disable that option, I don't get any more
> imprecise aborts.

These imprecise aborts are still happening, and preventing running
full userspace.

I'm going to send a patch to disable this CONFIG_EXYNOS5420_MCPM until
someone can figure out what's going on.

Kevin

^ permalink raw reply	[flat|nested] 14+ messages in thread

end of thread, other threads:[~2014-10-31 17:41 UTC | newest]

Thread overview: 14+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2014-09-10 18:20 exynos5420/arndale-octa: imprecise external aborts on exynos_defconfig Tyler Baker
2014-09-10 18:20 ` Tyler Baker
2014-09-10 18:46 ` Kevin Hilman
2014-09-10 18:46   ` Kevin Hilman
2014-09-13 12:59   ` Thomas Abraham
2014-09-13 12:59     ` Thomas Abraham
2014-09-18  0:39     ` Kevin Hilman
2014-09-18  0:39       ` Kevin Hilman
2014-09-30 18:17       ` Bartlomiej Zolnierkiewicz
2014-09-30 18:17         ` Bartlomiej Zolnierkiewicz
2014-10-06  7:55         ` Thomas Abraham
2014-10-06  7:55           ` Thomas Abraham
2014-10-31 17:41       ` Kevin Hilman
2014-10-31 17:41         ` Kevin Hilman

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.