All of lore.kernel.org
 help / color / mirror / Atom feed
* [BISECTED][REGRESSION] Kernel does not boot on EeePC
@ 2014-06-08  9:02 Sitsofe Wheeler
  2014-06-08  9:45 ` Ingo Molnar
  2014-06-09 10:26 ` Igor Mammedov
  0 siblings, 2 replies; 10+ messages in thread
From: Sitsofe Wheeler @ 2014-06-08  9:02 UTC (permalink / raw)
  To: Igor Mammedov; +Cc: Toshi Kani, Thomas Gleixner, Ingo Molnar, linux-kernel

Hi,

The latest kernel (c593e8978722f7f4a12932733cfeed6c0c74fbaa) refuses to
boot on my EeePC - after grub is finished the screen just remains black
and the only thing that does something is pressing the power button. The
problem commit seems to be:

commit 3e1a878b7ccdb31da6d9d2b855c72ad87afeba3f
Author: Igor Mammedov <imammedo@redhat.com>
Date:   Thu Jun 5 15:42:45 2014 +0200

    x86/smpboot: Initialize secondary CPU only if master CPU will wait for it

Here's the bisection log:
# bad: [c593e8978722f7f4a12932733cfeed6c0c74fbaa] Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/mason/linux-btrfs
# good: [14186fea0cb06bc43181ce239efe0df6f1af260a] Merge tag 'locks-v3.15-4' of git://git.samba.org/jlayton/linux
git bisect start 'HEAD@{3}' 'HEAD@{4}'
# good: [5fa6a683c061cd4662b436135cc76552dbc29a43] Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net
git bisect good 5fa6a683c061cd4662b436135cc76552dbc29a43
# good: [9f12600fe425bc28f0ccba034a77783c09c15af4] dcache: add missing lockdep annotation
git bisect good 9f12600fe425bc28f0ccba034a77783c09c15af4
# good: [ca755175f245b91f72cfa474aaa8acd9c26996f4] Merge tag 'md/3.15-fixes' of git://neil.brown.name/md
git bisect good ca755175f245b91f72cfa474aaa8acd9c26996f4
# bad: [3e1a878b7ccdb31da6d9d2b855c72ad87afeba3f] x86/smpboot: Initialize secondary CPU only if master CPU will wait for it
git bisect bad 3e1a878b7ccdb31da6d9d2b855c72ad87afeba3f
# good: [c65c7a306610ee7c13669a8f5601b472c19dc6f1] bridge: notify user space after fdb update
git bisect good c65c7a306610ee7c13669a8f5601b472c19dc6f1
# good: [3aab01d800eb195de3f273d9a195fe070a507d56] net: ec_bhf: Add runtime dependencies
git bisect good 3aab01d800eb195de3f273d9a195fe070a507d56
# good: [d2cfd3105094f593bc1fbd0b042a7752ddf08691] Merge tag 'sound-3.15' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound
git bisect good d2cfd3105094f593bc1fbd0b042a7752ddf08691
# good: [89f898c1e195fa6235c869bb457e500b7b3ac49d] x86: Fix list/memory corruption on CPU hotplug
git bisect good 89f898c1e195fa6235c869bb457e500b7b3ac49d
# good: [feef1e8ecbadf24f8e6829c935df8683cabae41b] x86/smpboot: Log error on secondary CPU wakeup failure at ERR level
git bisect good feef1e8ecbadf24f8e6829c935df8683cabae41b
# first bad commit: [3e1a878b7ccdb31da6d9d2b855c72ad87afeba3f] x86/smpboot: Initialize secondary CPU only if master CPU will wait for it

-- 
Sitsofe | http://sucs.org/~sits/

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [BISECTED][REGRESSION] Kernel does not boot on EeePC
  2014-06-08  9:02 [BISECTED][REGRESSION] Kernel does not boot on EeePC Sitsofe Wheeler
@ 2014-06-08  9:45 ` Ingo Molnar
  2014-06-08 17:50   ` H. Peter Anvin
  2014-06-09 10:26 ` Igor Mammedov
  1 sibling, 1 reply; 10+ messages in thread
From: Ingo Molnar @ 2014-06-08  9:45 UTC (permalink / raw)
  To: Sitsofe Wheeler, Linus Torvalds
  Cc: Igor Mammedov, Toshi Kani, Thomas Gleixner, linux-kernel, H. Peter Anvin


* Sitsofe Wheeler <sitsofe@gmail.com> wrote:

> Hi,
> 
> The latest kernel (c593e8978722f7f4a12932733cfeed6c0c74fbaa) refuses to
> boot on my EeePC - after grub is finished the screen just remains black
> and the only thing that does something is pressing the power button. The
> problem commit seems to be:
> 
> commit 3e1a878b7ccdb31da6d9d2b855c72ad87afeba3f
> Author: Igor Mammedov <imammedo@redhat.com>
> Date:   Thu Jun 5 15:42:45 2014 +0200
> 
>     x86/smpboot: Initialize secondary CPU only if master CPU will wait for it

Argh, and have read this just a minute after expressing my hope that 
the hotplug fix is correct :-/

Linus, please revert commit 3e1a878b7ccdb.

  Acked-by: Ingo Molnar <mingo@kernel.org>

Thanks,

	Ingo

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [BISECTED][REGRESSION] Kernel does not boot on EeePC
  2014-06-08  9:45 ` Ingo Molnar
@ 2014-06-08 17:50   ` H. Peter Anvin
  2014-06-09 14:02     ` Ingo Molnar
  0 siblings, 1 reply; 10+ messages in thread
From: H. Peter Anvin @ 2014-06-08 17:50 UTC (permalink / raw)
  To: Ingo Molnar, Sitsofe Wheeler, Linus Torvalds
  Cc: Igor Mammedov, Toshi Kani, Thomas Gleixner, linux-kernel

On 06/08/2014 02:45 AM, Ingo Molnar wrote:
> 
> * Sitsofe Wheeler <sitsofe@gmail.com> wrote:
> 
>> Hi,
>>
>> The latest kernel (c593e8978722f7f4a12932733cfeed6c0c74fbaa) refuses to
>> boot on my EeePC - after grub is finished the screen just remains black
>> and the only thing that does something is pressing the power button. The
>> problem commit seems to be:
>>
>> commit 3e1a878b7ccdb31da6d9d2b855c72ad87afeba3f
>> Author: Igor Mammedov <imammedo@redhat.com>
>> Date:   Thu Jun 5 15:42:45 2014 +0200
>>
>>     x86/smpboot: Initialize secondary CPU only if master CPU will wait for it
> 
> Argh, and have read this just a minute after expressing my hope that 
> the hotplug fix is correct :-/
> 
> Linus, please revert commit 3e1a878b7ccdb.
> 
>   Acked-by: Ingo Molnar <mingo@kernel.org>
> 

I really want to say *THANK YOU* to Sitsofe for a very timely and
complete report.  You really saved us some highly valuable time.

	-hpa



^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [BISECTED][REGRESSION] Kernel does not boot on EeePC
  2014-06-08  9:02 [BISECTED][REGRESSION] Kernel does not boot on EeePC Sitsofe Wheeler
  2014-06-08  9:45 ` Ingo Molnar
@ 2014-06-09 10:26 ` Igor Mammedov
  2014-06-11  6:16   ` Sitsofe Wheeler
  1 sibling, 1 reply; 10+ messages in thread
From: Igor Mammedov @ 2014-06-09 10:26 UTC (permalink / raw)
  To: Sitsofe Wheeler; +Cc: Toshi Kani, Thomas Gleixner, Ingo Molnar, linux-kernel

On Sun, 8 Jun 2014 10:02:23 +0100
Sitsofe Wheeler <sitsofe@gmail.com> wrote:

> Hi,
> 
> The latest kernel (c593e8978722f7f4a12932733cfeed6c0c74fbaa) refuses to
> boot on my EeePC - after grub is finished the screen just remains black
> and the only thing that does something is pressing the power button. The
> problem commit seems to be:
I'm sorry for introducing mess, I don't have EeePC to test this on.
Could you help to track problem on it?
Is there any output if you boot affected kernel with
"earlyprintk=vga ignore_loglevel" options?
 

> 
> commit 3e1a878b7ccdb31da6d9d2b855c72ad87afeba3f
> Author: Igor Mammedov <imammedo@redhat.com>
> Date:   Thu Jun 5 15:42:45 2014 +0200
> 
>     x86/smpboot: Initialize secondary CPU only if master CPU will wait for it
> 
> Here's the bisection log:
> # bad: [c593e8978722f7f4a12932733cfeed6c0c74fbaa] Merge branch 'for-linus'
> of git://git.kernel.org/pub/scm/linux/kernel/git/mason/linux-btrfs # good:
> [14186fea0cb06bc43181ce239efe0df6f1af260a] Merge tag 'locks-v3.15-4' of
> git://git.samba.org/jlayton/linux git bisect start 'HEAD@{3}' 'HEAD@{4}' #
> good: [5fa6a683c061cd4662b436135cc76552dbc29a43] Merge
> git://git.kernel.org/pub/scm/linux/kernel/git/davem/net git bisect good
> 5fa6a683c061cd4662b436135cc76552dbc29a43 # good:
> [9f12600fe425bc28f0ccba034a77783c09c15af4] dcache: add missing lockdep
> annotation git bisect good 9f12600fe425bc28f0ccba034a77783c09c15af4 # good:
> [ca755175f245b91f72cfa474aaa8acd9c26996f4] Merge tag 'md/3.15-fixes' of
> git://neil.brown.name/md git bisect good
> ca755175f245b91f72cfa474aaa8acd9c26996f4 # bad:
> [3e1a878b7ccdb31da6d9d2b855c72ad87afeba3f] x86/smpboot: Initialize secondary
> CPU only if master CPU will wait for it git bisect bad
> 3e1a878b7ccdb31da6d9d2b855c72ad87afeba3f # good:
> [c65c7a306610ee7c13669a8f5601b472c19dc6f1] bridge: notify user space after
> fdb update git bisect good c65c7a306610ee7c13669a8f5601b472c19dc6f1 # good:
> [3aab01d800eb195de3f273d9a195fe070a507d56] net: ec_bhf: Add runtime
> dependencies git bisect good 3aab01d800eb195de3f273d9a195fe070a507d56 #
> good: [d2cfd3105094f593bc1fbd0b042a7752ddf08691] Merge tag 'sound-3.15' of
> git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound git bisect good
> d2cfd3105094f593bc1fbd0b042a7752ddf08691 # good:
> [89f898c1e195fa6235c869bb457e500b7b3ac49d] x86: Fix list/memory corruption
> on CPU hotplug git bisect good 89f898c1e195fa6235c869bb457e500b7b3ac49d #
> good: [feef1e8ecbadf24f8e6829c935df8683cabae41b] x86/smpboot: Log error on
> secondary CPU wakeup failure at ERR level git bisect good
> feef1e8ecbadf24f8e6829c935df8683cabae41b # first bad commit:
> [3e1a878b7ccdb31da6d9d2b855c72ad87afeba3f] x86/smpboot: Initialize secondary
> CPU only if master CPU will wait for it
> 
> -- 
> Sitsofe | http://sucs.org/~sits/


-- 
Regards,
  Igor

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [BISECTED][REGRESSION] Kernel does not boot on EeePC
  2014-06-08 17:50   ` H. Peter Anvin
@ 2014-06-09 14:02     ` Ingo Molnar
  2014-06-10 19:19       ` Sitsofe Wheeler
  0 siblings, 1 reply; 10+ messages in thread
From: Ingo Molnar @ 2014-06-09 14:02 UTC (permalink / raw)
  To: H. Peter Anvin
  Cc: Sitsofe Wheeler, Linus Torvalds, Igor Mammedov, Toshi Kani,
	Thomas Gleixner, linux-kernel


* H. Peter Anvin <hpa@zytor.com> wrote:

> On 06/08/2014 02:45 AM, Ingo Molnar wrote:
> > 
> > * Sitsofe Wheeler <sitsofe@gmail.com> wrote:
> > 
> >> Hi,
> >>
> >> The latest kernel (c593e8978722f7f4a12932733cfeed6c0c74fbaa) refuses to
> >> boot on my EeePC - after grub is finished the screen just remains black
> >> and the only thing that does something is pressing the power button. The
> >> problem commit seems to be:
> >>
> >> commit 3e1a878b7ccdb31da6d9d2b855c72ad87afeba3f
> >> Author: Igor Mammedov <imammedo@redhat.com>
> >> Date:   Thu Jun 5 15:42:45 2014 +0200
> >>
> >>     x86/smpboot: Initialize secondary CPU only if master CPU will wait for it
> > 
> > Argh, and have read this just a minute after expressing my hope that 
> > the hotplug fix is correct :-/
> > 
> > Linus, please revert commit 3e1a878b7ccdb.
> > 
> >   Acked-by: Ingo Molnar <mingo@kernel.org>
> > 
> 
> I really want to say *THANK YOU* to Sitsofe for a very timely and 
> complete report.  You really saved us some highly valuable time.

Seconded!

Thanks,

	Ingo

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [BISECTED][REGRESSION] Kernel does not boot on EeePC
  2014-06-09 14:02     ` Ingo Molnar
@ 2014-06-10 19:19       ` Sitsofe Wheeler
  0 siblings, 0 replies; 10+ messages in thread
From: Sitsofe Wheeler @ 2014-06-10 19:19 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: H. Peter Anvin, Linus Torvalds, Igor Mammedov, Toshi Kani,
	Thomas Gleixner, linux-kernel

On Mon, Jun 09, 2014 at 04:02:15PM +0200, Ingo Molnar wrote:
> 
> * H. Peter Anvin <hpa@zytor.com> wrote:
> 
> > I really want to say *THANK YOU* to Sitsofe for a very timely and 
> > complete report.  You really saved us some highly valuable time.
> 
> Seconded!

You're welcome. For what it's worth these are the things that allowed
this drive-by testing to occur:

git bisect:
Without this there would have been a generic complaint that would have
only gone to the general kernel mailing list. I thought the issue was
going to lie with the GPU drivers but just following the bisection and
testing the final commit zoned in on the true cause.

Able to cut out most features:
I tend to build cut down kernels which only supports my hardware because
they take less time to compile and boot faster.

Single file builds:
I often do builds on Internet connected machines. Because I can 
can build everything in I only have one file to copy at the end.

Distro doesn't need initrd/initramfs:
One less thing to setup because everything is built in but some distros
seem dependent on having one these days.

I guess the one nice thing to have in the future would be a make
checkedconfig that automatically turned on all checking/verification
features (e.g. slab debugging/memory poisoning) for the current config.
This is would make it easier to get such kernels 

-- 
Sitsofe | http://sucs.org/~sits/

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [BISECTED][REGRESSION] Kernel does not boot on EeePC
  2014-06-09 10:26 ` Igor Mammedov
@ 2014-06-11  6:16   ` Sitsofe Wheeler
  2014-06-11 12:48     ` Igor Mammedov
  0 siblings, 1 reply; 10+ messages in thread
From: Sitsofe Wheeler @ 2014-06-11  6:16 UTC (permalink / raw)
  To: Igor Mammedov; +Cc: Toshi Kani, Thomas Gleixner, Ingo Molnar, linux-kernel

On Mon, Jun 09, 2014 at 12:26:16PM +0200, Igor Mammedov wrote:
> On Sun, 8 Jun 2014 10:02:23 +0100
> Sitsofe Wheeler <sitsofe@gmail.com> wrote:
> 
> > The latest kernel (c593e8978722f7f4a12932733cfeed6c0c74fbaa) refuses to
> > boot on my EeePC - after grub is finished the screen just remains black
> > and the only thing that does something is pressing the power button. The
> > problem commit seems to be:
> I'm sorry for introducing mess, I don't have EeePC to test this on.
> Could you help to track problem on it?
> Is there any output if you boot affected kernel with
> "earlyprintk=vga ignore_loglevel" options?

It basically gives up just before Initializing CPU#0. The following is
an approximation of the boot messages that I see on the screen (the
early ones are gone before I can eyeball them so this is a blend of a
previous dmesg and what I see on the screen during a failure):

[    0.000000] Linux version 3.15.0-0001-gd37cf73 (swheeler@eject) (gcc version 4.8.2 20131212 (Red Hat 4.8.2-7) (GCC) ) #52 Sun Jun 8 22:43:01 BST 2014
[    0.000000] KERNEL supported cpus:
[    0.000000]   Intel GenuineIntel
[    0.000000] Disabled fast string operations
[    0.000000] e820: BIOS-provided physical RAM map:
[    0.000000] BIOS-e820: [mem 0x0000000000000000-0x000000000009fbff] usable
[    0.000000] BIOS-e820: [mem 0x000000000009fc00-0x000000000009ffff] reserved
[    0.000000] BIOS-e820: [mem 0x00000000000e4000-0x00000000000fffff] reserved
[    0.000000] BIOS-e820: [mem 0x0000000000100000-0x000000003f77ffff] usable
[    0.000000] BIOS-e820: [mem 0x000000003f780000-0x000000003f78ffff] ACPI data
[    0.000000] BIOS-e820: [mem 0x000000003f790000-0x000000003f7cffff] ACPI NVS
[    0.000000] BIOS-e820: [mem 0x000000003f7d0000-0x000000003f7ddfff] reserved
[    0.000000] BIOS-e820: [mem 0x000000003f7e0000-0x000000003f7fffff] reserved
[    0.000000] BIOS-e820: [mem 0x00000000fee00000-0x00000000fee00fff] reserved
[    0.000000] BIOS-e820: [mem 0x00000000fff80000-0x00000000ffffffff] reserved
[    0.000000] bootconsole [earlyvga0] enabled
[    0.000000] Notice: NX (Execute Disable) protection cannot be enabled: non-PAE kernel!
[    0.000000] SMBIOS 2.5 present.
[    0.000000] DMI: ASUSTeK Computer INC. 900/900, BIOS 1006    03/03/2009
[    0.000000] e820: update [mem 0x00000000-0x00000fff] usable ==> reserved
[    0.000000] e820: remove [mem 0x000a0000-0x000fffff] usable
[    0.000000] e820: last_pfn = 0x3f780 max_arch_pfn = 0x100000
[    0.000000] MTRR default type: uncachable
[    0.000000] MTRR fixed ranges enabled:
[    0.000000]   00000-9FFFF write-back
[    0.000000]   A0000-DFFFF uncachable
[    0.000000]   E0000-EFFFF write-through
[    0.000000]   F0000-FFFFF write-protect
[    0.000000] MTRR variable ranges enabled:
[    0.000000]   0 base 000000000 mask FC0000000 write-back
[    0.000000]   1 base 03F800000 mask FFF800000 uncachable
[    0.000000]   2 disabled
[    0.000000]   3 disabled
[    0.000000]   4 disabled
[    0.000000]   5 disabled
[    0.000000]   6 disabled
[    0.000000]   7 disabled
[    0.000000] initial memory mapped: [mem 0x00000000-0x00bfffff]
[    0.000000] Base memory trampoline at [b009b000] 9b000 size 16384
[    0.000000] init_memory_mapping: [mem 0x00000000-0x000fffff]
[    0.000000]  [mem 0x00000000-0x000fffff] page 4k
[    0.000000] init_memory_mapping: [mem 0x3f000000-0x3f3fffff]
[    0.000000]  [mem 0x3f000000-0x3f3fffff] page 2M
[    0.000000] init_memory_mapping: [mem 0x38000000-0x3effffff]
[    0.000000]  [mem 0x38000000-0x3effffff] page 2M
[    0.000000] init_memory_mapping: [mem 0x00100000-0x37ffffff]
[    0.000000]  [mem 0x00100000-0x003fffff] page 4k
[    0.000000]  [mem 0x00400000-0x37ffffff] page 2M
[    0.000000] init_memory_mapping: [mem 0x3f400000-0x3f77ffff]
[    0.000000]  [mem 0x3f400000-0x3f77ffff] page 4k
[    0.000000] BRK [0x00667000, 0x00667fff] PGTABLE
[    0.000000] ACPI: RSDP 0x000FBE90 000014 (v00 ACPIAM)
[    0.000000] ACPI: RSDT 0x3F780000 000034 (v01 A M I  OEMRSDT  03000903 MSFT 00000097)
[    0.000000] ACPI: FACP 0x3F780200 000081 (v01 A M I  OEMFACP  03000903 MSFT 00000097)
[    0.000000] ACPI: DSDT 0x3F780400 0060FD (v01 A0979  A0979036 00000036 INTL 20060113)
[    0.000000] ACPI: FACS 0x3F790000 000040
[    0.000000] ACPI: APIC 0x3F780390 000068 (v01 A M I  OEMAPIC  03000903 MSFT 00000097)
[    0.000000] ACPI: OEMB 0x3F790040 000046 (v01 A M I  AMI_OEM  03000903 MSFT 00000097)
[    0.000000] ACPI: MCFG 0x3F786500 00003C (v01 A M I  OEMMCFG  03000903 MSFT 00000097)
[    0.000000] ACPI: Local APIC address 0xfee00000
[    0.000000] 1015MB LOWMEM available.
[    0.000000]   mapped low ram: 0 - 3f780000
[    0.000000]   low ram: 0 - 3f780000
[    0.000000] Zone ranges:
[    0.000000]   Normal   [mem 0x00001000-0x3f77ffff]
[    0.000000] Movable zone start for each node
[    0.000000] Early memory node ranges
[    0.000000]   node   0: [mem 0x00001000-0x0009efff]
[    0.000000]   node   0: [mem 0x00100000-0x3f77ffff]
[    0.000000] On node 0 totalpages: 259870
[    0.000000] free_area_init_node: node 0, pgdat b05e48c4, node_mem_map eef90020
[    0.000000]   Normal zone: 2031 pages used for memmap
[    0.000000]   Normal zone: 0 pages reserved
[    0.000000]   Normal zone: 259870 pages, LIFO batch:31
[    0.000000] Using APIC driver default
[    0.000000] Reserving Intel graphics stolen memory at 0x3f800000-0x3fffffff
[    0.000000] ACPI: PM-Timer IO Port: 0x808
[    0.000000] ACPI: Local APIC address 0xfee00000
[    0.000000] ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled)
[    0.000000] ACPI: IOAPIC (id[0x01] address[0xfec00000] gsi_base[0])
[    0.000000] IOAPIC[0]: apic_id 1, version 32, address 0xfec00000, GSI 0-23
[    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
[    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
[    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
[    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
[    0.000000] ACPI: IRQ0 used by override.
[    0.000000] ACPI: IRQ2 used by override.
[    0.000000] ACPI: IRQ9 used by override.
[    0.000000] Using ACPI (MADT) for SMP configuration information
[    0.000000] nr_irqs_gsi: 40
[    0.000000] e820: [mem 0x40000000-0xfedfffff] available for PCI devices
[    0.000000] pcpu-alloc: s0 r0 d32768 u32768 alloc=1*32768
[    0.000000] pcpu-alloc: [0] 0 
[    0.000000] Built 1 zonelists in Zone order, mobility grouping on.  Total pages: 257839
[    0.000000] Kernel command line: BOOT_IMAGE=/home/eeeuser/bzImage root=/dev/sdb2 ro rootfstype=ext4 usb_storage.delay_use=0 boot_delay=50 earlyprintk=vga ignore_loglevel
[    0.000000] PID hash table entries: 4096 (order: 2, 16384 bytes)
[    0.000000] Dentry cache hash table entries: 131072 (order: 7, 524288 bytes)
[    0.000000] Inode-cache hash table entries: 65536 (order: 6, 262144 bytes)

-- 
Sitsofe | http://sucs.org/~sits/

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [BISECTED][REGRESSION] Kernel does not boot on EeePC
  2014-06-11  6:16   ` Sitsofe Wheeler
@ 2014-06-11 12:48     ` Igor Mammedov
  2014-06-11 13:05       ` Sitsofe Wheeler
  0 siblings, 1 reply; 10+ messages in thread
From: Igor Mammedov @ 2014-06-11 12:48 UTC (permalink / raw)
  To: Sitsofe Wheeler; +Cc: Toshi Kani, Thomas Gleixner, Ingo Molnar, linux-kernel

On Wed, 11 Jun 2014 07:16:24 +0100
Sitsofe Wheeler <sitsofe@gmail.com> wrote:

> On Mon, Jun 09, 2014 at 12:26:16PM +0200, Igor Mammedov wrote:
> > On Sun, 8 Jun 2014 10:02:23 +0100
> > Sitsofe Wheeler <sitsofe@gmail.com> wrote:
> > 
> > > The latest kernel (c593e8978722f7f4a12932733cfeed6c0c74fbaa) refuses to
> > > boot on my EeePC - after grub is finished the screen just remains black
> > > and the only thing that does something is pressing the power button. The
> > > problem commit seems to be:
> > I'm sorry for introducing mess, I don't have EeePC to test this on.
> > Could you help to track problem on it?
> > Is there any output if you boot affected kernel with
> > "earlyprintk=vga ignore_loglevel" options?
> 
> It basically gives up just before Initializing CPU#0. The following is
> an approximation of the boot messages that I see on the screen (the
> early ones are gone before I can eyeball them so this is a blend of a
> previous dmesg and what I see on the screen during a failure):
Thanks for answer,

I've overlooked nonsmp case when testing where no one sets
cpu_callout_mask for boot CPU and as result kernel hangs in cpu_init()

#0  wait_for_master_cpu () at arch/x86/kernel/cpu/common.c:1232
#1  cpu_init () at arch/x86/kernel/cpu/common.c:1254
#2  0xffffffff81b0a038 in trap_init () at arch/x86/kernel/traps.c:786
#3  0xffffffff81b06ee3 in start_kernel () at init/main.c:536
#4  0xffffffff81b065df in x86_64_start_reservations (real_mode_data=<value optimized out>) at arch/x86/kernel/head64.c:193
#5  0xffffffff81b06785 in x86_64_start_kernel (real_mode_data=0x13dc0 <Address 0x13dc0 out of bounds>) at arch/x86/kernel/head64.c:182

I also need double-check xen setup since it also uses cpu_init().

I'll resubmit fixed patch after testing is done.

> 
> [    0.000000] Linux version 3.15.0-0001-gd37cf73 (swheeler@eject) (gcc version 4.8.2 20131212 (Red Hat 4.8.2-7) (GCC) ) #52 Sun Jun 8 22:43:01 BST 2014
> [    0.000000] KERNEL supported cpus:
> [    0.000000]   Intel GenuineIntel
> [    0.000000] Disabled fast string operations
> [    0.000000] e820: BIOS-provided physical RAM map:
> [    0.000000] BIOS-e820: [mem 0x0000000000000000-0x000000000009fbff] usable
> [    0.000000] BIOS-e820: [mem 0x000000000009fc00-0x000000000009ffff] reserved
> [    0.000000] BIOS-e820: [mem 0x00000000000e4000-0x00000000000fffff] reserved
> [    0.000000] BIOS-e820: [mem 0x0000000000100000-0x000000003f77ffff] usable
> [    0.000000] BIOS-e820: [mem 0x000000003f780000-0x000000003f78ffff] ACPI data
> [    0.000000] BIOS-e820: [mem 0x000000003f790000-0x000000003f7cffff] ACPI NVS
> [    0.000000] BIOS-e820: [mem 0x000000003f7d0000-0x000000003f7ddfff] reserved
> [    0.000000] BIOS-e820: [mem 0x000000003f7e0000-0x000000003f7fffff] reserved
> [    0.000000] BIOS-e820: [mem 0x00000000fee00000-0x00000000fee00fff] reserved
> [    0.000000] BIOS-e820: [mem 0x00000000fff80000-0x00000000ffffffff] reserved
> [    0.000000] bootconsole [earlyvga0] enabled
> [    0.000000] Notice: NX (Execute Disable) protection cannot be enabled: non-PAE kernel!
> [    0.000000] SMBIOS 2.5 present.
> [    0.000000] DMI: ASUSTeK Computer INC. 900/900, BIOS 1006    03/03/2009
> [    0.000000] e820: update [mem 0x00000000-0x00000fff] usable ==> reserved
> [    0.000000] e820: remove [mem 0x000a0000-0x000fffff] usable
> [    0.000000] e820: last_pfn = 0x3f780 max_arch_pfn = 0x100000
> [    0.000000] MTRR default type: uncachable
> [    0.000000] MTRR fixed ranges enabled:
> [    0.000000]   00000-9FFFF write-back
> [    0.000000]   A0000-DFFFF uncachable
> [    0.000000]   E0000-EFFFF write-through
> [    0.000000]   F0000-FFFFF write-protect
> [    0.000000] MTRR variable ranges enabled:
> [    0.000000]   0 base 000000000 mask FC0000000 write-back
> [    0.000000]   1 base 03F800000 mask FFF800000 uncachable
> [    0.000000]   2 disabled
> [    0.000000]   3 disabled
> [    0.000000]   4 disabled
> [    0.000000]   5 disabled
> [    0.000000]   6 disabled
> [    0.000000]   7 disabled
> [    0.000000] initial memory mapped: [mem 0x00000000-0x00bfffff]
> [    0.000000] Base memory trampoline at [b009b000] 9b000 size 16384
> [    0.000000] init_memory_mapping: [mem 0x00000000-0x000fffff]
> [    0.000000]  [mem 0x00000000-0x000fffff] page 4k
> [    0.000000] init_memory_mapping: [mem 0x3f000000-0x3f3fffff]
> [    0.000000]  [mem 0x3f000000-0x3f3fffff] page 2M
> [    0.000000] init_memory_mapping: [mem 0x38000000-0x3effffff]
> [    0.000000]  [mem 0x38000000-0x3effffff] page 2M
> [    0.000000] init_memory_mapping: [mem 0x00100000-0x37ffffff]
> [    0.000000]  [mem 0x00100000-0x003fffff] page 4k
> [    0.000000]  [mem 0x00400000-0x37ffffff] page 2M
> [    0.000000] init_memory_mapping: [mem 0x3f400000-0x3f77ffff]
> [    0.000000]  [mem 0x3f400000-0x3f77ffff] page 4k
> [    0.000000] BRK [0x00667000, 0x00667fff] PGTABLE
> [    0.000000] ACPI: RSDP 0x000FBE90 000014 (v00 ACPIAM)
> [    0.000000] ACPI: RSDT 0x3F780000 000034 (v01 A M I  OEMRSDT  03000903 MSFT 00000097)
> [    0.000000] ACPI: FACP 0x3F780200 000081 (v01 A M I  OEMFACP  03000903 MSFT 00000097)
> [    0.000000] ACPI: DSDT 0x3F780400 0060FD (v01 A0979  A0979036 00000036 INTL 20060113)
> [    0.000000] ACPI: FACS 0x3F790000 000040
> [    0.000000] ACPI: APIC 0x3F780390 000068 (v01 A M I  OEMAPIC  03000903 MSFT 00000097)
> [    0.000000] ACPI: OEMB 0x3F790040 000046 (v01 A M I  AMI_OEM  03000903 MSFT 00000097)
> [    0.000000] ACPI: MCFG 0x3F786500 00003C (v01 A M I  OEMMCFG  03000903 MSFT 00000097)
> [    0.000000] ACPI: Local APIC address 0xfee00000
> [    0.000000] 1015MB LOWMEM available.
> [    0.000000]   mapped low ram: 0 - 3f780000
> [    0.000000]   low ram: 0 - 3f780000
> [    0.000000] Zone ranges:
> [    0.000000]   Normal   [mem 0x00001000-0x3f77ffff]
> [    0.000000] Movable zone start for each node
> [    0.000000] Early memory node ranges
> [    0.000000]   node   0: [mem 0x00001000-0x0009efff]
> [    0.000000]   node   0: [mem 0x00100000-0x3f77ffff]
> [    0.000000] On node 0 totalpages: 259870
> [    0.000000] free_area_init_node: node 0, pgdat b05e48c4, node_mem_map eef90020
> [    0.000000]   Normal zone: 2031 pages used for memmap
> [    0.000000]   Normal zone: 0 pages reserved
> [    0.000000]   Normal zone: 259870 pages, LIFO batch:31
> [    0.000000] Using APIC driver default
> [    0.000000] Reserving Intel graphics stolen memory at 0x3f800000-0x3fffffff
> [    0.000000] ACPI: PM-Timer IO Port: 0x808
> [    0.000000] ACPI: Local APIC address 0xfee00000
> [    0.000000] ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled)
> [    0.000000] ACPI: IOAPIC (id[0x01] address[0xfec00000] gsi_base[0])
> [    0.000000] IOAPIC[0]: apic_id 1, version 32, address 0xfec00000, GSI 0-23
> [    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
> [    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
> [    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
> [    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
> [    0.000000] ACPI: IRQ0 used by override.
> [    0.000000] ACPI: IRQ2 used by override.
> [    0.000000] ACPI: IRQ9 used by override.
> [    0.000000] Using ACPI (MADT) for SMP configuration information
> [    0.000000] nr_irqs_gsi: 40
> [    0.000000] e820: [mem 0x40000000-0xfedfffff] available for PCI devices
> [    0.000000] pcpu-alloc: s0 r0 d32768 u32768 alloc=1*32768
> [    0.000000] pcpu-alloc: [0] 0 
> [    0.000000] Built 1 zonelists in Zone order, mobility grouping on.  Total pages: 257839
> [    0.000000] Kernel command line: BOOT_IMAGE=/home/eeeuser/bzImage root=/dev/sdb2 ro rootfstype=ext4 usb_storage.delay_use=0 boot_delay=50 earlyprintk=vga ignore_loglevel
> [    0.000000] PID hash table entries: 4096 (order: 2, 16384 bytes)
> [    0.000000] Dentry cache hash table entries: 131072 (order: 7, 524288 bytes)
> [    0.000000] Inode-cache hash table entries: 65536 (order: 6, 262144 bytes)
> 


^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [BISECTED][REGRESSION] Kernel does not boot on EeePC
  2014-06-11 12:48     ` Igor Mammedov
@ 2014-06-11 13:05       ` Sitsofe Wheeler
  2014-06-11 13:19         ` Igor Mammedov
  0 siblings, 1 reply; 10+ messages in thread
From: Sitsofe Wheeler @ 2014-06-11 13:05 UTC (permalink / raw)
  To: Igor Mammedov; +Cc: Toshi Kani, Thomas Gleixner, Ingo Molnar, linux-kernel

On Wed, Jun 11, 2014 at 02:48:46PM +0200, Igor Mammedov wrote:
> On Wed, 11 Jun 2014 07:16:24 +0100
> Sitsofe Wheeler <sitsofe@gmail.com> wrote:
> 
> > It basically gives up just before Initializing CPU#0. The following is
> > an approximation of the boot messages that I see on the screen (the
> > early ones are gone before I can eyeball them so this is a blend of a
> > previous dmesg and what I see on the screen during a failure):
> Thanks for answer,
> 
> I've overlooked nonsmp case when testing where no one sets
> cpu_callout_mask for boot CPU and as result kernel hangs in cpu_init()

What gave you the tip off?

Additionally, does that also mean Ingo's test stand doesn't run nonsmp
kernels?

-- 
Sitsofe | http://sucs.org/~sits/

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [BISECTED][REGRESSION] Kernel does not boot on EeePC
  2014-06-11 13:05       ` Sitsofe Wheeler
@ 2014-06-11 13:19         ` Igor Mammedov
  0 siblings, 0 replies; 10+ messages in thread
From: Igor Mammedov @ 2014-06-11 13:19 UTC (permalink / raw)
  To: Sitsofe Wheeler; +Cc: Toshi Kani, Thomas Gleixner, Ingo Molnar, linux-kernel

On Wed, 11 Jun 2014 14:05:58 +0100
Sitsofe Wheeler <sitsofe@gmail.com> wrote:

> On Wed, Jun 11, 2014 at 02:48:46PM +0200, Igor Mammedov wrote:
> > On Wed, 11 Jun 2014 07:16:24 +0100
> > Sitsofe Wheeler <sitsofe@gmail.com> wrote:
> > 
> > > It basically gives up just before Initializing CPU#0. The following is
> > > an approximation of the boot messages that I see on the screen (the
> > > early ones are gone before I can eyeball them so this is a blend of a
> > > previous dmesg and what I see on the screen during a failure):
> > Thanks for answer,
> > 
> > I've overlooked nonsmp case when testing where no one sets
> > cpu_callout_mask for boot CPU and as result kernel hangs in cpu_init()
> 
> What gave you the tip off?
Just that you were booting on 1 core CPU with custom config,
then the only place where it could hang would be cpu_init().


^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2014-06-11 13:19 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2014-06-08  9:02 [BISECTED][REGRESSION] Kernel does not boot on EeePC Sitsofe Wheeler
2014-06-08  9:45 ` Ingo Molnar
2014-06-08 17:50   ` H. Peter Anvin
2014-06-09 14:02     ` Ingo Molnar
2014-06-10 19:19       ` Sitsofe Wheeler
2014-06-09 10:26 ` Igor Mammedov
2014-06-11  6:16   ` Sitsofe Wheeler
2014-06-11 12:48     ` Igor Mammedov
2014-06-11 13:05       ` Sitsofe Wheeler
2014-06-11 13:19         ` Igor Mammedov

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.