linux-kernel.vger.kernel.org archive mirror
* [Bug report] Kernel 5.7 become unbootable
From: Mikhail Gavrilov @ 2020-04-09 19:12 UTC
  To: Linux List Kernel Mailing, linux-x86_64

Hi folks.
I use Fedora Rawhide, and today's update came with the new kernel
5.7.0-0.rc0.git6.1.
As it turned out, this kernel won't boot. Instead, I see a black
screen and sometimes these messages:
exit_boot() failed!
efi_main() failed!

The last working kernel is kernel-core-5.7.0-0.rc0.git2.1

I looked at the spec file and found that the last bootable kernel
corresponds to commit cad18da0afb1
(https://src.fedoraproject.org/rpms/kernel/c/9949db6070c6ca501d7235760ca8aba120db7448?branch=master),
and the first non-bootable kernel corresponds to commit a10c9c710f9e
(https://src.fedoraproject.org/rpms/kernel/c/0561e0bf0e459dd4b27a77670de838128b87bd0e?branch=master)

Knowing these two commits, I bisected the vanilla kernel:

$ git bisect log
git bisect start
# good: [cad18da0afb1bc7b37d73a74067ab7ff5974897c] Merge tag 'please-pull-ia64_for_5.7' of git://git.kernel.org/pub/scm/linux/kernel/git/aegl/linux
git bisect good cad18da0afb1bc7b37d73a74067ab7ff5974897c
# bad: [a10c9c710f9ecea87b9f4bbb837467893b4bef01] Merge tag 'for-v5.7' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply
git bisect bad a10c9c710f9ecea87b9f4bbb837467893b4bef01
# bad: [f365ab31efacb70bed1e821f7435626e0b2528a6] Merge tag 'drm-next-2020-04-01' of git://anongit.freedesktop.org/drm/drm
git bisect bad f365ab31efacb70bed1e821f7435626e0b2528a6
# good: [7f80ccfe996871ca69648efee74a60ae7ad0dcd9] net: ipv6: rpl_iptunnel: Fix potential memory leak in rpl_do_srh_inline
git bisect good 7f80ccfe996871ca69648efee74a60ae7ad0dcd9
# good: [9001b17698d86f842e2b13e0cafe8021d43209e9] Merge tag 'drm-intel-next-2020-03-13' of git://anongit.freedesktop.org/drm/drm-intel into drm-next
git bisect good 9001b17698d86f842e2b13e0cafe8021d43209e9
# bad: [dfabb077d62552797ca0ae7756cb30d3e195ead5] Merge tag 'mmc-v5.7' of git://git.kernel.org/pub/scm/linux/kernel/git/ulfh/mmc
git bisect bad dfabb077d62552797ca0ae7756cb30d3e195ead5
# bad: [15c981d16d70e8a5be297fa4af07a64ab7e080ed] Merge tag 'for-5.7-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux
git bisect bad 15c981d16d70e8a5be297fa4af07a64ab7e080ed
# good: [dc9492c14c758639d7b2468d4ed3c77e785c1a35] btrfs: hold a ref on the root on the dead roots list
git bisect good dc9492c14c758639d7b2468d4ed3c77e785c1a35
# good: [3cd86a58f7734bf9cef38f6f899608ebcaa3da13] Merge tag 'arm64-upstream' of git://git.kernel.org/pub/scm/linux/kernel/git/arm64/linux
git bisect good 3cd86a58f7734bf9cef38f6f899608ebcaa3da13
# bad: [fdf5563a720004199324371c08071b8ea27bd994] Merge branch 'x86-cleanups-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip
git bisect bad fdf5563a720004199324371c08071b8ea27bd994
# skip: [cf122cfba5b1d9daf64009d143f51dfec4b1705a] kill uaccess_try()
git bisect skip cf122cfba5b1d9daf64009d143f51dfec4b1705a
# skip: [4dd2a1b92b91b5f2acf853ee1dc0df135054698f] x86: Replace setup_irq() by request_irq()
git bisect skip 4dd2a1b92b91b5f2acf853ee1dc0df135054698f
# skip: [5c1f178094631e8b9acc67e4a9b6e03abfbc2529] x86: __setup_frame(): consolidate uaccess areas
git bisect skip 5c1f178094631e8b9acc67e4a9b6e03abfbc2529
# skip: [3add42c29cebb1d5f83c6205c59466a06ccf8da1] x86: get rid of get_user_ex() in restore_sigcontext()
git bisect skip 3add42c29cebb1d5f83c6205c59466a06ccf8da1
# skip: [ead8e4e7e2c75ced6fcd9a53d3e9a2ecd7368553] x86: __setup_rt_frame(): consolidate uaccess areas
git bisect skip ead8e4e7e2c75ced6fcd9a53d3e9a2ecd7368553
# skip: [31a9122058bc5f042cb04bcdb8cd9e6c77fdae8d] x86/mm: Drop pud_mknotpresent()
git bisect skip 31a9122058bc5f042cb04bcdb8cd9e6c77fdae8d
# skip: [791612e9668cecbf5dd24d13400ac74e099f005c] x86: x32_setup_rt_frame(): consolidate uaccess areas
git bisect skip 791612e9668cecbf5dd24d13400ac74e099f005c
# skip: [77f3c6166ddc7567455b244074b3ebb63862b56f] x86: kill get_user_{try,catch,ex}
git bisect skip 77f3c6166ddc7567455b244074b3ebb63862b56f
# skip: [b00d8f8f0b2b39223c3fd6713d318aba95420264] x86: setup_sigcontext(): list user_access_{begin,end}() into callers
git bisect skip b00d8f8f0b2b39223c3fd6713d318aba95420264
# skip: [0e79ad863df43b01090ae18c97de5c3787f069c6] x86/cpu: Fix a -Wmissing-prototypes warning for init_ia32_feat_ctl()
git bisect skip 0e79ad863df43b01090ae18c97de5c3787f069c6
# skip: [b87df6594486626a9ae5944807307f2604cea3e2] x86: unsafe_put-style macro for sigmask
git bisect skip b87df6594486626a9ae5944807307f2604cea3e2
# skip: [a37d01ead405e3aa14d72d284721fe46422b3b63] x86: switch save_v86_state() to unsafe_put_user()
git bisect skip a37d01ead405e3aa14d72d284721fe46422b3b63
# skip: [119cd59fcfbe70fb3fcab4e64cd232bcc3807585] x86: get rid of put_user_try in __setup_rt_frame() (both 32bit and 64bit)
git bisect skip 119cd59fcfbe70fb3fcab4e64cd232bcc3807585
# skip: [119cd59fcfbe70fb3fcab4e64cd232bcc3807585] x86: get rid of put_user_try in __setup_rt_frame() (both 32bit and 64bit)
git bisect skip 119cd59fcfbe70fb3fcab4e64cd232bcc3807585
# bad: [9589351ccf47a85a75180a430627c16bc28da929] Merge branch 'x86-boot-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip
git bisect bad 9589351ccf47a85a75180a430627c16bc28da929
# skip: [003602ad5516e59940de42e44c8d8033387bb363] x86/*/Makefile: Use -fno-asynchronous-unwind-tables to suppress .eh_frame sections
git bisect skip 003602ad5516e59940de42e44c8d8033387bb363
# skip: [0eea39a234dc52063d14541fabcb2c64516a2328] x86/boot/compressed: Remove .eh_frame section from bzImage
git bisect skip 0eea39a234dc52063d14541fabcb2c64516a2328
# skip: [bac59d18c7018a2fd5e800a1e72a8271bf404977] x86/setup: Fix static memory detection
git bisect skip bac59d18c7018a2fd5e800a1e72a8271bf404977
# skip: [3ee372ccce4d4e7c610748d0583979d3ed3a0cf4] x86/boot/compressed/64: Remove .bss/.pgtable from bzImage
git bisect skip 3ee372ccce4d4e7c610748d0583979d3ed3a0cf4
# skip: [a86255fe5258714e1f7c1bdfe95f08e4d098d450] x86/boot/compressed/64: Use 32-bit (zero-extended) MOV for z_output_len
git bisect skip a86255fe5258714e1f7c1bdfe95f08e4d098d450
# skip: [6f8f0dc980028e98ae339876a8403edae4d20e39] x86/vmlinux: Drop unneeded linker script discard of .eh_frame
git bisect skip 6f8f0dc980028e98ae339876a8403edae4d20e39
# skip: [48bfdb9deffdc6b683feb25e15f4f26aac503501] x86/boot/compressed/64: Use LEA to initialize boot stack pointer
git bisect skip 48bfdb9deffdc6b683feb25e15f4f26aac503501
# skip: [c90beea22a2bece4b0bbb39789bf835504421594] x86/boot/compressed: Fix debug_puthex() parameter type
git bisect skip c90beea22a2bece4b0bbb39789bf835504421594
# only skipped commits left to test
# possible first bad commit: [9589351ccf47a85a75180a430627c16bc28da929] Merge branch 'x86-boot-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip
# possible first bad commit: [c90beea22a2bece4b0bbb39789bf835504421594] x86/boot/compressed: Fix debug_puthex() parameter type
# possible first bad commit: [bac59d18c7018a2fd5e800a1e72a8271bf404977] x86/setup: Fix static memory detection
# possible first bad commit: [6f8f0dc980028e98ae339876a8403edae4d20e39] x86/vmlinux: Drop unneeded linker script discard of .eh_frame
# possible first bad commit: [003602ad5516e59940de42e44c8d8033387bb363] x86/*/Makefile: Use -fno-asynchronous-unwind-tables to suppress .eh_frame sections
# possible first bad commit: [0eea39a234dc52063d14541fabcb2c64516a2328] x86/boot/compressed: Remove .eh_frame section from bzImage
# possible first bad commit: [3ee372ccce4d4e7c610748d0583979d3ed3a0cf4] x86/boot/compressed/64: Remove .bss/.pgtable from bzImage
# possible first bad commit: [a86255fe5258714e1f7c1bdfe95f08e4d098d450] x86/boot/compressed/64: Use 32-bit (zero-extended) MOV for z_output_len
# possible first bad commit: [48bfdb9deffdc6b683feb25e15f4f26aac503501] x86/boot/compressed/64: Use LEA to initialize boot stack pointer

Unfortunately, the bisection does not tell exactly which commit is to blame.
I had to skip a lot of commits because their builds failed:
ld: arch/x86/boot/compressed/pgtable_64.o:(.bss+0x0): multiple definition of `__force_order'; arch/x86/boot/compressed/kaslr_64.o:(.bss+0x0): first defined here
make[2]: *** [arch/x86/boot/compressed/Makefile:120: arch/x86/boot/compressed/vmlinux] Error 1
make[1]: *** [arch/x86/boot/Makefile:114: arch/x86/boot/compressed/vmlinux] Error 2
make: *** [arch/x86/Makefile:283: bzImage] Error 2
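
Commits that fail to build do not have to be skipped by hand:
`git bisect run` treats exit code 125 from its test command as "this
commit cannot be tested, skip it". Below is a minimal Python sketch of
such a test command. It is not what was used for the bisection above.
`verdict` and `check_commit` are hypothetical names, and the build and
boot-test commands are left to the caller, because how to detect a
failed boot automatically is an open assumption here.

```python
import subprocess

GOOD = 0    # git bisect run: the commit is good
BAD = 1     # any code from 1 to 127 except 125 marks the commit bad
SKIP = 125  # the commit cannot be tested and is skipped


def verdict(build_ok, boot_ok):
    """Map build and boot results to a `git bisect run` exit code."""
    if not build_ok:
        return SKIP
    return GOOD if boot_ok else BAD


def check_commit(build_cmd, boot_cmd):
    """Run the build, then the boot test, and return the exit code.

    Both arguments are argument lists chosen by the caller, for example
    ["make", "-j8", "bzImage"] for the build. The boot test stands in
    for whatever automated check is available (hypothetical).
    """
    if subprocess.run(build_cmd).returncode != 0:
        return verdict(False, False)
    boot_ok = subprocess.run(boot_cmd).returncode == 0
    return verdict(True, boot_ok)
```

A wrapper script would pass the result of `check_commit` to
`sys.exit()` and be started with `git bisect run` followed by the path
to that script. Skipped commits can still leave the result ambiguous,
as happened here, if the first bad commit is among them.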

Can anyone here help me with this issue?

Thanks.

Downstream bug report:
https://bugzilla.redhat.com/show_bug.cgi?id=1822731

--
Best Regards,
Mike Gavrilov.

* Re: [Bug report] Kernel 5.7 become unbootable
From: Like Xu @ 2020-04-10  8:16 UTC
  To: Mikhail Gavrilov, Linux List Kernel Mailing, linux-x86_64, nivedita

On 2020/4/10 3:12, Mikhail Gavrilov wrote:
> Hi folks.
> I use Fedora Rawhide and today's update came with the new kernel
> 5.7.0-0.rc0.git6.1.
> As it turned out this kernel won't boot. Instead, I see the black
> screen and some times see messages:
> exit_boot() failed!
> efi_main() failed!
> 
> The last working kernel is kernel-core-5.7.0-0.rc0.git2.1
> 
> I looked at the spec file and found out that the last bootable kernel
> was related to the commit cad18da0afb1
> (https://src.fedoraproject.org/rpms/kernel/c/9949db6070c6ca501d7235760ca8aba120db7448?branch=master),
> and the first non-bootable kernel related to the commit a10c9c710f9e
> (https://src.fedoraproject.org/rpms/kernel/c/0561e0bf0e459dd4b27a77670de838128b87bd0e?branch=master)
> 
> Of course, knowing these two commits, I made bisecting with the vanilla kernel:
> 
> 
> Can anyone here help me with this issue?

I encountered the same issue as you:

exit_boot() failed!
efi_main() failed!

and the following patches are very suspicious:

x86/boot/compressed/64: Use LEA to initialize boot stack pointer
x86/boot/compressed/64: Use 32-bit (zero-extended) MOV for z_output_len
x86/boot/compressed/64: Remove .bss/.pgtable from bzImage
x86/boot/compressed: Remove .eh_frame section from bzImage

I am not sure if reverting these patches can help you.

Thanks,
Like Xu

> 
> Thanks.
> 
> Downstream bugreport:
> https://bugzilla.redhat.com/show_bug.cgi?id=1822731
> 
> --
> Best Regards,
> Mike Gavrilov.
> 


* Re: [Bug report] Kernel 5.7 become unbootable
From: Arvind Sankar @ 2020-04-10 14:29 UTC
  To: Like Xu
  Cc: Mikhail Gavrilov, Linux List Kernel Mailing, linux-x86_64, nivedita

On Fri, Apr 10, 2020 at 04:16:58PM +0800, Like Xu wrote:
> On 2020/4/10 3:12, Mikhail Gavrilov wrote:
> > Hi folks.
> > I use Fedora Rawhide and today's update came with the new kernel
> > 5.7.0-0.rc0.git6.1.
> > As it turned out this kernel won't boot. Instead, I see the black
> > screen and some times see messages:
> > exit_boot() failed!
> > efi_main() failed!
> > 
> > The last working kernel is kernel-core-5.7.0-0.rc0.git2.1
> > 
> > I looked at the spec file and found out that the last bootable kernel
> > was related to the commit cad18da0afb1
> > (https://src.fedoraproject.org/rpms/kernel/c/9949db6070c6ca501d7235760ca8aba120db7448?branch=master),
> > and the first non-bootable kernel related to the commit a10c9c710f9e
> > (https://src.fedoraproject.org/rpms/kernel/c/0561e0bf0e459dd4b27a77670de838128b87bd0e?branch=master)
> > 
> > Of course, knowing these two commits, I made bisecting with the vanilla kernel:
> > 
> > 
> > Can anyone here help me with this issue?
> 
> I encountered the same issue as you:
> 
> exit_boot() failed!
> efi_main() failed!
> 
> and the following patches are very suspicious:
> 
> x86/boot/compressed/64: Use LEA to initialize boot stack pointer
> x86/boot/compressed/64: Use 32-bit (zero-extended) MOV for z_output_len
> x86/boot/compressed/64: Remove .bss/.pgtable from bzImage
> x86/boot/compressed: Remove .eh_frame section from bzImage
> 
> I am not sure if reverting these patches can help you.
> 
> Thanks,
> Like Xu
> 

Please try with the patches at
https://lore.kernel.org/linux-efi/20200409130434.6736-1-ardb@kernel.org/

In particular, patches 3 and 4. The first one is a fix for upstream
commit 3ee372ccce4d ("x86/boot/compressed/64: Remove .bss/.pgtable from bzImage")

* Re: [Bug report] Kernel 5.7 become unbootable
From: Mikhail Gavrilov @ 2020-04-10 19:23 UTC
  To: Arvind Sankar; +Cc: Like Xu, Linux List Kernel Mailing, linux-x86_64

On Fri, 10 Apr 2020 at 19:29, Arvind Sankar <nivedita@alum.mit.edu> wrote:
>
> Please try with the patches at
> https://lore.kernel.org/linux-efi/20200409130434.6736-1-ardb@kernel.org/
>
> In particular, patches 3 and 4. The first one is a fix for upstream
> commit 3ee372ccce4d ("x86/boot/compressed/64: Remove .bss/.pgtable from bzImage")

Thanks, I can confirm that these patches fix the unbootable kernel.
But now, after booting successfully, I see another issue.
The kernel logs are flooded with error messages:
[   18.253336] BUG: sleeping function called from invalid context at drivers/gpu/drm/drm_mm.c:371
[   18.253339] in_atomic(): 1, irqs_disabled(): 0, non_block: 0, pid: 1723, name: gnome-shell
[   18.253340] INFO: lockdep is turned off.
[   18.253342] CPU: 23 PID: 1723 Comm: gnome-shell Tainted: G      D W         5.7.0-0.rc0.git6.1.2.fc33.x86_64 #1
[   18.253343] Hardware name: System manufacturer System Product Name/ROG STRIX X570-I GAMING, BIOS 1405 11/19/2019
[   18.253344] Call Trace:
[   18.253350]  dump_stack+0x8b/0xc8
[   18.253352]  ___might_sleep.cold+0xb6/0xc6
[   18.253362]  drm_mm_insert_node_in_range+0x31b/0x490 [drm]
[   18.253365]  ? trace_kmalloc_node+0xbb/0x110
[   18.253408]  amdgpu_vram_mgr_new+0x1e8/0x3c0 [amdgpu]
[   18.253413]  ttm_bo_mem_space+0xd5/0x300 [ttm]
[   18.253417]  ttm_bo_validate+0x122/0x1c0 [ttm]
[   18.253420]  ? mutex_trylock+0x116/0x130
[   18.253423]  ttm_bo_init_reserved+0x30f/0x3e0 [ttm]
[   18.253456]  amdgpu_bo_do_create+0x1a3/0x630 [amdgpu]
[   18.253489]  ? amdgpu_bo_subtract_pin_size+0x60/0x60 [amdgpu]
[   18.253521]  amdgpu_bo_create+0x30/0x2e0 [amdgpu]
[   18.253524]  ? lock_acquire+0xa4/0x2e0
[   18.253557]  amdgpu_gem_object_create+0x7b/0xf0 [amdgpu]
[   18.253590]  ? amdgpu_gem_object_close+0x180/0x180 [amdgpu]
[   18.253622]  amdgpu_gem_create_ioctl+0x1ba/0x280 [amdgpu]
[   18.253655]  ? amdgpu_gem_object_close+0x180/0x180 [amdgpu]
[   18.253663]  drm_ioctl_kernel+0x86/0xd0 [drm]
[   18.253671]  drm_ioctl+0x206/0x390 [drm]
[   18.253703]  ? amdgpu_gem_object_close+0x180/0x180 [amdgpu]
[   18.253736]  amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
[   18.253739]  ksys_ioctl+0x82/0xc0
[   18.253741]  __x64_sys_ioctl+0x16/0x20
[   18.253744]  do_syscall_64+0x5c/0xa0
[   18.253746]  entry_SYSCALL_64_after_hwframe+0x49/0xb3
[   18.253748] RIP: 0033:0x7f24b8c2142b
[   18.253750] Code: 0f 1e fa 48 8b 05 5d ba 0c 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 0f 1f 44 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 2d ba 0c 00 f7 d8 64 89 01 48
[   18.253752] RSP: 002b:00007fff0960d0c8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[   18.253753] RAX: ffffffffffffffda RBX: 00007fff0960d120 RCX: 00007f24b8c2142b
[   18.253754] RDX: 00007fff0960d120 RSI: 00000000c0206440 RDI: 000000000000000f
[   18.253755] RBP: 00000000c0206440 R08: 0000555e7752ca40 R09: 00007f24b8ceda40
[   18.253756] R10: 0000000000000008 R11: 0000000000000246 R12: 0000555e7752ca40
[   18.253757] R13: 000000000000000f R14: 0000000000200000 R15: 0000555e7720ffd0

Would it be better to file a separate bug report on the drm mailing
list, or is it enough to reply here on this list?

--
Best Regards,
Mike Gavrilov.

* Re: [Bug report] Kernel 5.7 become unbootable
From: Arvind Sankar @ 2020-04-11  1:09 UTC
  To: Mikhail Gavrilov
  Cc: Arvind Sankar, Like Xu, Linux List Kernel Mailing, linux-x86_64

On Sat, Apr 11, 2020 at 12:23:52AM +0500, Mikhail Gavrilov wrote:
> On Fri, 10 Apr 2020 at 19:29, Arvind Sankar <nivedita@alum.mit.edu> wrote:
> >
> > Please try with the patches at
> > https://lore.kernel.org/linux-efi/20200409130434.6736-1-ardb@kernel.org/
> >
> > In particular, patches 3 and 4. The first one is a fix for upstream
> > commit 3ee372ccce4d ("x86/boot/compressed/64: Remove .bss/.pgtable from bzImage")
> 
> Thanks, I can confirm that these patches solve the issue with unbootable kernel.
> But, now after successful booting, I see another issue.
> Kernel logs flooded with the error messages:
> [   18.253336] BUG: sleeping function called from invalid context at
> drivers/gpu/drm/drm_mm.c:371
> [   18.253339] in_atomic(): 1, irqs_disabled(): 0, non_block: 0, pid:
> 1723, name: gnome-shell
> [   18.253340] INFO: lockdep is turned off.
> [   18.253342] CPU: 23 PID: 1723 Comm: gnome-shell Tainted: G      D W
>         5.7.0-0.rc0.git6.1.2.fc33.x86_64 #1
> [   18.253343] Hardware name: System manufacturer System Product
> Name/ROG STRIX X570-I GAMING, BIOS 1405 11/19/2019
> [   18.253344] Call Trace:
> [   18.253350]  dump_stack+0x8b/0xc8
> [   18.253352]  ___might_sleep.cold+0xb6/0xc6
> [   18.253362]  drm_mm_insert_node_in_range+0x31b/0x490 [drm]
> [   18.253365]  ? trace_kmalloc_node+0xbb/0x110
> [   18.253408]  amdgpu_vram_mgr_new+0x1e8/0x3c0 [amdgpu]
> [   18.253413]  ttm_bo_mem_space+0xd5/0x300 [ttm]
> [   18.253417]  ttm_bo_validate+0x122/0x1c0 [ttm]
> [   18.253420]  ? mutex_trylock+0x116/0x130
> [   18.253423]  ttm_bo_init_reserved+0x30f/0x3e0 [ttm]
> [   18.253456]  amdgpu_bo_do_create+0x1a3/0x630 [amdgpu]
> [   18.253489]  ? amdgpu_bo_subtract_pin_size+0x60/0x60 [amdgpu]
> [   18.253521]  amdgpu_bo_create+0x30/0x2e0 [amdgpu]
> [   18.253524]  ? lock_acquire+0xa4/0x2e0
> [   18.253557]  amdgpu_gem_object_create+0x7b/0xf0 [amdgpu]
> [   18.253590]  ? amdgpu_gem_object_close+0x180/0x180 [amdgpu]
> [   18.253622]  amdgpu_gem_create_ioctl+0x1ba/0x280 [amdgpu]
> [   18.253655]  ? amdgpu_gem_object_close+0x180/0x180 [amdgpu]
> [   18.253663]  drm_ioctl_kernel+0x86/0xd0 [drm]
> [   18.253671]  drm_ioctl+0x206/0x390 [drm]
> [   18.253703]  ? amdgpu_gem_object_close+0x180/0x180 [amdgpu]
> [   18.253736]  amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
> [   18.253739]  ksys_ioctl+0x82/0xc0
> [   18.253741]  __x64_sys_ioctl+0x16/0x20
> [   18.253744]  do_syscall_64+0x5c/0xa0
> [   18.253746]  entry_SYSCALL_64_after_hwframe+0x49/0xb3
> [   18.253748] RIP: 0033:0x7f24b8c2142b
> [   18.253750] Code: 0f 1e fa 48 8b 05 5d ba 0c 00 64 c7 00 26 00 00
> 00 48 c7 c0 ff ff ff ff c3 66 0f 1f 44 00 00 f3 0f 1e fa b8 10 00 00
> 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 2d ba 0c 00 f7 d8 64 89
> 01 48
> [   18.253752] RSP: 002b:00007fff0960d0c8 EFLAGS: 00000246 ORIG_RAX:
> 0000000000000010
> [   18.253753] RAX: ffffffffffffffda RBX: 00007fff0960d120 RCX: 00007f24b8c2142b
> [   18.253754] RDX: 00007fff0960d120 RSI: 00000000c0206440 RDI: 000000000000000f
> [   18.253755] RBP: 00000000c0206440 R08: 0000555e7752ca40 R09: 00007f24b8ceda40
> [   18.253756] R10: 0000000000000008 R11: 0000000000000246 R12: 0000555e7752ca40
> [   18.253757] R13: 000000000000000f R14: 0000000000200000 R15: 0000555e7720ffd0
> 
> I think be better if I fill another bugreport in drm mailing list or
> will be enough reply to this mailing list from here?
> 
> --
> Best Regards,
> Mike Gavrilov.

Glad that the booting problem got fixed.

Probably cc both the drm and amdgpu maintainers and change the subject
to be more specific.

Thanks.

* Re: [Bug report] Kernel 5.7 become unbootable
From: Mikhail Gavrilov @ 2020-04-12 20:00 UTC
  To: Hillf Danton
  Cc: Arvind Sankar, Like Xu, Linux List Kernel Mailing, linux-x86_64

On Sat, 11 Apr 2020 at 07:01, Hillf Danton <hdanton@sina.com> wrote:
>
> After taking a look at d8a26d8fc37c5b
> (drm/mm: revert "Break long searches in fragmented address spaces")
>

I applied the mentioned patch from patchwork, and the system has now
been running stably for 24 hours without any BUG messages in the
kernel logs, thanks.

https://patchwork.freedesktop.org/patch/359278/

All problems solved.

--
Best Regards,
Mike Gavrilov.
