From: Dave Young <dyoung@redhat.com>
To: Pingfan Liu <kernelfans@gmail.com>
Cc: kexec@lists.infradead.org, linux-kernel@vger.kernel.org,
	Baoquan He <bhe@redhat.com>,
	Andrew Morton <akpm@linux-foundation.org>,
	Mike Rapoport <rppt@linux.vnet.ibm.com>,
	yinghai@kernel.org, vgoyal@redhat.com,
	Randy Dunlap <rdunlap@infradead.org>,
	Borislav Petkov <bp@alien8.de>,
	x86@kernel.org
Subject: Re: [PATCHv7] x86/kdump: bugfix, make the behavior of crashkernel=X consistent with kaslr
Date: Fri, 18 Jan 2019 11:43:24 +0800
Message-ID: <20190118034324.GA3512@dhcp-128-65.nay.redhat.com>
In-Reply-To: <1547539623-18201-1-git-send-email-kernelfans@gmail.com>

Pingfan, thanks for the post.

On 01/15/19 at 04:07pm, Pingfan Liu wrote:
> People reported a bug on a high end server with many pcie devices, where
> kernel bootup with crashkernel=384M, and kaslr is enabled. Even
> though we still see much memory under 896 MB, the finding still failed
> intermittently. Because currently we can only find region under 896 MB,
> if without ',high' specified. Then KASLR breaks 896 MB into several parts
> randomly, and crashkernel reservation need be aligned to 128 MB, that's
> why failure is found. It raises confusion to the end user that sometimes
> crashkernel=X works while sometimes fails.
> If want to make it succeed, customer can change kernel option to
> "crashkernel=384M,high". Just this give "crashkernel=xx@yy" a very
> limited space to behave even though its grammar looks more generic.
> And we can't answer questions raised from customer that confidently:
> 1) why it doesn't succeed to reserve 896 MB;
> 2) what's wrong with memory region under 4G;
> 3) why I have to add ',high', I only require 384 MB, not 3840 MB.
> This patch tries to get memory region from 896 MB firstly, then [896MB,4G],
> finally above 4G.

The patch log still does not read well.  It needs some cleanup, such as
paragraph breaks, to make it more readable.

For example, you could use something like the following:
--
People reported that crashkernel=384M reservation failed on a high-end
server with KASLR enabled.  In that case there is enough free memory under
896M, but the crashkernel reservation still fails intermittently.

The reason is that the crashkernel reservation code only looks for a free
region under 896M, aligned to 128M, when ',high' is not used.  KASLR can
break the first 896M into several parts randomly, which causes the failure.
Users have no way to predict this or to make sure crashkernel=xM works
unless they use 'crashkernel=xM,high'.  Since 'crashkernel=xM' is the most
common use case, this issue is a serious bug.

And we can't answer the questions raised by customers:
1) why does it fail to reserve memory under 896 MB;
2) what's wrong with the memory region under 4G;
3) why do I have to add ',high' when I only require 384 MB, not 3840 MB.

This patch tries to get the memory region from below 896 MB first, then
from [896MB,4G], and finally from above 4G.
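
To make the fallback order in the log above concrete, here is a minimal
userspace sketch.  It is not the kernel code: struct free_range,
find_in_range() and find_crash_base() are hypothetical names, the 128M
alignment is simply taken from the log text, and SKETCH_HIGH_MAX is an
arbitrary illustrative upper bound.  The real memblock_find_in_range()
walks memblock's own region lists and may behave differently.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define MB(x) ((uint64_t)(x) << 20)

/* Illustrative stand-ins, not the kernel's definitions. */
#define SKETCH_ALIGN     MB(128)       /* alignment quoted in the log */
#define SKETCH_LOW_MAX   MB(896)       /* first limit tried */
#define SKETCH_4G        (1ULL << 32)  /* second limit tried */
#define SKETCH_HIGH_MAX  (1ULL << 46)  /* assumed upper bound */

/* One free physical range, [start, end). */
struct free_range {
	uint64_t start;
	uint64_t end;
};

/*
 * Return the highest aligned base in [lo, hi) where 'size' bytes fit
 * inside a single free range, or 0 if there is none.  'r' must be
 * sorted by address; 'align' must be a power of two.
 */
uint64_t find_in_range(const struct free_range *r, size_t n,
		       uint64_t lo, uint64_t hi,
		       uint64_t size, uint64_t align)
{
	assert(align && (align & (align - 1)) == 0);

	for (size_t i = n; i-- > 0; ) {
		uint64_t start = r[i].start > lo ? r[i].start : lo;
		uint64_t end = r[i].end < hi ? r[i].end : hi;

		if (end <= start || end - start < size)
			continue;

		/* Highest aligned base that still ends within the range. */
		uint64_t base = (end - size) & ~(align - 1);
		if (base >= start)
			return base;
	}
	return 0;
}

/* Try below 896M first, then below 4G, then up to the high limit. */
uint64_t find_crash_base(const struct free_range *r, size_t n,
			 uint64_t size)
{
	static const uint64_t limits[] = {
		SKETCH_LOW_MAX, SKETCH_4G, SKETCH_HIGH_MAX,
	};

	for (size_t i = 0; i < sizeof(limits) / sizeof(limits[0]); i++) {
		uint64_t base = find_in_range(r, n, SKETCH_ALIGN, limits[i],
					      size, SKETCH_ALIGN);
		if (base)
			return base;
	}
	return 0;
}
```

With free ranges [130M,520M), [600M,896M) and [1024M,3072M), a 384M
request finds nothing below 896M: the first range is 390M long but holds
no 128M-aligned base with 384M of room after it, and the second range is
too small.  The second step, below 4G, then succeeds in the third range.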

> Dave Young sent the original post, and I just re-post it with commit log
> improvement as his requirement.
> http://lists.infradead.org/pipermail/kexec/2017-October/019571.html
> There was an old discussion below (previously posted by Chao Wang):
> https://lkml.org/lkml/2013/10/15/601

I hope someone else can review this, because I posted the original version.

When I posted it previously I considered it a nice-to-have improvement,
but now it is a serious bug that needs to be fixed.  I can review
and ack if you repost with a better log.

> 
> Signed-off-by: Pingfan Liu <kernelfans@gmail.com>
> Cc: Dave Young <dyoung@redhat.com>
> Cc: Baoquan He <bhe@redhat.com>
> Cc: Andrew Morton <akpm@linux-foundation.org>
> Cc: Mike Rapoport <rppt@linux.vnet.ibm.com>
> Cc: yinghai@kernel.org,
> Cc: vgoyal@redhat.com
> Cc: Randy Dunlap <rdunlap@infradead.org>
> ---
> v6 -> v7: fix spelling mistake pointed out by Randy
>  arch/x86/kernel/setup.c | 16 ++++++++++++++++
>  1 file changed, 16 insertions(+)
> 
> diff --git a/arch/x86/kernel/setup.c b/arch/x86/kernel/setup.c
> index 3d872a5..fa62c81 100644
> --- a/arch/x86/kernel/setup.c
> +++ b/arch/x86/kernel/setup.c
> @@ -551,6 +551,22 @@ static void __init reserve_crashkernel(void)
>  						    high ? CRASH_ADDR_HIGH_MAX
>  							 : CRASH_ADDR_LOW_MAX,
>  						    crash_size, CRASH_ALIGN);
> +#ifdef CONFIG_X86_64
> +		/*
> +		 * crashkernel=X reserve below 896M fails? Try below 4G
> +		 */
> +		if (!high && !crash_base)
> +			crash_base = memblock_find_in_range(CRASH_ALIGN,
> +						(1ULL << 32),
> +						crash_size, CRASH_ALIGN);
> +		/*
> +		 * crashkernel=X reserve below 4G fails? Try MAXMEM
> +		 */
> +		if (!high && !crash_base)
> +			crash_base = memblock_find_in_range(CRASH_ALIGN,
> +						CRASH_ADDR_HIGH_MAX,
> +						crash_size, CRASH_ALIGN);
> +#endif
>  		if (!crash_base) {
>  			pr_info("crashkernel reservation failed - No suitable area found.\n");
>  			return;
> -- 
> 2.7.4
> 

Thanks
Dave



