All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH] x86/tsc: mark tsc reliable for qualified platforms
@ 2021-03-02  2:52 Feng Tang
  2021-03-02  9:14 ` Peter Zijlstra
  2021-03-03 14:51 ` Thomas Gleixner
  0 siblings, 2 replies; 5+ messages in thread
From: Feng Tang @ 2021-03-02  2:52 UTC (permalink / raw)
  To: Thomas Gleixner, Ingo Molnar, H Peter Anvin, Borislav Petkov,
	Peter Zijlstra, x86, linux-kernel
  Cc: rui.zhang, dave.hansen, andi.kleen, len.brown, Feng Tang

There are cases that tsc clocksource are wrongly judged as unstable by
clocksource watchdogs like hpet, acpi_pm or 'refined-jiffies'. While
there is hardly a general reliable way to check the validity of a
watchdog, and to protect the innocent tsc, Thomas Gleixner proposed [1]:

"I'm inclined to lift that requirement when the CPU has:

    1) X86_FEATURE_CONSTANT_TSC
    2) X86_FEATURE_NONSTOP_TSC
    3) X86_FEATURE_NONSTOP_TSC_S3
    4) X86_FEATURE_TSC_ADJUST
    5) At max. 4 sockets

 After two decades of horrors we're finally at a point where TSC seems
 to be halfway reliable and less abused by BIOS tinkerers. TSC_ADJUST
 was really key as we can now detect even small modifications reliably
 and the important point is that we can cure them as well (not pretty
 but better than all other options)."

So implement it with slight change as discussed in the thread, and be
more defensive to use maxim of 2 sockets.

The check is done inside tsc_init() before registering 'tsc-early' and
'tsc' clocksources, as there were cases that both of them have been
wrongly judged as unreliable.

[1]. https://lore.kernel.org/lkml/87eekfk8bd.fsf@nanos.tec.linutronix.de/
Signed-off-by: Feng Tang <feng.tang@intel.com>
Reviewed-by: Andi Kleen <ak@linux.intel.com>
---
 arch/x86/kernel/tsc.c | 11 +++++++++++
 1 file changed, 11 insertions(+)

diff --git a/arch/x86/kernel/tsc.c b/arch/x86/kernel/tsc.c
index f70dffc..a7e3980 100644
--- a/arch/x86/kernel/tsc.c
+++ b/arch/x86/kernel/tsc.c
@@ -1193,6 +1193,17 @@ static void __init check_system_tsc_reliable(void)
 #endif
 	if (boot_cpu_has(X86_FEATURE_TSC_RELIABLE))
 		tsc_clocksource_reliable = 1;
+
+	/*
+	 * Ideally the socket number should be checked, but this is called
+	 * by tsc_init() which is in early boot phase and the socket numbers
+	 * may not be available. Use 'nr_online_nodes' as a fallback solution
+	 */
+	if (boot_cpu_has(X86_FEATURE_CONSTANT_TSC)
+		&& boot_cpu_has(X86_FEATURE_NONSTOP_TSC)
+		&& boot_cpu_has(X86_FEATURE_TSC_ADJUST)
+		&& nr_online_nodes <= 2)
+		tsc_clocksource_reliable = 1;
 }
 
 /*
-- 
2.7.4


^ permalink raw reply related	[flat|nested] 5+ messages in thread

* Re: [PATCH] x86/tsc: mark tsc reliable for qualified platforms
  2021-03-02  2:52 [PATCH] x86/tsc: mark tsc reliable for qualified platforms Feng Tang
@ 2021-03-02  9:14 ` Peter Zijlstra
  2021-03-02 11:55   ` Feng Tang
  2021-03-03 14:51 ` Thomas Gleixner
  1 sibling, 1 reply; 5+ messages in thread
From: Peter Zijlstra @ 2021-03-02  9:14 UTC (permalink / raw)
  To: Feng Tang
  Cc: Thomas Gleixner, Ingo Molnar, H Peter Anvin, Borislav Petkov,
	x86, linux-kernel, rui.zhang, dave.hansen, andi.kleen, len.brown

On Tue, Mar 02, 2021 at 10:52:52AM +0800, Feng Tang wrote:
> @@ -1193,6 +1193,17 @@ static void __init check_system_tsc_reliable(void)
>  #endif
>  	if (boot_cpu_has(X86_FEATURE_TSC_RELIABLE))
>  		tsc_clocksource_reliable = 1;
> +
> +	/*
> +	 * Ideally the socket number should be checked, but this is called
> +	 * by tsc_init() which is in early boot phase and the socket numbers
> +	 * may not be available. Use 'nr_online_nodes' as a fallback solution
> +	 */
> +	if (boot_cpu_has(X86_FEATURE_CONSTANT_TSC)
> +		&& boot_cpu_has(X86_FEATURE_NONSTOP_TSC)
> +		&& boot_cpu_has(X86_FEATURE_TSC_ADJUST)
> +		&& nr_online_nodes <= 2)
> +		tsc_clocksource_reliable = 1;

Logical operators go at the end of a line and alignment is with the (,
not the code block after it.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] x86/tsc: mark tsc reliable for qualified platforms
  2021-03-02  9:14 ` Peter Zijlstra
@ 2021-03-02 11:55   ` Feng Tang
  0 siblings, 0 replies; 5+ messages in thread
From: Feng Tang @ 2021-03-02 11:55 UTC (permalink / raw)
  To: Peter Zijlstra
  Cc: Thomas Gleixner, Ingo Molnar, H Peter Anvin, Borislav Petkov,
	x86, linux-kernel, rui.zhang, dave.hansen, andi.kleen, len.brown

On Tue, Mar 02, 2021 at 10:14:01AM +0100, Peter Zijlstra wrote:
> On Tue, Mar 02, 2021 at 10:52:52AM +0800, Feng Tang wrote:
> > @@ -1193,6 +1193,17 @@ static void __init check_system_tsc_reliable(void)
> >  #endif
> >  	if (boot_cpu_has(X86_FEATURE_TSC_RELIABLE))
> >  		tsc_clocksource_reliable = 1;
> > +
> > +	/*
> > +	 * Ideally the socket number should be checked, but this is called
> > +	 * by tsc_init() which is in early boot phase and the socket numbers
> > +	 * may not be available. Use 'nr_online_nodes' as a fallback solution
> > +	 */
> > +	if (boot_cpu_has(X86_FEATURE_CONSTANT_TSC)
> > +		&& boot_cpu_has(X86_FEATURE_NONSTOP_TSC)
> > +		&& boot_cpu_has(X86_FEATURE_TSC_ADJUST)
> > +		&& nr_online_nodes <= 2)
> > +		tsc_clocksource_reliable = 1;
> 
> Logical operators go at the end of a line and alignment is with the (,
> not the code block after it.

Thanks for pointing out and the suggestion! Will change it to

	if (boot_cpu_has(X86_FEATURE_CONSTANT_TSC) &&
	    boot_cpu_has(X86_FEATURE_NONSTOP_TSC) &&
	    boot_cpu_has(X86_FEATURE_TSC_ADJUST) &&
	    nr_online_nodes <= 2)
		tsc_clocksource_reliable = 1;

- Feng

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] x86/tsc: mark tsc reliable for qualified platforms
  2021-03-02  2:52 [PATCH] x86/tsc: mark tsc reliable for qualified platforms Feng Tang
  2021-03-02  9:14 ` Peter Zijlstra
@ 2021-03-03 14:51 ` Thomas Gleixner
  2021-03-08  1:42   ` Feng Tang
  1 sibling, 1 reply; 5+ messages in thread
From: Thomas Gleixner @ 2021-03-03 14:51 UTC (permalink / raw)
  To: Feng Tang, Ingo Molnar, H Peter Anvin, Borislav Petkov,
	Peter Zijlstra, x86, linux-kernel
  Cc: rui.zhang, dave.hansen, andi.kleen, len.brown, Feng Tang

On Tue, Mar 02 2021 at 10:52, Feng Tang wrote:
> There are cases that tsc clocksource are wrongly judged as unstable by
> clocksource watchdogs like hpet, acpi_pm or 'refined-jiffies'. While
> there is hardly a general reliable way to check the validity of a
> watchdog, and to protect the innocent tsc, Thomas Gleixner proposed [1]:
>
> "I'm inclined to lift that requirement when the CPU has:
>
>     1) X86_FEATURE_CONSTANT_TSC
>     2) X86_FEATURE_NONSTOP_TSC
>     3) X86_FEATURE_NONSTOP_TSC_S3
>     4) X86_FEATURE_TSC_ADJUST
>     5) At max. 4 sockets
>
>  After two decades of horrors we're finally at a point where TSC seems
>  to be halfway reliable and less abused by BIOS tinkerers. TSC_ADJUST
>  was really key as we can now detect even small modifications reliably
>  and the important point is that we can cure them as well (not pretty
>  but better than all other options)."
>
> So implement it with slight change as discussed in the thread, and be
> more defensive to use maxim of 2 sockets.

Can you please explain the slight change in the changelog?

Thanks,

        tglx

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] x86/tsc: mark tsc reliable for qualified platforms
  2021-03-03 14:51 ` Thomas Gleixner
@ 2021-03-08  1:42   ` Feng Tang
  0 siblings, 0 replies; 5+ messages in thread
From: Feng Tang @ 2021-03-08  1:42 UTC (permalink / raw)
  To: Thomas Gleixner
  Cc: Ingo Molnar, H Peter Anvin, Borislav Petkov, Peter Zijlstra, x86,
	linux-kernel, Zhang, Rui, Hansen, Dave, Kleen, Andi, Brown, Len

On Wed, Mar 03, 2021 at 10:51:31PM +0800, Thomas Gleixner wrote:
> On Tue, Mar 02 2021 at 10:52, Feng Tang wrote:
> > There are cases that tsc clocksource are wrongly judged as unstable by
> > clocksource watchdogs like hpet, acpi_pm or 'refined-jiffies'. While
> > there is hardly a general reliable way to check the validity of a
> > watchdog, and to protect the innocent tsc, Thomas Gleixner proposed [1]:
> >
> > "I'm inclined to lift that requirement when the CPU has:
> >
> >     1) X86_FEATURE_CONSTANT_TSC
> >     2) X86_FEATURE_NONSTOP_TSC
> >     3) X86_FEATURE_NONSTOP_TSC_S3
> >     4) X86_FEATURE_TSC_ADJUST
> >     5) At max. 4 sockets
> >
> >  After two decades of horrors we're finally at a point where TSC seems
> >  to be halfway reliable and less abused by BIOS tinkerers. TSC_ADJUST
> >  was really key as we can now detect even small modifications reliably
> >  and the important point is that we can cure them as well (not pretty
> >  but better than all other options)."
> >
> > So implement it with slight change as discussed in the thread, and be
> > more defensive to use maxim of 2 sockets.
> 
> Can you please explain the slight change in the changelog?
 
Sorry for the late response. Just found this mail in my "Junk Mail"
folder with 3 copies, interesting mail sever filters!

I will add 
"As feature #3 X86_FEATURE_NONSTOP_TSC_S3 only exists on several
generations of Atom processor, and is always coupled with 
X86_FEATURE_CONSTANT_TSC and X86_FEATURE_NONSTOP_TSC, skip checking
it"
to the commit log.

Thanks,
Feng



> Thanks,
> 
>         tglx

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2021-03-08  1:44 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-03-02  2:52 [PATCH] x86/tsc: mark tsc reliable for qualified platforms Feng Tang
2021-03-02  9:14 ` Peter Zijlstra
2021-03-02 11:55   ` Feng Tang
2021-03-03 14:51 ` Thomas Gleixner
2021-03-08  1:42   ` Feng Tang

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.