From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932562AbcHKI1p (ORCPT ); Thu, 11 Aug 2016 04:27:45 -0400
Received: from bombadil.infradead.org ([198.137.202.9]:40224 "EHLO bombadil.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932297AbcHKI1m (ORCPT ); Thu, 11 Aug 2016 04:27:42 -0400
Date: Thu, 11 Aug 2016 10:27:33 +0200
From: Peter Zijlstra
To: Chris Metcalf
Cc: Christoph Lameter, Gilad Ben Yossef, Steven Rostedt, Ingo Molnar, Andrew Morton, Rik van Riel, Tejun Heo, Frederic Weisbecker, Thomas Gleixner, "Paul E. McKenney", Viresh Kumar, Catalin Marinas, Will Deacon, Andy Lutomirski, Daniel Lezcano, linux-doc@vger.kernel.org, linux-api@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH v13 00/12] support "task_isolation" mode
Message-ID: <20160811082733.GU30192@twins.programming.kicks-ass.net>
References: <1468529299-27929-1-git-send-email-cmetcalf@mellanox.com> <7a3f66f7-5011-7d59-2e0e-f57e4e42e6b6@mellanox.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <7a3f66f7-5011-7d59-2e0e-f57e4e42e6b6@mellanox.com>
User-Agent: Mutt/1.5.23.1 (2014-03-12)
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

On Fri, Jul 22, 2016 at 08:50:44AM -0400, Chris Metcalf wrote:
> On 7/21/2016 10:20 PM, Christoph Lameter wrote:
> >On Thu, 21 Jul 2016, Chris Metcalf wrote:
> >>On 7/20/2016 10:04 PM, Christoph Lameter wrote:
> >>unstable, and then scheduling work to safely remove that timer.
> >>I haven't looked at this code before (in kernel/time/clocksource.c
> >>under CONFIG_CLOCKSOURCE_WATCHDOG) since the timers on
> >>arm64 and tile aren't unstable. Is it possible to boot your machine
> >>with a stable clocksource?
> >It already as a stable clocksource. Sorry but that was one of the criteria
> >for the server when we ordered them.
> >Could this be clock adjustments?
>
> We probably need to get clock folks to jump in on this thread!

Boot with tsc=reliable; this disables the watchdog.

We (sadly) have to have this thing running on most x86 because TSC, even
if initially stable, can do weird things once it's running.

We have seen:

 - SMI
 - hotplug
 - suspend
 - multi-socket

mess up the TSC, even if it was deemed 'good' at boot time.

If you _know_ your TSC to be solid, boot with tsc=reliable and be happy.
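
To check whether a running machine was actually booted this way, something
like the sketch below may help. It is not part of the patch series or of any
kernel tooling; it only reads /proc/cmdline and the clocksource0 sysfs
directory. The helper names are made up for illustration, and it assumes
tsc=reliable shows up as its own space-separated token on the command line.

```python
from pathlib import Path

# Standard Linux locations; on other systems the reads simply return None.
CMDLINE = Path("/proc/cmdline")
CLOCKSOURCE_DIR = Path("/sys/devices/system/clocksource/clocksource0")


def has_tsc_reliable(cmdline: str) -> bool:
    """True if 'tsc=reliable' appears as its own boot parameter.

    Assumption: the parameter is passed exactly as 'tsc=reliable'.
    """
    return "tsc=reliable" in cmdline.split()


def read_text_or_none(path: Path):
    """Return the stripped file contents, or None if the file is unreadable."""
    try:
        return path.read_text().strip()
    except OSError:
        return None


def clocksource_report() -> dict:
    """Collect the boot flag plus the current and available clocksources."""
    cmdline = read_text_or_none(CMDLINE) or ""
    return {
        "tsc_reliable": has_tsc_reliable(cmdline),
        "current": read_text_or_none(CLOCKSOURCE_DIR / "current_clocksource"),
        "available": read_text_or_none(CLOCKSOURCE_DIR / "available_clocksource"),
    }


if __name__ == "__main__":
    for key, value in clocksource_report().items():
        print(f"{key}: {value}")
```

If "current" reports something other than tsc on a box that is supposed to
have a solid TSC, the kernel may have marked the TSC unstable at some point;
dmesg should say why.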