From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
        id S1751140AbdE3RWL (ORCPT ); Tue, 30 May 2017 13:22:11 -0400
Received: from mga07.intel.com ([134.134.136.100]:58218 "EHLO mga07.intel.com"
        rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
        id S1751032AbdE3RWJ (ORCPT ); Tue, 30 May 2017 13:22:09 -0400
X-ExtLoop1: 1
X-IronPort-AV: E=Sophos;i="5.38,418,1491289200"; d="scan'208";a="1176149401"
Date: Tue, 30 May 2017 10:22:08 -0700
From: Andi Kleen
To: Peter Zijlstra
Cc: Stephane Eranian, Vince Weaver, "Liang, Kan", "mingo@redhat.com",
        "linux-kernel@vger.kernel.org", "alexander.shishkin@linux.intel.com",
        "acme@redhat.com", "jolsa@redhat.com", "torvalds@linux-foundation.org",
        "tglx@linutronix.de"
Subject: Re: [PATCH 1/2] perf/x86/intel: enable CPU ref_cycles for GP counter
Message-ID: <20170530172208.GL24144@tassilo.jf.intel.com>
References: <20170522192335.v4gvhz24ix2jeihg@hirez.programming.kicks-ass.net>
        <20170523063913.363ssgcy7kmeesye@hirez.programming.kicks-ass.net>
        <20170524154518.GA24144@tassilo.jf.intel.com>
        <20170530092523.xkuj5lqpq5pb5y4m@hirez.programming.kicks-ass.net>
        <20170530135128.GI24144@tassilo.jf.intel.com>
        <20170530162838.h5tzdnrxpy6upbka@hirez.programming.kicks-ass.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <20170530162838.h5tzdnrxpy6upbka@hirez.programming.kicks-ass.net>
User-Agent: Mutt/1.8.0 (2017-02-23)
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

> > You would only need a single one per system however, not one per CPU.
> > RCU already tracks all the CPUs, all we need is a single NMI watchdog
> > that makes sure RCU itself does not get stuck.
> >
> > So we just have to find a single watchdog somewhere that can trigger
> > NMI.
>
> But then you have to IPI broadcast the NMI, which is less than ideal.

Only when the watchdog times out to print the backtraces.

>
> RCU doesn't have that problem because the quiescent state is a global
> thing. CPU progress, which is what the NMI watchdog tests, is very much
> per logical CPU though.

RCU already has a CPU stall detector. It should work (and usually triggers
before the NMI watchdog in my experience unless the whole system is dead).

-Andi