From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753224Ab1HVOXJ (ORCPT ); Mon, 22 Aug 2011 10:23:09 -0400 Received: from casper.infradead.org ([85.118.1.10]:41417 "EHLO casper.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751429Ab1HVOXI convert rfc822-to-8bit (ORCPT ); Mon, 22 Aug 2011 10:23:08 -0400 Subject: Re: [RFC][PATCH 4/6] x86, nmi: add in logic to handle multiple events and unknown NMIs From: Peter Zijlstra To: Don Zickus Cc: x86@kernel.org, Andi Kleen , Robert Richter , ying.huang@intel.com, LKML , jason.wessel@windriver.com Date: Mon, 22 Aug 2011 16:22:15 +0200 In-Reply-To: <1313786266-9585-5-git-send-email-dzickus@redhat.com> References: <1313786266-9585-1-git-send-email-dzickus@redhat.com> <1313786266-9585-5-git-send-email-dzickus@redhat.com> Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 8BIT X-Mailer: Evolution 3.0.2- Message-ID: <1314022935.24275.35.camel@twins> Mime-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, 2011-08-19 at 16:37 -0400, Don Zickus wrote: > @@ -260,6 +260,8 @@ unknown_nmi_error(unsigned char reason, struct pt_regs *regs) > pr_emerg("Dazed and confused, but trying to continue\n"); > } > > +DEFINE_PER_CPU(bool, swallow_nmi); > + > static notrace __kprobes void default_do_nmi(struct pt_regs *regs) > { > unsigned char reason = 0; > @@ -271,8 +273,28 @@ static notrace __kprobes void default_do_nmi(struct pt_regs *regs) > * NMI can not be detected/processed on other CPUs. > */ > handled = nmi_handle(NMI_LOCAL, regs); > - if (handled) > + if (handled) { > + /* > + * When handling multiple NMI events, we are not > + * sure if the second NMI was dropped (because of > + * too many NMIs), piggy-backed on the same NMI > + * (perf) or is queued right behind this NMI. > + * In the last case, we may accidentally get an > + * unknown NMI because the event is already handled. > + * Flag for this condition and swallow it later. > + * > + * FIXME: This detection has holes in it mainly > + * because we can't tell _when_ the next NMI comes > + * in. A multi-handled NMI event followed by an > + * unknown NMI a second later, clearly should not > + * be swallowed. > + */ > + if (handled > 1) > + __this_cpu_write(swallow_nmi, true); > + else > + __this_cpu_write(swallow_nmi, false); > return; > + } > > /* Non-CPU-specific NMI: NMI sources can be processed on any CPU */ > raw_spin_lock(&nmi_reason_lock); > @@ -296,6 +318,8 @@ static notrace __kprobes void default_do_nmi(struct pt_regs *regs) > raw_spin_unlock(&nmi_reason_lock); > > unknown_nmi_error(reason, regs); > + > + __this_cpu_write(swallow_nmi, false); > } All writes, no reads... the actual dropping of NMIs got lost and now lives in patch 5 which purports to be about statistics only.