From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1762596AbZFOThG (ORCPT ); Mon, 15 Jun 2009 15:37:06 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751556AbZFOTg4 (ORCPT ); Mon, 15 Jun 2009 15:36:56 -0400 Received: from terminus.zytor.com ([198.137.202.10]:52010 "EHLO terminus.zytor.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751413AbZFOTgz (ORCPT ); Mon, 15 Jun 2009 15:36:55 -0400 Message-ID: <4A36A1C7.6080005@zytor.com> Date: Mon, 15 Jun 2009 12:32:23 -0700 From: "H. Peter Anvin" User-Agent: Thunderbird 2.0.0.21 (X11/20090320) MIME-Version: 1.0 To: Mathieu Desnoyers CC: Peter Zijlstra , Linus Torvalds , Ingo Molnar , mingo@redhat.com, paulus@samba.org, acme@redhat.com, linux-kernel@vger.kernel.org, penberg@cs.helsinki.fi, vegard.nossum@gmail.com, efault@gmx.de, jeremy@goop.org, npiggin@suse.de, tglx@linutronix.de, linux-tip-commits@vger.kernel.org Subject: Re: [tip:perfcounters/core] perf_counter: x86: Fix call-chain support to use NMI-safe methods References: <20090615171845.GA7664@elte.hu> <4A369508.2090707@zytor.com> <20090615184858.GD6520@Krystal> <1245091917.6741.185.camel@laptop> <20090615185907.GF6520@Krystal> <1245092561.6741.205.camel@laptop> <4A369CD8.3090505@zytor.com> <20090615192720.GA9056@Krystal> In-Reply-To: <20090615192720.GA9056@Krystal> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Mathieu Desnoyers wrote: >>> >> Writing control registers is serializing, so it's a lot more expensive >> than writing a normal register; my *guess* is that it will be on the >> order of 100-200 cycles. >> >> That is not based on any actual information. >> > > Then how about just writing to the cr2 register *if* it has changed > while the NMI handler was running ? > > if (unlikely(read_cr2() != saved_cr2))) > write_cr2(saved_cr2) > > Mathieu > That works fine, obviously, and although it's probably overkill it's also a trivially cheap optimization. -hpa