From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752737AbeC3Ucw (ORCPT ); Fri, 30 Mar 2018 16:32:52 -0400 Received: from Galois.linutronix.de ([146.0.238.70]:56315 "EHLO Galois.linutronix.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752206AbeC3Ucv (ORCPT ); Fri, 30 Mar 2018 16:32:51 -0400 Date: Fri, 30 Mar 2018 22:32:44 +0200 (CEST) From: Thomas Gleixner To: Dave Hansen cc: Ingo Molnar , Linus Torvalds , Linux Kernel Mailing List , linux-mm , Andrea Arcangeli , Andrew Lutomirski , Kees Cook , Hugh Dickins , =?ISO-8859-15?Q?J=FCrgen_Gro=DF?= , the arch/x86 maintainers , namit@vmware.com Subject: Re: [PATCH 00/11] Use global pages with PTI In-Reply-To: <3cdc23a2-99eb-6f93-6934-f7757fa30a3e@linux.intel.com> Message-ID: References: <20180323174447.55F35636@viggo.jf.intel.com> <20180327200719.lvdomez6hszpmo4s@gmail.com> <0d6ea030-ec3b-d649-bad7-89ff54094e25@linux.intel.com> <20180330120920.btobga44wqytlkoe@gmail.com> <20180330121725.zcklh36ulg7crydw@gmail.com> <3cdc23a2-99eb-6f93-6934-f7757fa30a3e@linux.intel.com> User-Agent: Alpine 2.21 (DEB 202 2017-01-01) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII X-Linutronix-Spam-Score: -1.0 X-Linutronix-Spam-Level: - X-Linutronix-Spam-Status: No , -1.0 points, 5.0 required, ALL_TRUSTED=-1,SHORTCIRCUIT=-0.0001 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, 30 Mar 2018, Dave Hansen wrote: > On 03/30/2018 05:17 AM, Ingo Molnar wrote: > > BTW., the expectation on !PCID Intel hardware would be for global pages to help > > even more than the 0.6% and 1.7% you measured on PCID hardware: PCID already > > _reduces_ the cost of TLB flushes - so if there's not even PCID then global pages > > should help even more. > > > > In theory at least. Would still be nice to measure it. > > I did the lseek test on a modern, non-PCID system: > > No Global pages (baseline): 6077741 lseeks/sec > 94 Global pages (this set): 8433111 lseeks/sec > +2355370 lseeks/sec (+38.8%) That's all kernel text, right? What's the result for the case where global is only set for all user/kernel shared pages? Thanks, tglx