From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1759081AbZE1MGs (ORCPT ); Thu, 28 May 2009 08:06:48 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1755045AbZE1MGl (ORCPT ); Thu, 28 May 2009 08:06:41 -0400 Received: from courier.cs.helsinki.fi ([128.214.9.1]:37451 "EHLO mail.cs.helsinki.fi" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754944AbZE1MGk (ORCPT ); Thu, 28 May 2009 08:06:40 -0400 Subject: Re: [GIT PULL] scheduler fixes From: Pekka Enberg To: Nick Piggin Cc: Linus Torvalds , Ingo Molnar , Yinghai Lu , Rusty Russell , "H. Peter Anvin" , Jeff Garzik , Alexander Viro , Linux Kernel Mailing List , Andrew Morton , Peter Zijlstra , cl@linux-foundation.org, mpm@selenic.com In-Reply-To: <20090526073813.GE21496@wotan.suse.de> References: <20090525025353.GA2580@elte.hu> <4A1A2261.1000504@kernel.org> <20090525051521.GC23032@elte.hu> <20090525112504.GB24071@wotan.suse.de> <84144f020905250437x585e66a2oc1124a4f1f43059d@mail.gmail.com> <20090525114127.GE24071@wotan.suse.de> <4A1AE5CC.1000209@cs.helsinki.fi> <20090526073813.GE21496@wotan.suse.de> Date: Thu, 28 May 2009 15:06:40 +0300 Message-Id: <1243512400.11533.33.camel@penberg-laptop> Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 7bit X-Mailer: Evolution 2.24.3 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 2009-05-26 at 09:38 +0200, Nick Piggin wrote: > On Mon, May 25, 2009 at 09:39:08PM +0300, Pekka Enberg wrote: > > Linus Torvalds wrote: > > >And vfs_caches_init_early() is actually doing some rather strange things, > > >like doing a "alloc_large_system_hash()" but not unconditionally: it does > > >it in the "late" initialization too, if not done early. inode_init_early > > >does soemthing very similar (ie a _conditional_ early init). > > > > > >So none of this seems to really get a huge advantage from the early init. > > >There seems to be some subtle NUMA issues, but do we really want that? I > > >get the feeling that nobody ever wanted to do it early, and then the NUMA > > >people said "I don't wnt to do this early, but I don't want to touch the > > >non-NUMA case, so I'll do it early for non-numa, and late for numa". > > > > SLUB does sysfs setup in kmem_cache_init() and if I saw some oopses if I > > don't call vfs_caches_init_early() first. I didn't look too closely, though. > > Did you also test the NUMA/hashdist case? vfs_caches_init_early doesn't > do much in that case. No, I tested UMA only. On Tue, 2009-05-26 at 09:38 +0200, Nick Piggin wrote: > I would say it is much more robust to do sysfs setup later if we move > the slab setup so early. Probably it is just quite lucky not to explode > in the !numa case because the vfs needs quite a bit of setting up... That should not be an issue. SLUB already defers sysfs registration until slab_sysfs_init() initcall has been run. So my patches have zero change in how SLUB interracts with sysfs, actually. Pekka