From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Message-ID: <2031cf4705d76dd4d0f722a600a6a106cce2ba41.camel@linux.intel.com> Subject: Re: [driver-core PATCH v6 9/9] libnvdimm: Schedule device registration on node local to the device From: Alexander Duyck Date: Tue, 27 Nov 2018 10:04:09 -0800 In-Reply-To: References: <154170028986.12967.2108024712555179678.stgit@ahduyck-desk1.jf.intel.com> <154170044652.12967.17419321472770956712.stgit@ahduyck-desk1.jf.intel.com> Content-Type: text/plain; charset="UTF-8" Mime-Version: 1.0 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org To: Dan Williams Cc: Linux Kernel Mailing List , Greg KH , linux-nvdimm , Tejun Heo , Andrew Morton , Linux-pm mailing list , jiangshanlai@gmail.com, "Rafael J. Wysocki" , "Brown, Len" , Pavel Machek , zwisler@kernel.org, Dave Jiang , bvanassche@acm.org List-ID: On Mon, 2018-11-26 at 18:21 -0800, Dan Williams wrote: > On Thu, Nov 8, 2018 at 10:07 AM Alexander Duyck > wrote: > > > > Force the device registration for nvdimm devices to be closer to the actual > > device. This is achieved by using either the NUMA node ID of the region, or > > of the parent. By doing this we can have everything above the region based > > on the region, and everything below the region based on the nvdimm bus. > > > > By guaranteeing NUMA locality I see an improvement of as high as 25% for > > per-node init of a system with 12TB of persistent memory. > > > > It seems the speed-up is achieved with just patches 1, 2, and 9 from > this series, correct? I wouldn't want to hold up that benefit while > the driver-core bits are debated. Actually patch 6 ends up impacting things for persistent memory as well. The problem is that all the async calls to add interfaces only do anything if the driver is already loaded. So there are cases such as the X86_PMEM_LEGACY_DEVICE case where the memory regions end up still being serialized because the devices are added before the driver. > You can add: > > Reviewed-by: Dan Williams > > ...if the series needs to be kept together, but as far as I can see > the workqueue changes enable 2 sub-topics of development and it might > make sense for Tejun to take those first 2 and then Greg and I can > base any follow-up topics on that stable baseline. I had originally put this out there for Tejun to apply, but him and Greg had talked and Greg agreed to apply the set. If it works for you I would prefer to just keep it together for now as I don't believe there will be too many more revisions of this needed.