From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-yw0-x232.google.com (mail-yw0-x232.google.com [IPv6:2607:f8b0:4002:c05::232]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ml01.01.org (Postfix) with ESMTPS id ED98E2095A6CD for ; Thu, 29 Jun 2017 14:48:41 -0700 (PDT) Received: by mail-yw0-x232.google.com with SMTP id j11so42435079ywa.2 for ; Thu, 29 Jun 2017 14:50:15 -0700 (PDT) MIME-Version: 1.0 In-Reply-To: <59556E37.80808@hpe.com> References: <149875877608.10031.17813337234536358002.stgit@dwillia2-desk3.amr.corp.intel.com> <149875884190.10031.6179599135820559644.stgit@dwillia2-desk3.amr.corp.intel.com> <595552F5.5040008@hpe.com> <59556E37.80808@hpe.com> From: Dan Williams Date: Thu, 29 Jun 2017 14:50:12 -0700 Message-ID: Subject: Re: [PATCH v4 12/16] libnvdimm, nfit: enable support for volatile ranges List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Errors-To: linux-nvdimm-bounces@lists.01.org Sender: "Linux-nvdimm" To: Linda Knippers Cc: Jan Kara , Matthew Wilcox , X86 ML , "linux-kernel@vger.kernel.org" , "linux-nvdimm@lists.01.org" , Al Viro , linux-fsdevel , Christoph Hellwig List-ID: On Thu, Jun 29, 2017 at 2:16 PM, Linda Knippers wrote: > On 06/29/2017 04:42 PM, Dan Williams wrote: >> On Thu, Jun 29, 2017 at 12:20 PM, Linda Knippers wrote: >>> On 06/29/2017 01:54 PM, Dan Williams wrote: >>>> Allow volatile nfit ranges to participate in all the same infrastructure >>>> provided for persistent memory regions. >>> >>> This seems to be a bit more than "other rework". >> >> It's part of the rationale for having a "write_cache" control >> attribute. There's only so much I can squeeze into the subject line, >> but it is mentioned in the cover letter. >> >>>> A resulting resulting namespace >>>> device will still be called "pmem", but the parent region type will be >>>> "nd_volatile". >>> >>> What does this look like to a user or admin? How does someone know that >>> /dev/pmemX is persistent memory and /dev/pmemY isn't? Someone shouldn't >>> have to weed through /sys or ndctl some other interface to figure that out >>> in the future if they don't have to do that today. We have different >>> names for BTT namespaces. Is there a different name for volatile ranges? >> >> No, the block device name is still /dev/pmem. It's already the case >> that you need to check behind just the name of the device to figure >> out if something is actually volatile or not (see memmap=ss!nn >> configurations), > > I don't have any experience with using memmap but if it's primarily used > by developers without NVDIMMs, they'd know it's not persistent. Or is it > primarily used by administrators using non-NFIT NVDIMMs, in which case it > is persistent? > > In any case, how exactly does one determine whether the device is volatile > or not? I'm dumb so tell me the command line or API. Especially with memmap= or e820-defined memory it's unknowable from the kernel. We don't know if the user is using it to cover for a platform where there is no BIOS support for advertising persistent memory, or if they have a BIOS that does not produce an NFIT as is the case here [1], or if it is some developer just testing with no expectation of persistence. [1]: https://github.com/pmem/ndctl/issues/21 >> so I would not be in favor of changing the device >> name if we think the memory might not be persistent. Moreover, I think >> it was a mistake that we change the device name for btt or not, and >> I'm glad Matthew talked me out of making the same mistake with >> memory-mode vs raw-mode pmem namespaces. So, the block device name >> just reflects the driver of the block device, not the properties of >> the device, just like all other block device instances. > > I agree that creating a new device name for BTT was perhaps a mistake, > although it would be good to know how to query a device property for > sector atomicity. The difference between BTT vs. non-BTT seems less > critical to me than knowing in an obvious way whether the device is > actually persistent. We don't have a good way to answer "actually persistent" in the general case. I'm thinking of cases where the energy source on the DIMM has died, or we trigger one of the conditions that leads to the ""unable to guarantee persistence of writes" message. The /dev/pmem device name just tells you that your block device is hosted by a driver that knows how to handle persistent memory constraints, but any other details about the nature of the address range need to come from other sources of information, and potentially information sources that the kernel does not know about. _______________________________________________ Linux-nvdimm mailing list Linux-nvdimm@lists.01.org https://lists.01.org/mailman/listinfo/linux-nvdimm From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753428AbdF2VuQ (ORCPT ); Thu, 29 Jun 2017 17:50:16 -0400 Received: from mail-yw0-f181.google.com ([209.85.161.181]:36173 "EHLO mail-yw0-f181.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752378AbdF2VuP (ORCPT ); Thu, 29 Jun 2017 17:50:15 -0400 MIME-Version: 1.0 In-Reply-To: <59556E37.80808@hpe.com> References: <149875877608.10031.17813337234536358002.stgit@dwillia2-desk3.amr.corp.intel.com> <149875884190.10031.6179599135820559644.stgit@dwillia2-desk3.amr.corp.intel.com> <595552F5.5040008@hpe.com> <59556E37.80808@hpe.com> From: Dan Williams Date: Thu, 29 Jun 2017 14:50:12 -0700 Message-ID: Subject: Re: [PATCH v4 12/16] libnvdimm, nfit: enable support for volatile ranges To: Linda Knippers Cc: "linux-nvdimm@lists.01.org" , Jan Kara , Matthew Wilcox , X86 ML , "linux-kernel@vger.kernel.org" , Al Viro , linux-fsdevel , Christoph Hellwig Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jun 29, 2017 at 2:16 PM, Linda Knippers wrote: > On 06/29/2017 04:42 PM, Dan Williams wrote: >> On Thu, Jun 29, 2017 at 12:20 PM, Linda Knippers wrote: >>> On 06/29/2017 01:54 PM, Dan Williams wrote: >>>> Allow volatile nfit ranges to participate in all the same infrastructure >>>> provided for persistent memory regions. >>> >>> This seems to be a bit more than "other rework". >> >> It's part of the rationale for having a "write_cache" control >> attribute. There's only so much I can squeeze into the subject line, >> but it is mentioned in the cover letter. >> >>>> A resulting resulting namespace >>>> device will still be called "pmem", but the parent region type will be >>>> "nd_volatile". >>> >>> What does this look like to a user or admin? How does someone know that >>> /dev/pmemX is persistent memory and /dev/pmemY isn't? Someone shouldn't >>> have to weed through /sys or ndctl some other interface to figure that out >>> in the future if they don't have to do that today. We have different >>> names for BTT namespaces. Is there a different name for volatile ranges? >> >> No, the block device name is still /dev/pmem. It's already the case >> that you need to check behind just the name of the device to figure >> out if something is actually volatile or not (see memmap=ss!nn >> configurations), > > I don't have any experience with using memmap but if it's primarily used > by developers without NVDIMMs, they'd know it's not persistent. Or is it > primarily used by administrators using non-NFIT NVDIMMs, in which case it > is persistent? > > In any case, how exactly does one determine whether the device is volatile > or not? I'm dumb so tell me the command line or API. Especially with memmap= or e820-defined memory it's unknowable from the kernel. We don't know if the user is using it to cover for a platform where there is no BIOS support for advertising persistent memory, or if they have a BIOS that does not produce an NFIT as is the case here [1], or if it is some developer just testing with no expectation of persistence. [1]: https://github.com/pmem/ndctl/issues/21 >> so I would not be in favor of changing the device >> name if we think the memory might not be persistent. Moreover, I think >> it was a mistake that we change the device name for btt or not, and >> I'm glad Matthew talked me out of making the same mistake with >> memory-mode vs raw-mode pmem namespaces. So, the block device name >> just reflects the driver of the block device, not the properties of >> the device, just like all other block device instances. > > I agree that creating a new device name for BTT was perhaps a mistake, > although it would be good to know how to query a device property for > sector atomicity. The difference between BTT vs. non-BTT seems less > critical to me than knowing in an obvious way whether the device is > actually persistent. We don't have a good way to answer "actually persistent" in the general case. I'm thinking of cases where the energy source on the DIMM has died, or we trigger one of the conditions that leads to the ""unable to guarantee persistence of writes" message. The /dev/pmem device name just tells you that your block device is hosted by a driver that knows how to handle persistent memory constraints, but any other details about the nature of the address range need to come from other sources of information, and potentially information sources that the kernel does not know about.