From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-6.7 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, INCLUDES_PATCH,MAILING_LIST_MULTI,SIGNED_OFF_BY,SPF_HELO_NONE,SPF_PASS, URIBL_BLOCKED autolearn=unavailable autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 97A29C3F2D1 for ; Thu, 5 Mar 2020 03:42:53 +0000 (UTC) Received: from lists.ozlabs.org (lists.ozlabs.org [203.11.71.2]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPS id 50DAA2073B for ; Thu, 5 Mar 2020 03:42:53 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 50DAA2073B Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=au1.ibm.com Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=linuxppc-dev-bounces+linuxppc-dev=archiver.kernel.org@lists.ozlabs.org Received: from lists.ozlabs.org (lists.ozlabs.org [IPv6:2401:3900:2:1::3]) by lists.ozlabs.org (Postfix) with ESMTP id 48XxQy56ZXzDqjH for ; Thu, 5 Mar 2020 14:42:50 +1100 (AEDT) Authentication-Results: lists.ozlabs.org; spf=pass (sender SPF authorized) smtp.mailfrom=au1.ibm.com (client-ip=148.163.156.1; helo=mx0a-001b2d01.pphosted.com; envelope-from=alastair@au1.ibm.com; receiver=) Authentication-Results: lists.ozlabs.org; dmarc=none (p=none dis=none) header.from=au1.ibm.com Received: from mx0a-001b2d01.pphosted.com (mx0a-001b2d01.pphosted.com [148.163.156.1]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by lists.ozlabs.org (Postfix) with ESMTPS id 48XxLg2S6LzDqjk for ; Thu, 5 Mar 2020 14:39:05 +1100 (AEDT) Received: from pps.filterd (m0187473.ppops.net [127.0.0.1]) by mx0a-001b2d01.pphosted.com (8.16.0.42/8.16.0.42) with SMTP id 0253UGNt149397 for ; Wed, 4 Mar 2020 22:39:03 -0500 Received: from e06smtp02.uk.ibm.com (e06smtp02.uk.ibm.com [195.75.94.98]) by mx0a-001b2d01.pphosted.com with ESMTP id 2yfmg37kr1-1 (version=TLSv1.2 cipher=AES256-GCM-SHA384 bits=256 verify=NOT) for ; Wed, 04 Mar 2020 22:39:02 -0500 Received: from localhost by e06smtp02.uk.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Thu, 5 Mar 2020 03:38:59 -0000 Received: from b06cxnps3074.portsmouth.uk.ibm.com (9.149.109.194) by e06smtp02.uk.ibm.com (192.168.101.132) with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted; (version=TLSv1/SSLv3 cipher=AES256-GCM-SHA384 bits=256/256) Thu, 5 Mar 2020 03:38:52 -0000 Received: from b06wcsmtp001.portsmouth.uk.ibm.com (b06wcsmtp001.portsmouth.uk.ibm.com [9.149.105.160]) by b06cxnps3074.portsmouth.uk.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id 0253cpG563176904 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Thu, 5 Mar 2020 03:38:51 GMT Received: from b06wcsmtp001.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 587A0A405C; Thu, 5 Mar 2020 03:38:51 +0000 (GMT) Received: from b06wcsmtp001.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id B2436A405B; Thu, 5 Mar 2020 03:38:50 +0000 (GMT) Received: from ozlabs.au.ibm.com (unknown [9.192.253.14]) by b06wcsmtp001.portsmouth.uk.ibm.com (Postfix) with ESMTP; Thu, 5 Mar 2020 03:38:50 +0000 (GMT) Received: from adsilva.ozlabs.ibm.com (haven.au.ibm.com [9.192.254.114]) (using TLSv1.2 with cipher AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ozlabs.au.ibm.com (Postfix) with ESMTPSA id C64C2A0264; Thu, 5 Mar 2020 14:38:45 +1100 (AEDT) Subject: Re: [PATCH v3 16/27] powerpc/powernv/pmem: Register a character device for userspace to interact with From: "Alastair D'Silva" To: Frederic Barrat Date: Thu, 05 Mar 2020 14:38:49 +1100 In-Reply-To: References: <20200221032720.33893-1-alastair@au1.ibm.com> <20200221032720.33893-17-alastair@au1.ibm.com> Organization: IBM Australia Content-Type: text/plain; charset="UTF-8" User-Agent: Evolution 3.34.4 (3.34.4-1.fc31) MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-TM-AS-GCONF: 00 x-cbid: 20030503-0008-0000-0000-000003597306 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 20030503-0009-0000-0000-00004A7AA8AC Message-Id: <083251112829de95609c2220dde2e381940184cb.camel@au1.ibm.com> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.138, 18.0.572 definitions=2020-03-04_10:2020-03-04, 2020-03-04 signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 mlxlogscore=999 clxscore=1015 lowpriorityscore=0 malwarescore=0 adultscore=0 suspectscore=2 priorityscore=1501 spamscore=0 mlxscore=0 bulkscore=0 phishscore=0 impostorscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2001150001 definitions=main-2003050016 X-BeenThere: linuxppc-dev@lists.ozlabs.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Linux on PowerPC Developers Mail List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: Madhavan Srinivasan , Alexey Kardashevskiy , Masahiro Yamada , Oliver O'Halloran , Mauro Carvalho Chehab , Ira Weiny , Thomas Gleixner , Rob Herring , Dave Jiang , linux-nvdimm@lists.01.org, "Aneesh Kumar K . V" , Krzysztof Kozlowski , Anju T Sudhakar , Mahesh Salgaonkar , Andrew Donnellan , Arnd Bergmann , Greg Kurz , Nicholas Piggin , =?ISO-8859-1?Q?C=E9dric?= Le Goater , Dan Williams , Hari Bathini , linux-mm@kvack.org, Greg Kroah-Hartman , linux-kernel@vger.kernel.org, Vishal Verma , Paul Mackerras , Andrew Morton , linuxppc-dev@lists.ozlabs.org, "David S. Miller" Errors-To: linuxppc-dev-bounces+linuxppc-dev=archiver.kernel.org@lists.ozlabs.org Sender: "Linuxppc-dev" On Tue, 2020-03-03 at 10:28 +0100, Frederic Barrat wrote: > > Le 21/02/2020 à 04:27, Alastair D'Silva a écrit : > > From: Alastair D'Silva > > > > This patch introduces a character device (/dev/ocxl-scmX) which > > further > > patches will use to interact with userspace. > > > > Signed-off-by: Alastair D'Silva > > --- > > arch/powerpc/platforms/powernv/pmem/ocxl.c | 116 > > +++++++++++++++++- > > .../platforms/powernv/pmem/ocxl_internal.h | 2 + > > 2 files changed, 116 insertions(+), 2 deletions(-) > > > > diff --git a/arch/powerpc/platforms/powernv/pmem/ocxl.c > > b/arch/powerpc/platforms/powernv/pmem/ocxl.c > > index b8bd7e703b19..63109a870d2c 100644 > > --- a/arch/powerpc/platforms/powernv/pmem/ocxl.c > > +++ b/arch/powerpc/platforms/powernv/pmem/ocxl.c > > @@ -10,6 +10,7 @@ > > #include > > #include > > #include > > +#include > > #include > > #include > > #include "ocxl_internal.h" > > @@ -339,6 +340,9 @@ static void free_ocxlpmem(struct ocxlpmem > > *ocxlpmem) > > > > free_minor(ocxlpmem); > > > > + if (ocxlpmem->cdev.owner) > > + cdev_del(&ocxlpmem->cdev); > > + > > if (ocxlpmem->metadata_addr) > > devm_memunmap(&ocxlpmem->dev, ocxlpmem->metadata_addr); > > > > @@ -396,6 +400,70 @@ static int ocxlpmem_register(struct ocxlpmem > > *ocxlpmem) > > return device_register(&ocxlpmem->dev); > > } > > > > +static void ocxlpmem_put(struct ocxlpmem *ocxlpmem) > > +{ > > + put_device(&ocxlpmem->dev); > > +} > > + > > +static struct ocxlpmem *ocxlpmem_get(struct ocxlpmem *ocxlpmem) > > +{ > > + return (get_device(&ocxlpmem->dev) == NULL) ? NULL : ocxlpmem; > > +} > > + > > +static struct ocxlpmem *find_and_get_ocxlpmem(dev_t devno) > > +{ > > + struct ocxlpmem *ocxlpmem; > > + int minor = MINOR(devno); > > + /* > > + * We don't declare an RCU critical section here, as our AFU > > + * is protected by a reference counter on the device. By the > > time the > > + * minor number of a device is removed from the idr, the ref > > count of > > + * the device is already at 0, so no user API will access that > > AFU and > > + * this function can't return it. > > + */ > > I fixed something related in the ocxl driver (which had enough > changes > with the introduction of the "info" device to make a similar comment > become wrong). See commit a58d37bce0d21. The issue is handling a > simultaneous open() and removal of the device through /sysfs as best > we can. > > We are on a file open path and it's not like we're going to have a > thousand clients, so performance is not that critical. We can take > the > mutex before searching in the IDR and release it after we increment > the > reference count on the device. > But that's not enough: we could still find the device in the IDR > while > it is being removed in free_ocxlpmem(). I believe the only safe way > to > address it is by removing the user-facing APIs (the char device) > before > calling device_unregister(). So that it's not possible to find the > device in file_open() if it's in the middle of being removed. > > Fred > > Ok, I'll replicate that patch & follow your advice. > > + ocxlpmem = idr_find(&minors_idr, minor); > > + if (ocxlpmem) > > + ocxlpmem_get(ocxlpmem); > > + return ocxlpmem; > > +} > > + > > +static int file_open(struct inode *inode, struct file *file) > > +{ > > + struct ocxlpmem *ocxlpmem; > > + > > + ocxlpmem = find_and_get_ocxlpmem(inode->i_rdev); > > + if (!ocxlpmem) > > + return -ENODEV; > > + > > + file->private_data = ocxlpmem; > > + return 0; > > +} > > + > > +static int file_release(struct inode *inode, struct file *file) > > +{ > > + struct ocxlpmem *ocxlpmem = file->private_data; > > + > > + ocxlpmem_put(ocxlpmem); > > + return 0; > > +} > > + > > +static const struct file_operations fops = { > > + .owner = THIS_MODULE, > > + .open = file_open, > > + .release = file_release, > > +}; > > + > > +/** > > + * create_cdev() - Create the chardev in /dev for the device > > + * @ocxlpmem: the SCM metadata > > + * Return: 0 on success, negative on failure > > + */ > > +static int create_cdev(struct ocxlpmem *ocxlpmem) > > +{ > > + cdev_init(&ocxlpmem->cdev, &fops); > > + return cdev_add(&ocxlpmem->cdev, ocxlpmem->dev.devt, 1); > > +} > > + > > /** > > * ocxlpmem_remove() - Free an OpenCAPI persistent memory device > > * @pdev: the PCI device information struct > > @@ -572,6 +640,11 @@ static int probe(struct pci_dev *pdev, const > > struct pci_device_id *ent) > > goto err; > > } > > > > + if (create_cdev(ocxlpmem)) { > > + dev_err(&pdev->dev, "Could not create character > > device\n"); > > + goto err; > > + } > > As already mentioned in a previous patch, we branch to the err label > so > rc needs to be set to a valid error. > Ok > > > > + > > elapsed = 0; > > timeout = ocxlpmem->readiness_timeout + ocxlpmem- > > >memory_available_timeout; > > while (!is_usable(ocxlpmem, false)) { > > @@ -613,20 +686,59 @@ static struct pci_driver pci_driver = { > > .shutdown = ocxlpmem_remove, > > }; > > > > +static int file_init(void) > > +{ > > + int rc; > > + > > + mutex_init(&minors_idr_lock); > > + idr_init(&minors_idr); > > + > > + rc = alloc_chrdev_region(&ocxlpmem_dev, 0, NUM_MINORS, "ocxl- > > pmem"); > > + if (rc) { > > + idr_destroy(&minors_idr); > > + pr_err("Unable to allocate OpenCAPI persistent memory > > major number: %d\n", rc); > > + return rc; > > + } > > + > > + ocxlpmem_class = class_create(THIS_MODULE, "ocxl-pmem"); > > + if (IS_ERR(ocxlpmem_class)) { > > + idr_destroy(&minors_idr); > > + pr_err("Unable to create ocxl-pmem class\n"); > > + unregister_chrdev_region(ocxlpmem_dev, NUM_MINORS); > > + return PTR_ERR(ocxlpmem_class); > > + } > > + > > + return 0; > > +} > > + > > +static void file_exit(void) > > +{ > > + class_destroy(ocxlpmem_class); > > + unregister_chrdev_region(ocxlpmem_dev, NUM_MINORS); > > + idr_destroy(&minors_idr); > > +} > > + > > static int __init ocxlpmem_init(void) > > { > > - int rc = 0; > > + int rc; > > > > - rc = pci_register_driver(&pci_driver); > > + rc = file_init(); > > if (rc) > > return rc; > > > > + rc = pci_register_driver(&pci_driver); > > + if (rc) { > > + file_exit(); > > + return rc; > > + } > > + > > return 0; > > } > > > > static void ocxlpmem_exit(void) > > { > > pci_unregister_driver(&pci_driver); > > + file_exit(); > > } > > > > module_init(ocxlpmem_init); > > diff --git a/arch/powerpc/platforms/powernv/pmem/ocxl_internal.h > > b/arch/powerpc/platforms/powernv/pmem/ocxl_internal.h > > index 28e2020f6355..d2d81fec7bb1 100644 > > --- a/arch/powerpc/platforms/powernv/pmem/ocxl_internal.h > > +++ b/arch/powerpc/platforms/powernv/pmem/ocxl_internal.h > > @@ -2,6 +2,7 @@ > > // Copyright 2019 IBM Corp. > > > > #include > > +#include > > #include > > #include > > #include > > @@ -99,6 +100,7 @@ struct ocxlpmem_function0 { > > struct ocxlpmem { > > struct device dev; > > struct pci_dev *pdev; > > + struct cdev cdev; > > struct ocxl_fn *ocxl_fn; > > struct nd_interleave_set nd_set; > > struct nvdimm_bus_descriptor bus_desc; > > -- Alastair D'Silva Open Source Developer Linux Technology Centre, IBM Australia mob: 0423 762 819