From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-12.1 required=3.0 tests=BAYES_00, DKIM_ADSP_CUSTOM_MED,DKIM_INVALID,DKIM_SIGNED,FREEMAIL_FORGED_FROMDOMAIN, FREEMAIL_FROM,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_CR_TRAILER, INCLUDES_PATCH,MAILING_LIST_MULTI,NICE_REPLY_A,SPF_HELO_NONE,SPF_PASS, USER_AGENT_SANE_1 autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 8E040C433DB for ; Fri, 19 Mar 2021 12:44:08 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id 10272600EF for ; Fri, 19 Mar 2021 12:44:07 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 10272600EF Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=gmail.com Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=owner-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix) id 7FA078D0001; Fri, 19 Mar 2021 08:44:06 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 7D0E36B007D; Fri, 19 Mar 2021 08:44:06 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 625088D0001; Fri, 19 Mar 2021 08:44:06 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0199.hostedemail.com [216.40.44.199]) by kanga.kvack.org (Postfix) with ESMTP id 428326B007B for ; Fri, 19 Mar 2021 08:44:06 -0400 (EDT) Received: from smtpin28.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay04.hostedemail.com (Postfix) with ESMTP id F3B9B9077 for ; Fri, 19 Mar 2021 12:44:05 +0000 (UTC) X-FDA: 77936591292.28.6567328 Received: from mail-lj1-f174.google.com (mail-lj1-f174.google.com [209.85.208.174]) by imf24.hostedemail.com (Postfix) with ESMTP id 720BEA0009E3 for ; Fri, 19 Mar 2021 12:44:05 +0000 (UTC) Received: by mail-lj1-f174.google.com with SMTP id 16so11763084ljc.11 for ; Fri, 19 Mar 2021 05:44:05 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-language:content-transfer-encoding; bh=pE9UTROWsh3/pPLe+cr073qMj1X+IQp0oFmUiWqch/8=; b=MFHThy89oJc+dHthFkFMr+lYlYyUobH7v3iQwNWw2XFxe2ml3Q3AjBzhH3eVp3eQvf THEkRTSt/vkypRQNw1Ua4udKIMHBFCx1tNePa3vHyx+dYt9PGhYcwJfZ9qh/EKzDrnuu IyvnhHPcjWt2NCflmyl9rOuIg5G9etAIx2AdVNeXxLrBpH+6c5fnX4QkHAmkm00QufN5 aumN8Zj12BJgq3/AmS7Rz+VOZzq1vfz+fKS8ef4jok1QUhbxSYiHW6c5K2OGEsOyFJUk CaMEJipR8UgnvdjkAoJa8kw22WH6v4GHIX7AEuANIw37eQHBHeHT0PDxkKlsCqf2VYVD pwOw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=pE9UTROWsh3/pPLe+cr073qMj1X+IQp0oFmUiWqch/8=; b=gYdC6oF0EQho3vF3LpapjNBfNGFMqEPO30M+1wvFkmlB/y4TtmqOX8yx+6fWAhdoDT Ep5lgNumjtNreyLsKjxzBYPi/D55h7wt+MQyoB8ZN7Y/P0wUiHaqayDwL+TxpAmqXBwe 0p5HcfKPTqggLuoEFL9IC2QMdfgLT9pQiNgC1Nr/D7swVBKFhku7kknUROkGAEkzI7i8 /3hnNQZ7Y56X2oKIpvP6hi1asZKav1xwreoNtuZXGgdVp8RfR2JXvh20g1+DATDsjraB oD+gsH9eEdkQL2dstoCArBvbLAtKyHZobJZTmzfL3iefp5CCVpaBRp+ZmdXmF69We3Dl Wy1w== X-Gm-Message-State: AOAM531clcVBotyIPyOh/6zmuQ14GSjgDPBpTM99ZLHfj2GYek77MYgH dyfm2g2JMc09m7c5HpnXzb0= X-Google-Smtp-Source: ABdhPJwoCPtn9KGPuEZhQakE51bBvBf2VLF4yHt3RcvpYXwuXr6z/HqAyaNEZohhsA0CCkQGFPceUA== X-Received: by 2002:a05:651c:387:: with SMTP id e7mr798035ljp.425.1616157843736; Fri, 19 Mar 2021 05:44:03 -0700 (PDT) Received: from [192.168.2.145] (109-252-193-52.dynamic.spd-mgts.ru. [109.252.193.52]) by smtp.googlemail.com with ESMTPSA id k5sm752189ljb.78.2021.03.19.05.44.02 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Fri, 19 Mar 2021 05:44:03 -0700 (PDT) Subject: Re: [PATCH v4] mm: cma: support sysfs To: Minchan Kim , Andrew Morton Cc: linux-mm , LKML , joaodias@google.com, willy@infradead.org, david@redhat.com, surenb@google.com, Greg Kroah-Hartman , John Hubbard , Nicolas Chauvet , "linux-tegra@vger.kernel.org" References: <20210309062333.3216138-1-minchan@kernel.org> From: Dmitry Osipenko Message-ID: Date: Fri, 19 Mar 2021 15:44:02 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.8.1 MIME-Version: 1.0 In-Reply-To: <20210309062333.3216138-1-minchan@kernel.org> Content-Type: text/plain; charset=utf-8 Content-Language: en-US X-Rspamd-Server: rspam04 X-Rspamd-Queue-Id: 720BEA0009E3 X-Stat-Signature: nhw46a8468whezpsw8newshda73b4dmz Received-SPF: none (gmail.com>: No applicable sender policy available) receiver=imf24; identity=mailfrom; envelope-from=""; helo=mail-lj1-f174.google.com; client-ip=209.85.208.174 X-HE-DKIM-Result: pass/pass X-HE-Tag: 1616157845-267419 Content-Transfer-Encoding: quoted-printable X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: 09.03.2021 09:23, Minchan Kim =D0=BF=D0=B8=D1=88=D0=B5=D1=82: > Since CMA is getting used more widely, it's more important to > keep monitoring CMA statistics for system health since it's > directly related to user experience. >=20 > This patch introduces sysfs statistics for CMA, in order to provide > some basic monitoring of the CMA allocator. >=20 > * the number of CMA page successful allocations > * the number of CMA page allocation failures >=20 > These two values allow the user to calcuate the allocation > failure rate for each CMA area. >=20 > e.g.) > /sys/kernel/mm/cma/WIFI/alloc_pages_[success|fail] > /sys/kernel/mm/cma/SENSOR/alloc_pages_[success|fail] > /sys/kernel/mm/cma/BLUETOOTH/alloc_pages_[success|fail] >=20 > The cma_stat was intentionally allocated by dynamic allocation > to harmonize with kobject lifetime management. > https://lore.kernel.org/linux-mm/YCOAmXqt6dZkCQYs@kroah.com/ >=20 > Reviewed-by: Greg Kroah-Hartman > Reviewed-by: John Hubbard > Signed-off-by: Minchan Kim > --- > From v3 - https://lore.kernel.org/linux-mm/20210303205053.2906924-1-min= chan@kernel.org/ > * fix ZERO_OR_NULL_PTR - kernel test robot > * remove prefix cma - david@ > * resolve conflict with vmstat cma in mmotm - akpm@ > * rename stat name with success|fail >=20 > From v2 - https://lore.kernel.org/linux-mm/20210208180142.2765456-1-min= chan@kernel.org/ > * sysfs doc and description modification - jhubbard >=20 > From v1 - https://lore.kernel.org/linux-mm/20210203155001.4121868-1-min= chan@kernel.org/ > * fix sysfs build and refactoring - willy > * rename and drop some attributes - jhubbard >=20 > Documentation/ABI/testing/sysfs-kernel-mm-cma | 25 ++++ > mm/Kconfig | 7 ++ > mm/Makefile | 1 + > mm/cma.c | 7 +- > mm/cma.h | 20 ++++ > mm/cma_sysfs.c | 110 ++++++++++++++++++ > 6 files changed, 168 insertions(+), 2 deletions(-) > create mode 100644 Documentation/ABI/testing/sysfs-kernel-mm-cma > create mode 100644 mm/cma_sysfs.c >=20 > diff --git a/Documentation/ABI/testing/sysfs-kernel-mm-cma b/Documentat= ion/ABI/testing/sysfs-kernel-mm-cma > new file mode 100644 > index 000000000000..02b2bb60c296 > --- /dev/null > +++ b/Documentation/ABI/testing/sysfs-kernel-mm-cma > @@ -0,0 +1,25 @@ > +What: /sys/kernel/mm/cma/ > +Date: Feb 2021 > +Contact: Minchan Kim > +Description: > + /sys/kernel/mm/cma/ contains a subdirectory for each CMA > + heap name (also sometimes called CMA areas). > + > + Each CMA heap subdirectory (that is, each > + /sys/kernel/mm/cma/ directory) contains the > + following items: > + > + alloc_pages_success > + alloc_pages_fail > + > +What: /sys/kernel/mm/cma//alloc_pages_success > +Date: Feb 2021 > +Contact: Minchan Kim > +Description: > + the number of pages CMA API succeeded to allocate > + > +What: /sys/kernel/mm/cma//alloc_pages_fail > +Date: Feb 2021 > +Contact: Minchan Kim > +Description: > + the number of pages CMA API failed to allocate > diff --git a/mm/Kconfig b/mm/Kconfig > index 24c045b24b95..febb7e8e24de 100644 > --- a/mm/Kconfig > +++ b/mm/Kconfig > @@ -513,6 +513,13 @@ config CMA_DEBUGFS > help > Turns on the DebugFS interface for CMA. > =20 > +config CMA_SYSFS > + bool "CMA information through sysfs interface" > + depends on CMA && SYSFS > + help > + This option exposes some sysfs attributes to get information > + from CMA. > + > config CMA_AREAS > int "Maximum count of the CMA areas" > depends on CMA > diff --git a/mm/Makefile b/mm/Makefile > index 72227b24a616..56968b23ed7a 100644 > --- a/mm/Makefile > +++ b/mm/Makefile > @@ -109,6 +109,7 @@ obj-$(CONFIG_CMA) +=3D cma.o > obj-$(CONFIG_MEMORY_BALLOON) +=3D balloon_compaction.o > obj-$(CONFIG_PAGE_EXTENSION) +=3D page_ext.o > obj-$(CONFIG_CMA_DEBUGFS) +=3D cma_debug.o > +obj-$(CONFIG_CMA_SYSFS) +=3D cma_sysfs.o > obj-$(CONFIG_USERFAULTFD) +=3D userfaultfd.o > obj-$(CONFIG_IDLE_PAGE_TRACKING) +=3D page_idle.o > obj-$(CONFIG_DEBUG_PAGE_REF) +=3D debug_page_ref.o > diff --git a/mm/cma.c b/mm/cma.c > index 908f04775686..ac050359faae 100644 > --- a/mm/cma.c > +++ b/mm/cma.c > @@ -507,10 +507,13 @@ struct page *cma_alloc(struct cma *cma, size_t co= unt, unsigned int align, > =20 > pr_debug("%s(): returned %p\n", __func__, page); > out: > - if (page) > + if (page) { > count_vm_event(CMA_ALLOC_SUCCESS); > - else > + cma_sysfs_alloc_pages_count(cma, count); > + } else { > count_vm_event(CMA_ALLOC_FAIL); > + cma_sysfs_fail_pages_count(cma, count); > + } > =20 > return page; > } > diff --git a/mm/cma.h b/mm/cma.h > index 42ae082cb067..95d1aa2d808a 100644 > --- a/mm/cma.h > +++ b/mm/cma.h > @@ -3,6 +3,16 @@ > #define __MM_CMA_H__ > =20 > #include > +#include > + > +struct cma_stat { > + spinlock_t lock; > + /* the number of CMA page successful allocations */ > + unsigned long nr_pages_succeeded; > + /* the number of CMA page allocation failures */ > + unsigned long nr_pages_failed; > + struct kobject kobj; > +}; > =20 > struct cma { > unsigned long base_pfn; > @@ -16,6 +26,9 @@ struct cma { > struct debugfs_u32_array dfs_bitmap; > #endif > char name[CMA_MAX_NAME]; > +#ifdef CONFIG_CMA_SYSFS > + struct cma_stat *stat; > +#endif > }; > =20 > extern struct cma cma_areas[MAX_CMA_AREAS]; > @@ -26,4 +39,11 @@ static inline unsigned long cma_bitmap_maxno(struct = cma *cma) > return cma->count >> cma->order_per_bit; > } > =20 > +#ifdef CONFIG_CMA_SYSFS > +void cma_sysfs_alloc_pages_count(struct cma *cma, size_t count); > +void cma_sysfs_fail_pages_count(struct cma *cma, size_t count); > +#else > +static inline void cma_sysfs_alloc_pages_count(struct cma *cma, size_t= count) {}; > +static inline void cma_sysfs_fail_pages_count(struct cma *cma, size_t = count) {}; > +#endif > #endif > diff --git a/mm/cma_sysfs.c b/mm/cma_sysfs.c > new file mode 100644 > index 000000000000..3134b2b3a96d > --- /dev/null > +++ b/mm/cma_sysfs.c > @@ -0,0 +1,110 @@ > +// SPDX-License-Identifier: GPL-2.0 > +/* > + * CMA SysFS Interface > + * > + * Copyright (c) 2021 Minchan Kim > + */ > + > +#include > +#include > +#include > + > +#include "cma.h" > + > +static struct cma_stat *cma_stats; > + > +void cma_sysfs_alloc_pages_count(struct cma *cma, size_t count) > +{ > + spin_lock(&cma->stat->lock); > + cma->stat->nr_pages_succeeded +=3D count; > + spin_unlock(&cma->stat->lock); > +} > + > +void cma_sysfs_fail_pages_count(struct cma *cma, size_t count) > +{ > + spin_lock(&cma->stat->lock); > + cma->stat->nr_pages_failed +=3D count; > + spin_unlock(&cma->stat->lock); > +} > + > +#define CMA_ATTR_RO(_name) \ > + static struct kobj_attribute _name##_attr =3D __ATTR_RO(_name) > + > +static struct kobject *cma_kobj; > + > +static ssize_t alloc_pages_success_show(struct kobject *kobj, > + struct kobj_attribute *attr, char *buf) > +{ > + struct cma_stat *stat =3D container_of(kobj, struct cma_stat, kobj); > + > + return sysfs_emit(buf, "%lu\n", stat->nr_pages_succeeded); > +} > +CMA_ATTR_RO(alloc_pages_success); > + > +static ssize_t alloc_pages_fail_show(struct kobject *kobj, > + struct kobj_attribute *attr, char *buf) > +{ > + struct cma_stat *stat =3D container_of(kobj, struct cma_stat, kobj); > + > + return sysfs_emit(buf, "%lu\n", stat->nr_pages_failed); > +} > +CMA_ATTR_RO(alloc_pages_fail); > + > +static void cma_kobj_release(struct kobject *kobj) > +{ > + struct cma_stat *stat =3D container_of(kobj, struct cma_stat, kobj); > + > + kfree(stat); > +} > + > +static struct attribute *cma_attrs[] =3D { > + &alloc_pages_success_attr.attr, > + &alloc_pages_fail_attr.attr, > + NULL, > +}; > +ATTRIBUTE_GROUPS(cma); > + > +static struct kobj_type cma_ktype =3D { > + .release =3D cma_kobj_release, > + .sysfs_ops =3D &kobj_sysfs_ops, > + .default_groups =3D cma_groups > +}; > + > +static int __init cma_sysfs_init(void) > +{ > + int i =3D 0; > + struct cma *cma; > + > + cma_kobj =3D kobject_create_and_add("cma", mm_kobj); > + if (!cma_kobj) > + return -ENOMEM; > + > + cma_stats =3D kmalloc_array(cma_area_count, sizeof(struct cma_stat), > + GFP_KERNEL|__GFP_ZERO); Use kcalloc(). Code identation is wrong, please use checkpatch. > + if (ZERO_OR_NULL_PTR(cma_stats)) > + goto out; > + > + do { > + cma =3D &cma_areas[i]; > + cma->stat =3D &cma_stats[i]; > + spin_lock_init(&cma->stat->lock); > + if (kobject_init_and_add(&cma->stat->kobj, &cma_ktype, > + cma_kobj, "%s", cma->name)) { > + kobject_put(&cma->stat->kobj); > + goto out; > + } > + } while (++i < cma_area_count); > + > + return 0; > +out: > + while (--i >=3D 0) { > + cma =3D &cma_areas[i]; > + kobject_put(&cma->stat->kobj); > + } > + > + kfree(cma_stats); > + kobject_put(cma_kobj); > + > + return -ENOMEM; > +} > +subsys_initcall(cma_sysfs_init); >=20 Hi, There is a NULL derence on ARM32 NVIDIA Tegra SoCs with CONFIG_CMA_SYSFS=3D= y using today's next-20210319, please take a look. [ 1.185423] 8<--- cut here --- [ 1.186081] Unable to handle kernel NULL pointer dereference at virtua= l address 00000000 [ 1.186705] pgd =3D (ptrval) [ 1.188130] [00000000] *pgd=3D00000000 [ 1.190554] Internal error: Oops: 5 [#1] PREEMPT SMP THUMB2 [ 1.191545] Modules linked in: [ 1.192629] CPU: 1 PID: 1 Comm: swapper/0 Tainted: G W = 5.12.0-rc3-next-20210319-00174-g89b3b421dd2b #7142 [ 1.193540] Hardware name: NVIDIA Tegra SoC (Flattened Device Tree) [ 1.194613] PC is at _raw_spin_lock+0x1a/0x48 [ 1.200352] LR is at cma_sysfs_alloc_pages_count+0x13/0x24 [ 1.200821] pc : [] lr : [] psr: 00000033 [ 1.201269] sp : c1547e48 ip : f0000080 fp : 0000c800 [ 1.201580] r10: c13bd178 r9 : 00000040 r8 : 00000040 [ 1.201972] r7 : 00000000 r6 : c13bd168 r5 : 00000040 r4 : c13bd168 [ 1.202418] r3 : c1546000 r2 : 00000001 r1 : 00000040 r0 : 00000000 [ 1.203014] Flags: nzcv IRQs on FIQs on Mode SVC_32 ISA Thumb Seg= ment none [ 1.203488] Control: 50c5387d Table: 0000406a DAC: 00000051 [ 1.203988] Register r0 information: NULL pointer [ 1.204868] Register r1 information: non-paged memory [ 1.205233] Register r2 information: non-paged memory [ 1.205563] Register r3 information: non-slab/vmalloc memory [ 1.206213] Register r4 information: non-slab/vmalloc memory [ 1.206578] Register r5 information: non-paged memory [ 1.206929] Register r6 information: non-slab/vmalloc memory [ 1.207278] Register r7 information: NULL pointer [ 1.207594] Register r8 information: non-paged memory [ 1.207968] Register r9 information: non-paged memory [ 1.208291] Register r10 information: non-slab/vmalloc memory [ 1.208648] Register r11 information: non-paged memory [ 1.209002] Register r12 information: non-paged memory [ 1.209407] Process swapper/0 (pid: 1, stack limit =3D 0x(ptrval)) [ 1.209956] Stack: (0xc1547e48 to 0xc1548000) [ 1.211102] 7e40: c1199e24 00033800 efe35000 c027706= f 0000003f 00000000 [ 1.211999] 7e60: 00000040 0000003f 00000000 00000040 00000cc0 0000000= 0 00000006 00000000 [ 1.212855] 7e80: c1547ed8 00040000 c1141780 00000001 00000000 c1547ed= 8 00000647 00000040 [ 1.213768] 7ea0: c138c000 c01112ab c0fed0d8 c1141780 c1546000 0000000= 1 c1140d24 00000000 [ 1.214648] 7ec0: c1140d44 c1104af9 c1104a7f 00000001 00000000 00000cc= 0 00000000 e29968ad [ 1.215578] 7ee0: c161a077 c1546000 c136d940 c1104a7f ffffe000 c0101d6= 9 c161a077 c161a098 [ 1.216564] 7f00: c1058490 c0138345 c10566cc c0ef58b8 c11003d1 c154600= 0 00000000 00000002 [ 1.217357] 7f20: 00000002 c0f10280 c0efe3e4 c0efe398 c11003d1 c161a07= 4 c161a077 e29968ad [ 1.218212] 7f40: c1140d40 e29968ad c161a000 c118d304 00000003 c11003d= 1 c161a000 c1140d24 [ 1.219164] 7f60: 0000017e c1101141 00000002 00000002 00000000 c11003d= 1 c0a27fc5 c10566cc [ 1.220015] 7f80: c1547f98 00000000 c0a27fc5 00000000 00000000 0000000= 0 00000000 00000000 [ 1.220863] 7fa0: 00000000 c0a27fd1 00000000 c0100155 00000000 0000000= 0 00000000 00000000 [ 1.221713] 7fc0: 00000000 00000000 00000000 00000000 00000000 0000000= 0 00000000 00000000 [ 1.222584] 7fe0: 00000000 00000000 00000000 00000000 00000013 0000000= 0 00000000 00000000 [ 1.225038] [] (_raw_spin_lock) from [] (cma_sysfs= _alloc_pages_count+0x13/0x24) [ 1.226190] [] (cma_sysfs_alloc_pages_count) from [] (cma_alloc+0x153/0x274) [ 1.226720] [] (cma_alloc) from [] (__alloc_from_c= ontiguous+0x37/0x8c) [ 1.227272] [] (__alloc_from_contiguous) from [] (= atomic_pool_init+0x7b/0x126) [ 1.233596] [] (atomic_pool_init) from [] (do_one_= initcall+0x45/0x1e4) [ 1.234188] [] (do_one_initcall) from [] (kernel_i= nit_freeable+0x157/0x1a6) [ 1.234741] [] (kernel_init_freeable) from [] (ker= nel_init+0xd/0xe0) [ 1.235289] [] (kernel_init) from [] (ret_from_for= k+0x11/0x1c)