From: Alan Tull
Date: Thu, 18 Oct 2018 15:24:53 -0500
Subject: Re: [PATCH v4 01/18] of: overlay: add tests to validate kfrees from overlay removal
To: Frank Rowand
Cc: "open list:OPEN FIRMWARE AND FLATTENED DEVICE TREE BINDINGS", linux-fpga@vger.kernel.org, Pantelis Antoniou, linux-kernel, Rob Herring, Moritz Fischer, Paul Mackerras, linuxppc-dev
References: <1539657458-24401-1-git-send-email-frowand.list@gmail.com> <1539657458-24401-2-git-send-email-frowand.list@gmail.com>
List-Id: Linux on PowerPC Developers Mail List

On Wed, Oct 17, 2018 at 4:30 PM Alan Tull wrote:
>
> On Mon, Oct 15, 2018 at 9:39 PM wrote:
>
> Hi Frank,
>
> >
> > From: Frank Rowand
> >
> > Add checks:
> >   - attempted kfree due to refcount reaching zero before overlay
> >     is removed
> >   - properties linked to an overlay node when the node is removed
> >   - node refcount > one during node removal in a changeset destroy,
> >     if the node was created by the changeset
> >
> > After applying this patch, several validation warnings will be
> > reported from the devicetree unittest during boot due to
> > pre-existing devicetree bugs. The warnings will be similar to:
> >
> >   OF: ERROR: of_node_release() overlay node /testcase-data/overlay-node/test-bus/test-unittest11/test-unittest111 contains unexpected properties
> >   OF: ERROR: memory leak - destroy cset entry: attach overlay node /testcase-data-2/substation@100/hvac-medium-2 expected refcount 1 instead of 2. of_node_get() / of_node_put() are unbalanced for this node.
> >
> > Signed-off-by: Frank Rowand
> > ---
> > Changes since v3:
> >   - Add expected value of refcount for destroy cset entry error. Also
> >     explain the cause of the error.
> >
> >  drivers/of/dynamic.c | 29 +++++++++++++++++++++++++++++
> >  drivers/of/overlay.c |  1 +
> >  include/linux/of.h   | 15 ++++++++++-----
> >  3 files changed, 40 insertions(+), 5 deletions(-)
> >
> > diff --git a/drivers/of/dynamic.c b/drivers/of/dynamic.c
> > index f4f8ed9b5454..24c97b7a050f 100644
> > --- a/drivers/of/dynamic.c
> > +++ b/drivers/of/dynamic.c
> > @@ -330,6 +330,25 @@ void of_node_release(struct kobject *kobj)
> >         if (!of_node_check_flag(node, OF_DYNAMIC))
> >                 return;
> >
> > +       if (of_node_check_flag(node, OF_OVERLAY)) {
> > +
> > +               if (!of_node_check_flag(node, OF_OVERLAY_FREE_CSET)) {
> > +                       /* premature refcount of zero, do not free memory */
> > +                       pr_err("ERROR: memory leak %s() overlay node %pOF before free overlay changeset\n",
> > +                              __func__, node);
> > +                       return;
> > +               }
> > +
> > +               /*
> > +                * If node->properties non-empty then properties were added
> > +                * to this node either by different overlay that has not
> > +                * yet been removed, or by a non-overlay mechanism.
> > +                */
> > +               if (node->properties)
> > +                       pr_err("ERROR: %s() overlay node %pOF contains unexpected properties\n",
> > +                              __func__, node);
> > +       }
> > +
> >         property_list_free(node->properties);
> >         property_list_free(node->deadprops);
> >
> > @@ -434,6 +453,16 @@ struct device_node *__of_node_dup(const struct device_node *np,
> >
> >  static void __of_changeset_entry_destroy(struct of_changeset_entry *ce)
> >  {
> > +       if (ce->action == OF_RECONFIG_ATTACH_NODE &&
> > +           of_node_check_flag(ce->np, OF_OVERLAY)) {
> > +               if (kref_read(&ce->np->kobj.kref) > 1) {
> > +                       pr_err("ERROR: memory leak - destroy cset entry: attach overlay node %pOF expected refcount 1 instead of %d. of_node_get() / of_node_put() are unbalanced for this node.\n",
> > +                              ce->np, kref_read(&ce->np->kobj.kref));
>
> Still testing as much as I have time to do.
>
> I'm hitting this error message once when removing an overlay that adds
> several child nodes.
> The only node I get the message for was a node
> that added a fixed-clock (the other nodes didn't trigger the error).
> Then even if I edited all the rest of the overlay DTS and removed all
> other child nodes and all references to the clock from other nodes, I
> still got the error.
>
> Removing dtbo: 1-socfpga_arria10_socdk_sdmmc_ghrd_ovl_ext_cfg.dtb
> [   72.032270] OF: ERROR: memory leak - destroy cset entry: attach
> overlay node /soc/base_fpga_region/clk_0 expected refcount 1 instead
> of 2. of_node_get() / of_node_put() are unbalanced for this node.

Update: with some helpful offline debug patches from Frank, I was able
to find the source of the of_node_get/put unbalance. The fixed-rate
clock driver calls of_clk_add_provider() when probed but never calls
of_clk_del_provider().

This patchset quite likely will uncover other of_node_get/put
unbalances around the kernel.

Alan

>
> Here's the very stripped down overlay:
>
> /dts-v1/;
> /plugin/;
> / {
>         fragment@0 {
>                 target-path = "/soc/base_fpga_region";
>                 #address-cells = <1>;
>                 #size-cells = <1>;
>
>                 __overlay__ {
>                         external-fpga-config;
>
>                         #address-cells = <1>;
>                         #size-cells = <1>;
>
>                         clk_0: clk_0 {
>                                 compatible = "fixed-clock";
>                                 #clock-cells = <0>;
>                                 clock-frequency = <100000000>;  /* 100.00 MHz */
>                                 clock-output-names = "clk_0-clk";
>                         };
>                 };
>         };
> };
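
For anyone following along, here is a rough userspace sketch of the
imbalance described in the update above. It is only a model, not kernel
code: every model_* name is made up for illustration, and the single
assumption it encodes is that registering a clock provider takes a node
reference which only the matching unregister drops.

```c
#include <assert.h>
#include <stdio.h>

/* Userspace stand-in for a device node's refcount (hypothetical type). */
struct model_node {
	const char *name;
	int refcount;
};

static void model_node_get(struct model_node *np)
{
	np->refcount++;
}

static void model_node_put(struct model_node *np)
{
	assert(np->refcount > 0);
	np->refcount--;
}

/* Stands in for of_clk_add_provider(): takes a reference on the node. */
static void model_add_provider(struct model_node *np)
{
	model_node_get(np);
}

/* Stands in for of_clk_del_provider(): drops that reference. */
static void model_del_provider(struct model_node *np)
{
	model_node_put(np);
}

/* Hypothetical driver probe: registers the provider. */
static void model_probe(struct model_node *np)
{
	model_add_provider(np);
}

/* Hypothetical driver remove: del_provider == 0 models the buggy case. */
static void model_remove(struct model_node *np, int del_provider)
{
	if (del_provider)
		model_del_provider(np);
}

/*
 * Returns the refcount seen when the changeset entry is destroyed after
 * one probe/remove cycle. The check in the patch expects 1.
 */
static int model_refcount_at_destroy(int del_provider)
{
	/* The overlay changeset holds the initial reference. */
	struct model_node np = { "/soc/base_fpga_region/clk_0", 1 };

	model_probe(&np);
	model_remove(&np, del_provider);

	if (np.refcount > 1)
		printf("model: node %s expected refcount 1 instead of %d\n",
		       np.name, np.refcount);

	return np.refcount;
}
```

Calling model_refcount_at_destroy(0) leaves the extra reference in place,
which is the situation the new check reports; calling it with 1 balances
the get and put.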