Subject: Re: [PATCH v3 0/3] kernel/notifier.c: avoid duplicate registration
To: Nixiaoming, "adobriyan@gmail.com", "akpm@linux-foundation.org",
 "anna.schumaker@netapp.com", "arjan@linux.intel.com", "bfields@fieldses.org",
 "chuck.lever@oracle.com", "davem@davemloft.net", "gregkh@linuxfoundation.org",
 "jlayton@kernel.org", "luto@kernel.org", "mingo@kernel.org",
 "Nadia.Derbey@bull.net", "paulmck@linux.vnet.ibm.com",
 "semen.protsenko@linaro.org", "stable@kernel.org",
 "stern@rowland.harvard.edu", "tglx@linutronix.de",
 "torvalds@linux-foundation.org", "trond.myklebust@hammerspace.com",
 "viresh.kumar@linaro.org"
Cc: "Huangjianhui (Alex)", Dailei, "linux-kernel@vger.kernel.org",
 "linux-nfs@vger.kernel.org", "netdev@vger.kernel.org"
References: <1562728147-30251-1-git-send-email-nixiaoming@huawei.com>
From: Vasily Averin
Date: Thu, 11 Jul 2019 16:57:27 +0300
X-Mailing-List: linux-nfs@vger.kernel.org

On 7/11/19 4:55 AM, Nixiaoming wrote:
> On Wed, July 10, 2019 1:49 PM Vasily Averin wrote:
>> On 7/10/19 6:09 AM, Xiaoming Ni wrote:
>>> Registering the same notifier to a hook repeatedly can cause the hook
>>> list to form a ring or lose other members of the list.
>>
>> I think is not enough to _prevent_ 2nd register attempt,
>> it's enough to detect just attempt and generate warning to mark host in bad state.
>>
>
> Duplicate registration is prevented in my patch, not just "mark host in bad state"
>
> Duplicate registration is checked and exited in notifier_chain_cond_register()
>
> Duplicate registration was checked in notifier_chain_register() but only
> the alarm was triggered without exiting. added by commit 831246570d34692e
> ("kernel/notifier.c: double register detection")
>
> My patch is like a combination of 831246570d34692e and notifier_chain_cond_register(),
> which triggers an alarm and exits when a duplicate registration is detected.
>
>> Unexpected 2nd register of the same hook most likely will lead to 2nd unregister,
>> and it can lead to host crash in any time:
>> you can unregister notifier on first attempt it can be too early, it can be still in use.
>> on the other hand you can never call 2nd unregister at all.
>
> Since the member was not added to the linked list at the time of the second registration,
> no linked list ring was formed.
> The member is released on the first unregistration and -ENOENT on the second unregistration.
> After patching, the fault has been alleviated

You are wrong here.
2nd notifier's registration is a pure bug, this should never happen.
If you know the way to reproduce this situation -- you need to fix it.

2nd registration can happen in 2 cases:
1) missed rollback, when someone forgets to call unregister after a successful registration,
and then tries to call register again. It can lead to a crash, for example when the corresponding module is unloaded.
2) some subsystem is registered twice, for example from different namespaces.
In this case unregister called during subsystem cleanup in the first namespace will incorrectly remove
the notifier still used in the second namespace; it also can lead to unexpected behaviour.

> It may be more helpful to return an error code when someone tries to register the same
> notification program a second time.

You are wrong again here, it is senseless.
If you have detected 2nd register -- your node is already in bad state.

> But I noticed that notifier_chain_cond_register() returns 0 when duplicate registration
> is detected. At the same time, in all the existing export function comments of notify,
> "Currently always returns zero"
>
> I am a bit confused: which is better?
>
>>
>> Unfortunately I do not see any ways to handle such cases properly,
>> and it seems for me your patches does not resolve this problem.
>>
>> Am I missed something probably?
>>
>>> case1: An infinite loop in notifier_chain_register() can cause soft lockup
>>> atomic_notifier_chain_register(&test_notifier_list, &test1);
>>> atomic_notifier_chain_register(&test_notifier_list, &test1);
>>> atomic_notifier_chain_register(&test_notifier_list, &test2);
>
> Thanks
>
> Xiaoming Ni
>
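
For readers following the thread, the case1 sequence quoted above can be modelled in userspace. The sketch below is only an illustration, not the kernel code and not the patch under review: struct notifier_block is cut down to two fields, and chain_register_unchecked(), chain_register_checked() and chain_unregister() are hypothetical names. It assumes a priority-sorted singly linked chain with no locking or RCU, and it assumes that the checked variant warns and returns 0 without inserting, which is one of the two return conventions debated in this thread.

```c
#include <errno.h>
#include <stddef.h>
#include <stdio.h>

/* Simplified userspace stand-in for the kernel's struct notifier_block. */
struct notifier_block {
	struct notifier_block *next;
	int priority;
};

/* Priority-sorted insert with no duplicate check (hypothetical helper). */
static int chain_register_unchecked(struct notifier_block **nl,
				    struct notifier_block *n)
{
	while (*nl != NULL) {
		if (n->priority > (*nl)->priority)
			break;
		nl = &(*nl)->next;
	}
	n->next = *nl;
	*nl = n;
	return 0;
}

/* Same insert, but a block already on the chain is left untouched. */
static int chain_register_checked(struct notifier_block **nl,
				  struct notifier_block *n)
{
	while (*nl != NULL) {
		if (*nl == n) {
			fprintf(stderr, "double register detected\n");
			return 0; /* assumption: warn and skip the insert */
		}
		if (n->priority > (*nl)->priority)
			break;
		nl = &(*nl)->next;
	}
	n->next = *nl;
	*nl = n;
	return 0;
}

/* Unlink n if it is on the chain, otherwise report -ENOENT. */
static int chain_unregister(struct notifier_block **nl,
			    struct notifier_block *n)
{
	while (*nl != NULL) {
		if (*nl == n) {
			*nl = n->next;
			return 0;
		}
		nl = &(*nl)->next;
	}
	return -ENOENT;
}
```

In this model, registering the same block twice through chain_register_unchecked() leaves the block's next pointer aimed at itself, so a third registration at equal priority would walk the chain forever. With chain_register_checked() the chain stays linear, the first chain_unregister() removes the block and the second returns -ENOENT. The sketch says nothing about whether the caller that registered twice is itself in a sane state, which is the point argued above.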