From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-0.8 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 9B60BC31E5B for ; Tue, 18 Jun 2019 14:11:22 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id 7B1E72085A for ; Tue, 18 Jun 2019 14:11:22 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729831AbfFROLS (ORCPT ); Tue, 18 Jun 2019 10:11:18 -0400 Received: from gate.crashing.org ([63.228.1.57]:32992 "EHLO gate.crashing.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729743AbfFROLR (ORCPT ); Tue, 18 Jun 2019 10:11:17 -0400 Received: from localhost (localhost.localdomain [127.0.0.1]) by gate.crashing.org (8.14.1/8.14.1) with ESMTP id x5IEB0wY002489; Tue, 18 Jun 2019 09:11:01 -0500 Message-ID: <03865a8c403d3f26aab65d758daa900ab175de08.camel@kernel.crashing.org> Subject: Re: [PATCH v4] driver core: Fix use-after-free and double free on glue directory From: Benjamin Herrenschmidt To: Muchun Song , Greg KH Cc: "Rafael J. Wysocki" , Prateek Sood , Mukesh Ojha , gkohli@codeaurora.org, linux-kernel , linux-arm-msm , zhaowuyun@wingtech.com Date: Wed, 19 Jun 2019 00:11:00 +1000 In-Reply-To: References: <20190516142342.28019-1-smuchun@gmail.com> <20190524190443.GB29565@kroah.com> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.28.5-0ubuntu0.18.04.1 Mime-Version: 1.0 Content-Transfer-Encoding: 8bit Sender: linux-arm-msm-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-arm-msm@vger.kernel.org On Tue, 2019-06-18 at 21:40 +0800, Muchun Song wrote: > Ping guys ? I think this is worth fixing. I agree :-) My opinion hasn't changed though, the right fix isn't making guesses based on the refcount but solve the actual race which is the mutex being dropped between looking for the object existence and deciding to create it :-) Cheers, Ben. > Muchun Song 于2019年5月25日周六 下午8:15写道: > > > > > Hi greg k-h, > > > > Greg KH 于2019年5月25日周六 上午3:04写道: > > > > > > On Thu, May 16, 2019 at 10:23:42PM +0800, Muchun Song wrote: > > > > There is a race condition between removing glue directory and > > > > adding a new > > > > device under the glue directory. It can be reproduced in > > > > following test: > > > > > > > > > > > > Is this related to: > > > Subject: [PATCH v3] drivers: core: Remove glue dirs early > > > only when refcount is 1 > > > > > > ? > > > > > > If so, why is the solution so different? > > > > In the v1 patch, the solution is that remove glue dirs early only > > when > > refcount is 1. So > > the v1 patch like below: > > > > @@ -1825,7 +1825,7 @@ static void cleanup_glue_dir(struct device > > *dev, > > struct kobject *glue_dir) > > return; > > > > mutex_lock(&gdp_mutex); > > - if (!kobject_has_children(glue_dir)) > > + if (!kobject_has_children(glue_dir) && kref_read(&glue_dir- > > >kref) == 1) > > kobject_del(glue_dir); > > kobject_put(glue_dir); > > mutex_unlock(&gdp_mutex); > > ----------------------------------------------------------------- > > ------ > > > > But from Ben's suggestion as below: > > > > I find relying on the object count for such decisions rather > > fragile as > > it could be taken temporarily for other reasons, couldn't it ? In > > which > > case we would just fail... > > > > Ideally, the looking up of the glue dir and creation of its child > > should be protected by the same lock instance (the gdp_mutex in > > that > > case). > > ----------------------------------------------------------------- > > ------ > > > > So another solution is used from Ben's suggestion in the v2 patch. > > But > > I forgot to update the commit message until the v4 patch. Thanks. > > > > Yours, > > Muchun