From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-8.5 required=3.0 tests=BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,NICE_REPLY_A,SIGNED_OFF_BY, SPF_HELO_NONE,SPF_PASS,USER_AGENT_SANE_1 autolearn=unavailable autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 6345BC433E1 for ; Thu, 13 Aug 2020 12:07:50 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id 354A820774 for ; Thu, 13 Aug 2020 12:07:50 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726596AbgHMMHp (ORCPT ); Thu, 13 Aug 2020 08:07:45 -0400 Received: from szxga07-in.huawei.com ([45.249.212.35]:49138 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726597AbgHMMH0 (ORCPT ); Thu, 13 Aug 2020 08:07:26 -0400 Received: from DGGEMS407-HUB.china.huawei.com (unknown [172.30.72.58]) by Forcepoint Email with ESMTP id C7CE5FDC741F5256B7E6; Thu, 13 Aug 2020 20:07:23 +0800 (CST) Received: from [127.0.0.1] (10.174.176.211) by DGGEMS407-HUB.china.huawei.com (10.3.19.207) with Microsoft SMTP Server id 14.3.487.0; Thu, 13 Aug 2020 20:07:16 +0800 Subject: Re: [PATCH 4.19 016/133] cgroup: fix cgroup_sk_alloc() for sk_clone_lock() To: Greg Kroah-Hartman CC: , , "Cameron Berkenpas" , Peter Geis , Lu Fengqi , =?UTF-8?Q?Dani=c3=abl_Sonck?= , Zhang Qiang , "Thomas Lamprecht" , Daniel Borkmann , Zefan Li , Tejun Heo , Roman Gushchin , Cong Wang , "David S. Miller" References: <20200720152803.732195882@linuxfoundation.org> <20200720152804.513188610@linuxfoundation.org> <20200813114138.GA3754843@kroah.com> From: Yang Yingliang Message-ID: <61e5b2db-d720-4b51-a3ca-3540097dd28f@huawei.com> Date: Thu, 13 Aug 2020 20:07:14 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:68.0) Gecko/20100101 Thunderbird/68.7.0 MIME-Version: 1.0 In-Reply-To: <20200813114138.GA3754843@kroah.com> Content-Type: text/plain; charset="utf-8"; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US X-Originating-IP: [10.174.176.211] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2020/8/13 19:41, Greg Kroah-Hartman wrote: > On Thu, Aug 13, 2020 at 07:30:55PM +0800, Yang Yingliang wrote: >> Hi, >> >> On 2020/7/20 23:36, Greg Kroah-Hartman wrote: >>> From: Cong Wang >>> >>> [ Upstream commit ad0f75e5f57ccbceec13274e1e242f2b5a6397ed ] >>> >>> When we clone a socket in sk_clone_lock(), its sk_cgrp_data is >>> copied, so the cgroup refcnt must be taken too. And, unlike the >>> sk_alloc() path, sock_update_netprioidx() is not called here. >>> Therefore, it is safe and necessary to grab the cgroup refcnt >>> even when cgroup_sk_alloc is disabled. >>> >>> sk_clone_lock() is in BH context anyway, the in_interrupt() >>> would terminate this function if called there. And for sk_alloc() >>> skcd->val is always zero. So it's safe to factor out the code >>> to make it more readable. >>> >>> The global variable 'cgroup_sk_alloc_disabled' is used to determine >>> whether to take these reference counts. It is impossible to make >>> the reference counting correct unless we save this bit of information >>> in skcd->val. So, add a new bit there to record whether the socket >>> has already taken the reference counts. This obviously relies on >>> kmalloc() to align cgroup pointers to at least 4 bytes, >>> ARCH_KMALLOC_MINALIGN is certainly larger than that. >>> >>> This bug seems to be introduced since the beginning, commit >>> d979a39d7242 ("cgroup: duplicate cgroup reference when cloning sockets") >>> tried to fix it but not compeletely. It seems not easy to trigger until >>> the recent commit 090e28b229af >>> ("netprio_cgroup: Fix unlimited memory leak of v2 cgroups") was merged. >>> >>> Fixes: bd1060a1d671 ("sock, cgroup: add sock->sk_cgroup") >>> Reported-by: Cameron Berkenpas >>> Reported-by: Peter Geis >>> Reported-by: Lu Fengqi >>> Reported-by: Daniƫl Sonck >>> Reported-by: Zhang Qiang >>> Tested-by: Cameron Berkenpas >>> Tested-by: Peter Geis >>> Tested-by: Thomas Lamprecht >>> Cc: Daniel Borkmann >>> Cc: Zefan Li >>> Cc: Tejun Heo >>> Cc: Roman Gushchin >>> Signed-off-by: Cong Wang >>> Signed-off-by: David S. Miller >>> Signed-off-by: Greg Kroah-Hartman >>> --- >> [...] >>> +void cgroup_sk_clone(struct sock_cgroup_data *skcd) >>> +{ >>> + /* Socket clone path */ >>> + if (skcd->val) { >> Compare to mainline patch, it's missing *if (skcd->no_refcnt)* check here. >> >> Is it a mistake here ? > Possibly, it is in the cgroup_sk_free() call. Can you send a patch to > fix this up? OK, I checked other stable branches, it also need be fixed in stable-4.9 and stable-4.14. I will send the patches to these branches. > > thanks, > > greg k-h > .