From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-14.8 required=3.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED,USER_AGENT_SANE_1, USER_IN_DEF_DKIM_WL autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 4A7DFC4708A for ; Thu, 27 May 2021 09:32:45 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id 2CF37613DE for ; Thu, 27 May 2021 09:32:45 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235868AbhE0JeQ (ORCPT ); Thu, 27 May 2021 05:34:16 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:36878 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S235810AbhE0JeQ (ORCPT ); Thu, 27 May 2021 05:34:16 -0400 Received: from mail-wr1-x435.google.com (mail-wr1-x435.google.com [IPv6:2a00:1450:4864:20::435]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 2B1B9C061574 for ; Thu, 27 May 2021 02:32:42 -0700 (PDT) Received: by mail-wr1-x435.google.com with SMTP id g17so3785426wrs.13 for ; Thu, 27 May 2021 02:32:42 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=WUOW0aYMYacTttJcw/Wumsyv7jlobDPzPKJRlWNJDUo=; b=Fts2SzA7NlCiuPjoPjAYz0zx8cr0BkxmM+vJOFkwElZMHyM96QfAfWgc3HTPRIM2QC C49jhHv2WAQz67scRFNGP9YEJt2Wc1WsuKrRbguWnhIPUWcEySLHMNKLG8Qawz2O6fTC fhLIoYwzBm25hTiWmUcFAxFCAi/rEKqFpTrPXs/BPgYDY/I1EOKRPJigzLc7HC5rCadX Dyow/rpWI3yLED1KeOsd9j3O8UADvwzpJjPcKg1BZwGjOYRp5iPhzJAs3E10eAQA5JgF pK2hVMDl6ma5AgA+JG9R2AYtZUyAM5Whr2ippvA1hHysyKmDsVk3vJE8gf/yBr1SLsIG DjPA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to:user-agent; bh=WUOW0aYMYacTttJcw/Wumsyv7jlobDPzPKJRlWNJDUo=; b=JG/pO6Eo+Zn2opxhn+RTNTXlMvp0QzpASJTQEAN6eEnqnwhy9qqIK/2eeMAUxnupvO IC0WVyR25ontWfKkemzsflG0UVQ6tIafUK+/+Oi9csEAQR7TOt3ea1LkSu6kk9juSidv fK8o7jmORMMVErkRBCuDPnNEZyfabAVBTEDibTyqUl1Q+zAyMJtKQNUQT5J8OFUiFyzH um2buDsCdcGSOc1A/TSBT+Om7cuULH3R5khzvZTQDSKHLaGIXv6BYkFVbKUk19bQDPg3 I8JXXUsnm7R9TOdNSMp3zDnZsQIMwR/yZm8WJ8M//zdGLScBiPsWH8gqkBv43LB90R8G BVvA== X-Gm-Message-State: AOAM532icEqXu862WZqFM2psKYESh/8GBTAiLiZUGW+tpry1CFk1s7Vj GUuA9klZYv/z4UoAhhL3Vc485g== X-Google-Smtp-Source: ABdhPJwsj+iam2ha503lEq0q3f2HZtIBTocjpt2H8N3qkS5YcOaGp/53yu6T35+G43Z0XoJA+yu7mg== X-Received: by 2002:adf:f5c5:: with SMTP id k5mr2256885wrp.81.1622107960640; Thu, 27 May 2021 02:32:40 -0700 (PDT) Received: from elver.google.com ([2a00:79e0:15:13:74ba:ff42:8494:7f35]) by smtp.gmail.com with ESMTPSA id m6sm11820746wml.3.2021.05.27.02.32.39 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 27 May 2021 02:32:39 -0700 (PDT) Date: Thu, 27 May 2021 11:32:34 +0200 From: Marco Elver To: Pavel Begunkov Cc: Jens Axboe , syzbot , io-uring@vger.kernel.org, LKML , syzkaller-bugs , Dmitry Vyukov Subject: Re: [syzbot] KCSAN: data-race in __io_uring_cancel / io_uring_try_cancel_requests Message-ID: References: <000000000000fa9f7005c33d83b9@google.com> <5cf2250a-c580-4dbf-5997-e987c7b71086@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <5cf2250a-c580-4dbf-5997-e987c7b71086@gmail.com> User-Agent: Mutt/2.0.5 (2021-01-21) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, May 26, 2021 at 09:31PM +0100, Pavel Begunkov wrote: > On 5/26/21 5:36 PM, Marco Elver wrote: > > On Wed, 26 May 2021 at 18:29, Pavel Begunkov wrote: > >> On 5/26/21 4:52 PM, Marco Elver wrote: > >>> Due to some moving around of code, the patch lost the actual fix (using > >>> atomically read io_wq) -- so here it is again ... hopefully as intended. > >>> :-) > >> > >> "fortify" damn it... It was synchronised with &ctx->uring_lock > >> before, see io_uring_try_cancel_iowq() and io_uring_del_tctx_node(), > >> so should not clear before *del_tctx_node() > > > > Ah, so if I understand right, the property stated by the comment in > > io_uring_try_cancel_iowq() was broken, and your patch below would fix > > that, right? > > "io_uring: fortify tctx/io_wq cleanup" broke it and the diff > should fix it. > > >> The fix should just move it after this sync point. Will you send > >> it out as a patch? > > > > Do you mean your move of write to io_wq goes on top of the patch I > > proposed? (If so, please also leave your Signed-of-by so I can squash > > it.) > > No, only my diff, but you hinted on what has happened, so I would > prefer you to take care of patching. If you want of course. > > To be entirely fair, assuming that aligned ptr > reads can't be torn, I don't see any _real_ problem. But surely > the report is very helpful and the current state is too wonky, so > should be patched. In the current version, it is a problem if we end up with a double-read, as it is in the current C code. The compiler might of course optimize it into 1 read into a register. Tangent: I avoid reasoning in terms of compiler optimizations where I can. :-) It's is a slippery slope if the code in question isn't tolerant to data races by design (examples are stats counting, or other heuristics -- in the case here that's certainly not the case). Therefore, my wish is that we really ought to resolve as many data races as we can (+ mark intentional ones appropriately). Also, so that we're left with only the interesting cases like in the case here. (More background if you're interested: https://lwn.net/Articles/816850/) The problem here, however, has a nicer resolution as you suggested. > TL;DR; > The synchronisation goes as this: it's usually used by the owner > task, and the owner task deletes it, so is mostly naturally > synchronised. An exception is a worker (not only) that accesses > it for cancellation purpose, but it uses it only under ->uring_lock, > so if removal is also taking the lock it should be fine. see > io_uring_del_tctx_node() locking. Did you mean io_uring_del_task_file()? There is no io_uring_del_tctx_node(). > > So if I understand right, we do in fact have 2 problems: > > 1. the data race as I noted in my patch, and > > Yes, and it deals with it > > > 2. the fact that io_wq does not live long enough. > > Nope, io_wq outlives them fine. I've sent: https://lkml.kernel.org/r/20210527092547.2656514-1-elver@google.com Thanks, -- Marco