From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-1.0 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 167DAC282DD for ; Thu, 23 May 2019 16:11:53 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id E6CB721773 for ; Thu, 23 May 2019 16:11:52 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1731174AbfEWQLw (ORCPT ); Thu, 23 May 2019 12:11:52 -0400 Received: from relay.sw.ru ([185.231.240.75]:51748 "EHLO relay.sw.ru" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1730782AbfEWQLv (ORCPT ); Thu, 23 May 2019 12:11:51 -0400 Received: from [172.16.25.169] by relay.sw.ru with esmtp (Exim 4.91) (envelope-from ) id 1hTqJe-00034X-DO; Thu, 23 May 2019 19:11:38 +0300 Subject: Re: [PATCH v2 0/7] mm: process_vm_mmap() -- syscall for duplication a process mapping To: "Kirill A. Shutemov" Cc: akpm@linux-foundation.org, dan.j.williams@intel.com, mhocko@suse.com, keith.busch@intel.com, kirill.shutemov@linux.intel.com, alexander.h.duyck@linux.intel.com, ira.weiny@intel.com, andreyknvl@google.com, arunks@codeaurora.org, vbabka@suse.cz, cl@linux.com, riel@surriel.com, keescook@chromium.org, hannes@cmpxchg.org, npiggin@gmail.com, mathieu.desnoyers@efficios.com, shakeelb@google.com, guro@fb.com, aarcange@redhat.com, hughd@google.com, jglisse@redhat.com, mgorman@techsingularity.net, daniel.m.jordan@oracle.com, jannh@google.com, kilobyte@angband.pl, linux-api@vger.kernel.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org References: <155836064844.2441.10911127801797083064.stgit@localhost.localdomain> <20190522152254.5cyxhjizuwuojlix@box> From: Kirill Tkhai Message-ID: <4b0a2b23-abc7-fa0d-5e30-74741331e7e5@virtuozzo.com> Date: Thu, 23 May 2019 19:11:37 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.6.1 MIME-Version: 1.0 In-Reply-To: <20190522152254.5cyxhjizuwuojlix@box> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 22.05.2019 18:22, Kirill A. Shutemov wrote: > On Mon, May 20, 2019 at 05:00:01PM +0300, Kirill Tkhai wrote: >> This patchset adds a new syscall, which makes possible >> to clone a VMA from a process to current process. >> The syscall supplements the functionality provided >> by process_vm_writev() and process_vm_readv() syscalls, >> and it may be useful in many situation. > > Kirill, could you explain how the change affects rmap and how it is safe. > > My concern is that the patchset allows to map the same page multiple times > within one process or even map page allocated by child to the parrent. > > It was not allowed before. > > In the best case it makes reasoning about rmap substantially more difficult. I don't think here is big impact from process relationships, because of as it existed before, the main rule of VMA chaining is that VMA is younger or older each other. For example, reusing of anon_vma in anon_vma_clone() may be done either children or siblings. Also, it is possible reparenting after some of processes dies; or splitting two branches of processes having the same grand parent into two chains after the grand parent dies, so it looks there should be many combinations already available. Mapping of the same page multiple times is a different thing, and it was never allowed for rmap. Could you please say more specifically what looks suspicious for you and I'll try to answer then? Otherwise, it's possible to write explanations as big as a dissertation and to miss all answers to that is interested for you :) > > But I'm worry it will introduce hard-to-debug bugs, like described in > https://lwn.net/Articles/383162/. I read the article, but there are a lot of messages in thread, I'm not sure, that found the actual fix there. But it looks like one of the fixes may be be usage of anon_vma->root in __page_set_anon_rmap(). > Note, that is some cases we care about rmap walk order (see for instance > mremap() case). I'm not convinced that the feature will not break > something in the area. Yeah, thanks for pointing, I'll check this. Kirill