From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Date: Fri, 23 Dec 2016 18:53:36 +0900 From: Minchan Kim To: Michal Hocko CC: "Kirill A. Shutemov" , Andrew Morton , , Jason Evans , "Kirill A . Shutemov" , Will Deacon , Catalin Marinas , , , "[4.5+]" , Andreas Schwab Subject: Re: [PATCH] mm: pmd dirty emulation in page fault handler Message-ID: <20161223095336.GA5305@bbox> References: <1482364101-16204-1-git-send-email-minchan@kernel.org> <20161222081713.GA32480@node.shutemov.name> <20161222145203.GA18970@bbox> <20161223091725.GA23117@dhcp22.suse.cz> MIME-Version: 1.0 In-Reply-To: <20161223091725.GA23117@dhcp22.suse.cz> Content-Type: text/plain; charset="us-ascii" Content-Disposition: inline Sender: owner-linux-mm@kvack.org List-ID: Hi, On Fri, Dec 23, 2016 at 10:17:25AM +0100, Michal Hocko wrote: > On Thu 22-12-16 23:52:03, Minchan Kim wrote: > [...] > > >From b3ec95c0df91ad113525968a4a6b53030fd0b48d Mon Sep 17 00:00:00 2001 > > From: Minchan Kim > > Date: Thu, 22 Dec 2016 23:43:49 +0900 > > Subject: [PATCH v2] mm: pmd dirty emulation in page fault handler > > > > Andreas reported [1] made a test in jemalloc hang in THP mode in arm64. > > http://lkml.kernel.org/r/mvmmvfy37g1.fsf@hawking.suse.de > > > > The problem is page fault handler supports only accessed flag emulation > > for THP page of SW-dirty/accessed architecture. > > > > This patch enables dirty-bit emulation for those architectures. > > Without it, MADV_FREE makes application hang by repeated fault forever. > > The changelog is rather terse and considering the issue is rather subtle > and it aims the stable tree I think it could see more information. How > do we end up looping in the page fault and why the dirty pmd stops it. > Could you update the changelog to be more verbose, please? I am still > digesting this patch but I believe it is correct fwiw... > How about this? Feel free to suggest better wording. Andreas reported [1] made a test in jemalloc hang in THP mode in arm64. http://lkml.kernel.org/r/mvmmvfy37g1.fsf@hawking.suse.de The problem is currently page fault handler doesn't supports dirty bit emulation of pte for non-HW dirty-bit architecture so that application stucks until VM marked the pmd dirty. How the emulation work depends on the architecture. In case of arm64, when it set up pte firstly, it sets pte PTE_RDONLY to get a chance to mark the pte dirty via triggering page fault when store access happens. Once the page fault occurs, VM marks the pte dirty and arch code for setting pte will clear PTE_RDONLY for application to proceed. IOW, if VM doesn't mark the pte dirty, application hangs forever by repeated fault(i.e., store op but the pte is PTE_RDONLY). This patch enables dirty-bit emulation for those architectures. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org From mboxrd@z Thu Jan 1 00:00:00 1970 From: Minchan Kim Subject: Re: [PATCH] mm: pmd dirty emulation in page fault handler Date: Fri, 23 Dec 2016 18:53:36 +0900 Message-ID: <20161223095336.GA5305@bbox> References: <1482364101-16204-1-git-send-email-minchan@kernel.org> <20161222081713.GA32480@node.shutemov.name> <20161222145203.GA18970@bbox> <20161223091725.GA23117@dhcp22.suse.cz> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Return-path: In-Reply-To: <20161223091725.GA23117@dhcp22.suse.cz> Content-Disposition: inline Sender: owner-linux-mm@kvack.org To: Michal Hocko Cc: "Kirill A. Shutemov" , Andrew Morton , linux-mm@kvack.org, Jason Evans , "Kirill A . Shutemov" , Will Deacon , Catalin Marinas , linux-arch@vger.kernel.org, linux-arm-kernel@lists.infradead.org, "[4.5+]" , Andreas Schwab List-Id: linux-arch.vger.kernel.org Hi, On Fri, Dec 23, 2016 at 10:17:25AM +0100, Michal Hocko wrote: > On Thu 22-12-16 23:52:03, Minchan Kim wrote: > [...] > > >From b3ec95c0df91ad113525968a4a6b53030fd0b48d Mon Sep 17 00:00:00 2001 > > From: Minchan Kim > > Date: Thu, 22 Dec 2016 23:43:49 +0900 > > Subject: [PATCH v2] mm: pmd dirty emulation in page fault handler > > > > Andreas reported [1] made a test in jemalloc hang in THP mode in arm64. > > http://lkml.kernel.org/r/mvmmvfy37g1.fsf@hawking.suse.de > > > > The problem is page fault handler supports only accessed flag emulation > > for THP page of SW-dirty/accessed architecture. > > > > This patch enables dirty-bit emulation for those architectures. > > Without it, MADV_FREE makes application hang by repeated fault forever. > > The changelog is rather terse and considering the issue is rather subtle > and it aims the stable tree I think it could see more information. How > do we end up looping in the page fault and why the dirty pmd stops it. > Could you update the changelog to be more verbose, please? I am still > digesting this patch but I believe it is correct fwiw... > How about this? Feel free to suggest better wording. Andreas reported [1] made a test in jemalloc hang in THP mode in arm64. http://lkml.kernel.org/r/mvmmvfy37g1.fsf@hawking.suse.de The problem is currently page fault handler doesn't supports dirty bit emulation of pte for non-HW dirty-bit architecture so that application stucks until VM marked the pmd dirty. How the emulation work depends on the architecture. In case of arm64, when it set up pte firstly, it sets pte PTE_RDONLY to get a chance to mark the pte dirty via triggering page fault when store access happens. Once the page fault occurs, VM marks the pte dirty and arch code for setting pte will clear PTE_RDONLY for application to proceed. IOW, if VM doesn't mark the pte dirty, application hangs forever by repeated fault(i.e., store op but the pte is PTE_RDONLY). This patch enables dirty-bit emulation for those architectures. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from LGEAMRELO11.lge.com ([156.147.23.51]:45296 "EHLO lgeamrelo11.lge.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932659AbcLWJxk (ORCPT ); Fri, 23 Dec 2016 04:53:40 -0500 Date: Fri, 23 Dec 2016 18:53:36 +0900 From: Minchan Kim Subject: Re: [PATCH] mm: pmd dirty emulation in page fault handler Message-ID: <20161223095336.GA5305@bbox> References: <1482364101-16204-1-git-send-email-minchan@kernel.org> <20161222081713.GA32480@node.shutemov.name> <20161222145203.GA18970@bbox> <20161223091725.GA23117@dhcp22.suse.cz> MIME-Version: 1.0 In-Reply-To: <20161223091725.GA23117@dhcp22.suse.cz> Content-Type: text/plain; charset="us-ascii" Content-Disposition: inline Sender: linux-arch-owner@vger.kernel.org List-ID: To: Michal Hocko Cc: "Kirill A. Shutemov" , Andrew Morton , linux-mm@kvack.org, Jason Evans , "Kirill A . Shutemov" , Will Deacon , Catalin Marinas , linux-arch@vger.kernel.org, linux-arm-kernel@lists.infradead.org, "[4.5+]" , Andreas Schwab Message-ID: <20161223095336.EWCvDryk29tppoDUyuSUCK3eUsxWb_37wuAccbtywUU@z> Hi, On Fri, Dec 23, 2016 at 10:17:25AM +0100, Michal Hocko wrote: > On Thu 22-12-16 23:52:03, Minchan Kim wrote: > [...] > > >From b3ec95c0df91ad113525968a4a6b53030fd0b48d Mon Sep 17 00:00:00 2001 > > From: Minchan Kim > > Date: Thu, 22 Dec 2016 23:43:49 +0900 > > Subject: [PATCH v2] mm: pmd dirty emulation in page fault handler > > > > Andreas reported [1] made a test in jemalloc hang in THP mode in arm64. > > http://lkml.kernel.org/r/mvmmvfy37g1.fsf@hawking.suse.de > > > > The problem is page fault handler supports only accessed flag emulation > > for THP page of SW-dirty/accessed architecture. > > > > This patch enables dirty-bit emulation for those architectures. > > Without it, MADV_FREE makes application hang by repeated fault forever. > > The changelog is rather terse and considering the issue is rather subtle > and it aims the stable tree I think it could see more information. How > do we end up looping in the page fault and why the dirty pmd stops it. > Could you update the changelog to be more verbose, please? I am still > digesting this patch but I believe it is correct fwiw... > How about this? Feel free to suggest better wording. Andreas reported [1] made a test in jemalloc hang in THP mode in arm64. http://lkml.kernel.org/r/mvmmvfy37g1.fsf@hawking.suse.de The problem is currently page fault handler doesn't supports dirty bit emulation of pte for non-HW dirty-bit architecture so that application stucks until VM marked the pmd dirty. How the emulation work depends on the architecture. In case of arm64, when it set up pte firstly, it sets pte PTE_RDONLY to get a chance to mark the pte dirty via triggering page fault when store access happens. Once the page fault occurs, VM marks the pte dirty and arch code for setting pte will clear PTE_RDONLY for application to proceed. IOW, if VM doesn't mark the pte dirty, application hangs forever by repeated fault(i.e., store op but the pte is PTE_RDONLY). This patch enables dirty-bit emulation for those architectures. From mboxrd@z Thu Jan 1 00:00:00 1970 From: minchan@kernel.org (Minchan Kim) Date: Fri, 23 Dec 2016 18:53:36 +0900 Subject: [PATCH] mm: pmd dirty emulation in page fault handler In-Reply-To: <20161223091725.GA23117@dhcp22.suse.cz> References: <1482364101-16204-1-git-send-email-minchan@kernel.org> <20161222081713.GA32480@node.shutemov.name> <20161222145203.GA18970@bbox> <20161223091725.GA23117@dhcp22.suse.cz> Message-ID: <20161223095336.GA5305@bbox> To: linux-arm-kernel@lists.infradead.org List-Id: linux-arm-kernel.lists.infradead.org Hi, On Fri, Dec 23, 2016 at 10:17:25AM +0100, Michal Hocko wrote: > On Thu 22-12-16 23:52:03, Minchan Kim wrote: > [...] > > >From b3ec95c0df91ad113525968a4a6b53030fd0b48d Mon Sep 17 00:00:00 2001 > > From: Minchan Kim > > Date: Thu, 22 Dec 2016 23:43:49 +0900 > > Subject: [PATCH v2] mm: pmd dirty emulation in page fault handler > > > > Andreas reported [1] made a test in jemalloc hang in THP mode in arm64. > > http://lkml.kernel.org/r/mvmmvfy37g1.fsf at hawking.suse.de > > > > The problem is page fault handler supports only accessed flag emulation > > for THP page of SW-dirty/accessed architecture. > > > > This patch enables dirty-bit emulation for those architectures. > > Without it, MADV_FREE makes application hang by repeated fault forever. > > The changelog is rather terse and considering the issue is rather subtle > and it aims the stable tree I think it could see more information. How > do we end up looping in the page fault and why the dirty pmd stops it. > Could you update the changelog to be more verbose, please? I am still > digesting this patch but I believe it is correct fwiw... > How about this? Feel free to suggest better wording. Andreas reported [1] made a test in jemalloc hang in THP mode in arm64. http://lkml.kernel.org/r/mvmmvfy37g1.fsf at hawking.suse.de The problem is currently page fault handler doesn't supports dirty bit emulation of pte for non-HW dirty-bit architecture so that application stucks until VM marked the pmd dirty. How the emulation work depends on the architecture. In case of arm64, when it set up pte firstly, it sets pte PTE_RDONLY to get a chance to mark the pte dirty via triggering page fault when store access happens. Once the page fault occurs, VM marks the pte dirty and arch code for setting pte will clear PTE_RDONLY for application to proceed. IOW, if VM doesn't mark the pte dirty, application hangs forever by repeated fault(i.e., store op but the pte is PTE_RDONLY). This patch enables dirty-bit emulation for those architectures.