From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932160AbcHXBeO (ORCPT ); Tue, 23 Aug 2016 21:34:14 -0400 Received: from szxga01-in.huawei.com ([58.251.152.64]:51201 "EHLO szxga01-in.huawei.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753156AbcHXBeL (ORCPT ); Tue, 23 Aug 2016 21:34:11 -0400 Subject: Re: [PATCH 1/1] arm64/hugetlb: clear PG_dcache_clean if the page is dirty when munmap To: Catalin Marinas References: <20160707153741.GC27180@e104818-lin.cambridge.arm.com> <577F1FD9.1040205@huawei.com> <20160708135447.GB22099@e104818-lin.cambridge.arm.com> <577FC5AA.5010709@huawei.com> <20160708161347.GC22099@e104818-lin.cambridge.arm.com> <57839474.6030203@huawei.com> <20160712153535.GH22183@e104818-lin.cambridge.arm.com> <578EE603.9020206@huawei.com> <20160720091939.GA25890@e104818-lin.cambridge.arm.com> <57BA7D38.8030103@huawei.com> <20160823172852.GB16213@e104818-lin.cambridge.arm.com> CC: Steve Capper , David Woods , Tianhong Ding , Will Deacon , linux-kernel , Xinwei Hu , Zefan Li , "fangwei (I)" , "Hanjun Guo" , linux-arm-kernel From: "Leizhen (ThunderTown)" Message-ID: <57BCF8B4.9010109@huawei.com> Date: Wed, 24 Aug 2016 09:30:28 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:38.0) Gecko/20100101 Thunderbird/38.5.1 MIME-Version: 1.0 In-Reply-To: <20160823172852.GB16213@e104818-lin.cambridge.arm.com> Content-Type: text/plain; charset="windows-1252" Content-Transfer-Encoding: 7bit X-Originating-IP: [10.177.23.164] X-CFilter-Loop: Reflected X-Mirapoint-Virus-RAPID-Raw: score=unknown(0), refid=str=0001.0A020206.57BCF8C2.01CE,ss=1,re=0.000,recu=0.000,reip=0.000,cl=1,cld=1,fgs=0, ip=0.0.0.0, so=2013-06-18 04:22:30, dmn=2013-03-21 17:37:32 X-Mirapoint-Loop-Id: 925ebe5d6e84ba3f030e1fdc21b06d29 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2016/8/24 1:28, Catalin Marinas wrote: > On Mon, Aug 22, 2016 at 12:19:04PM +0800, Leizhen (ThunderTown) wrote: >> On 2016/7/20 17:19, Catalin Marinas wrote: >>> On Wed, Jul 20, 2016 at 10:46:27AM +0800, Leizhen (ThunderTown) wrote: >>>>>>>> On 2016/7/8 21:54, Catalin Marinas wrote: >>>>>>>>> ------------8<---------------- >>>>>>>>> diff --git a/arch/arm64/mm/flush.c b/arch/arm64/mm/flush.c >>>>>>>>> index dbd12ea8ce68..c753fa804165 100644 >>>>>>>>> --- a/arch/arm64/mm/flush.c >>>>>>>>> +++ b/arch/arm64/mm/flush.c >>>>>>>>> @@ -75,7 +75,8 @@ void __sync_icache_dcache(pte_t pte, unsigned long addr) >>>>>>>>> if (!page_mapping(page)) >>>>>>>>> return; >>>>>>>>> >>>>>>>>> - if (!test_and_set_bit(PG_dcache_clean, &page->flags)) >>>>>>>>> + if (!test_and_set_bit(PG_dcache_clean, &page->flags) || >>>>>>>>> + PageDirty(page)) >>>>>>>>> sync_icache_aliases(page_address(page), >>>>>>>>> PAGE_SIZE << compound_order(page)); >>>>>>>>> else if (icache_is_aivivt()) >>>>>>>>> ----------------8<--------------------- >>>> >>>> Do you plan to send this patch? My colleagues told me that if our >>>> patches are quite different, it should be Signed-off-by you. >>> >>> The reason I'm not sending it is that I don't fully understand how it >>> solves the problem for a shared file mmap(), not just hugetlbfs. As I >>> said in an earlier email: after an msync() in user space we >>> should flush the pages to disk via write_cache_pages(). This function >> Hi Catalin: >> I'm so sorry for my fault. The previous small pages test result I actually ran on ramfs. >> Today, I ran the case on harddisk fs, it worked well without this patch. >> >> Summarized as follows: >> small pages on ramfs: need this patch >> small pages on harddisk fs: no need this patch >> hugetlbfs: need this patch > > I would add: > > small pages over nfs: fails with or without this patch > > (tested on Juno, Cortex-A57; seems to be fixed if I remove the > PG_dcache_clean test altogether but, well, we end up over-flushing) > > I assume that when using a hard drive, it goes through the block I/O > layer and we may have a flush_dcache_page() called when the kernel is > about to read a page that has been mapped in user space. This would > clear the PG_dcache_clean bit and subsequent __sync_icache_dcache() > would perform cache maintenance. > > Could you try on your system the test case without the msync() call? I'm > not sure whether munmap() would trigger an immediate write-back, in > which case we may see the issue even with the filesystem on a hard > drive. OK, no problem. I will do it today or tomorrow. > From mboxrd@z Thu Jan 1 00:00:00 1970 From: thunder.leizhen@huawei.com (Leizhen (ThunderTown)) Date: Wed, 24 Aug 2016 09:30:28 +0800 Subject: [PATCH 1/1] arm64/hugetlb: clear PG_dcache_clean if the page is dirty when munmap In-Reply-To: <20160823172852.GB16213@e104818-lin.cambridge.arm.com> References: <20160707153741.GC27180@e104818-lin.cambridge.arm.com> <577F1FD9.1040205@huawei.com> <20160708135447.GB22099@e104818-lin.cambridge.arm.com> <577FC5AA.5010709@huawei.com> <20160708161347.GC22099@e104818-lin.cambridge.arm.com> <57839474.6030203@huawei.com> <20160712153535.GH22183@e104818-lin.cambridge.arm.com> <578EE603.9020206@huawei.com> <20160720091939.GA25890@e104818-lin.cambridge.arm.com> <57BA7D38.8030103@huawei.com> <20160823172852.GB16213@e104818-lin.cambridge.arm.com> Message-ID: <57BCF8B4.9010109@huawei.com> To: linux-arm-kernel@lists.infradead.org List-Id: linux-arm-kernel.lists.infradead.org On 2016/8/24 1:28, Catalin Marinas wrote: > On Mon, Aug 22, 2016 at 12:19:04PM +0800, Leizhen (ThunderTown) wrote: >> On 2016/7/20 17:19, Catalin Marinas wrote: >>> On Wed, Jul 20, 2016 at 10:46:27AM +0800, Leizhen (ThunderTown) wrote: >>>>>>>> On 2016/7/8 21:54, Catalin Marinas wrote: >>>>>>>>> ------------8<---------------- >>>>>>>>> diff --git a/arch/arm64/mm/flush.c b/arch/arm64/mm/flush.c >>>>>>>>> index dbd12ea8ce68..c753fa804165 100644 >>>>>>>>> --- a/arch/arm64/mm/flush.c >>>>>>>>> +++ b/arch/arm64/mm/flush.c >>>>>>>>> @@ -75,7 +75,8 @@ void __sync_icache_dcache(pte_t pte, unsigned long addr) >>>>>>>>> if (!page_mapping(page)) >>>>>>>>> return; >>>>>>>>> >>>>>>>>> - if (!test_and_set_bit(PG_dcache_clean, &page->flags)) >>>>>>>>> + if (!test_and_set_bit(PG_dcache_clean, &page->flags) || >>>>>>>>> + PageDirty(page)) >>>>>>>>> sync_icache_aliases(page_address(page), >>>>>>>>> PAGE_SIZE << compound_order(page)); >>>>>>>>> else if (icache_is_aivivt()) >>>>>>>>> ----------------8<--------------------- >>>> >>>> Do you plan to send this patch? My colleagues told me that if our >>>> patches are quite different, it should be Signed-off-by you. >>> >>> The reason I'm not sending it is that I don't fully understand how it >>> solves the problem for a shared file mmap(), not just hugetlbfs. As I >>> said in an earlier email: after an msync() in user space we >>> should flush the pages to disk via write_cache_pages(). This function >> Hi Catalin: >> I'm so sorry for my fault. The previous small pages test result I actually ran on ramfs. >> Today, I ran the case on harddisk fs, it worked well without this patch. >> >> Summarized as follows: >> small pages on ramfs: need this patch >> small pages on harddisk fs: no need this patch >> hugetlbfs: need this patch > > I would add: > > small pages over nfs: fails with or without this patch > > (tested on Juno, Cortex-A57; seems to be fixed if I remove the > PG_dcache_clean test altogether but, well, we end up over-flushing) > > I assume that when using a hard drive, it goes through the block I/O > layer and we may have a flush_dcache_page() called when the kernel is > about to read a page that has been mapped in user space. This would > clear the PG_dcache_clean bit and subsequent __sync_icache_dcache() > would perform cache maintenance. > > Could you try on your system the test case without the msync() call? I'm > not sure whether munmap() would trigger an immediate write-back, in > which case we may see the issue even with the filesystem on a hard > drive. OK, no problem. I will do it today or tomorrow. >