From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from gate.crashing.org (gate.crashing.org [63.228.1.57]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (Client did not present a certificate) by ozlabs.org (Postfix) with ESMTPS id 3AFB2B7B69 for ; Thu, 24 Sep 2009 16:44:26 +1000 (EST) Subject: Re: [PATCH] powerpc/8xx: fix regression introduced by cache coherency rewrite From: Benjamin Herrenschmidt To: Rex Feany In-Reply-To: <20090924004552.GA11737@compile2.chatsunix.int.mrv.com> References: <20090924004552.GA11737@compile2.chatsunix.int.mrv.com> Content-Type: text/plain Date: Thu, 24 Sep 2009 16:44:19 +1000 Message-Id: <1253774659.7103.405.camel@pasglop> Mime-Version: 1.0 Cc: linuxppc-dev@ozlabs.org List-Id: Linux on PowerPC Developers Mail List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , On Wed, 2009-09-23 at 17:45 -0700, Rex Feany wrote: > After upgrading to the latest kernel on my mpc875 userspace started > running incredibly slow (hours to get to a shell, even!). > I tracked it down to commit 8d30c14cab30d405a05f2aaceda1e9ad57800f36, > that patch removed a work-around for the 8xx. Adding it > back makes my problem go away. > > Signed-off-by: Rex Feany Note that while your patch applies to earlier kernels such as 2.6.31, it doesn't apply with current Linus tree due to changes in that file. I've updated the powerpc "merge" branch of my tree with a hand-modified version, please test and let me know. That should also contain the _PAGE_SPECIAL fix. You can get my tree at: git://git.kernel.org/pub/scm/linux/kernel/git/benh/powerpc.git (To get the "merge" branch, just add "merge" after the clone if you are cloning it, or just create a local branch and manually pull into it) Note that i just pushed out so it may take a little while for the kernel.org mirrors to get it, the commit ID is: ae48e383e3299c2f87723d21df2db97927774b1e Cheers, Ben. > diff --git a/arch/powerpc/mm/pgtable.c b/arch/powerpc/mm/pgtable.c > index 627767d..d8e6725 100644 > --- a/arch/powerpc/mm/pgtable.c > +++ b/arch/powerpc/mm/pgtable.c > @@ -30,6 +30,8 @@ > #include > #include > > +#include "mmu_decl.h" > + > static DEFINE_PER_CPU(struct pte_freelist_batch *, pte_freelist_cur); > static unsigned long pte_freelist_forced_free; > > @@ -119,7 +121,7 @@ void pte_free_finish(void) > /* > * Handle i/d cache flushing, called from set_pte_at() or ptep_set_access_flags() > */ > -static pte_t do_dcache_icache_coherency(pte_t pte) > +static pte_t do_dcache_icache_coherency(pte_t pte, unsigned long addr) > { > unsigned long pfn = pte_pfn(pte); > struct page *page; > @@ -128,6 +130,17 @@ static pte_t do_dcache_icache_coherency(pte_t pte) > return pte; > page = pfn_to_page(pfn); > > +#ifdef CONFIG_8xx > + /* On 8xx, cache control instructions (particularly > + * "dcbst" from flush_dcache_icache) fault as write > + * operation if there is an unpopulated TLB entry > + * for the address in question. To workaround that, > + * we invalidate the TLB here, thus avoiding dcbst > + * misbehaviour. > + */ > + _tlbil_va(addr, 0 /* 8xx doesn't care about PID */); > +#endif > + > if (!PageReserved(page) && !test_bit(PG_arch_1, &page->flags)) { > pr_devel("do_dcache_icache_coherency... flushing\n"); > flush_dcache_icache_page(page); > @@ -198,7 +211,7 @@ void set_pte_at(struct mm_struct *mm, unsigned long addr, pte_t *ptep, pte_t pte > */ > pte = __pte(pte_val(pte) & ~_PAGE_HPTEFLAGS); > if (pte_need_exec_flush(pte, 1)) > - pte = do_dcache_icache_coherency(pte); > + pte = do_dcache_icache_coherency(pte, addr); > > /* Perform the setting of the PTE */ > __set_pte_at(mm, addr, ptep, pte, 0); > @@ -216,7 +229,7 @@ int ptep_set_access_flags(struct vm_area_struct *vma, unsigned long address, > { > int changed; > if (!dirty && pte_need_exec_flush(entry, 0)) > - entry = do_dcache_icache_coherency(entry); > + entry = do_dcache_icache_coherency(entry, address); > changed = !pte_same(*(ptep), entry); > if (changed) { > if (!(vma->vm_flags & VM_HUGETLB))