From: Rob Clark <robdclark-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> To: Sricharan <sricharan-sgV2jX0FEOL9JmXXK+q4OQ@public.gmane.org> Cc: "devicetree-u79uwXL29TY76Z2rM5mHXA@public.gmane.org" <devicetree-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>, Archit Taneja <architt-sgV2jX0FEOL9JmXXK+q4OQ@public.gmane.org>, Arnd Bergmann <arnd-r2nGTMty4D4@public.gmane.org>, linux-arm-msm <linux-arm-msm-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>, "iommu-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org" <iommu-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org>, Srinivas Kandagatla <srinivas.kandagatla-QSEj5FYQhm4dnm+yROfE0A@public.gmane.org>, Laurent Pinchart <laurent.pinchart-ryLnwIuWjnjg/C1BVhZhaw@public.gmane.org>, Thierry Reding <treding-DDmLM1+adcrQT0dZR+AlfA@public.gmane.org>, "linux-arm-kernel-IAPFreCvJWM7uuMidbF8XUB+6BGkLq7r@public.gmane.org" <linux-arm-kernel-IAPFreCvJWM7uuMidbF8XUB+6BGkLq7r@public.gmane.org>, stepanm-sgV2jX0FEOL9JmXXK+q4OQ@public.gmane.org Subject: Re: [PATCH V6 0/6] iommu/msm: Add DT adaptation and generic bindings support Date: Fri, 12 Aug 2016 11:17:17 -0400 [thread overview] Message-ID: <CAF6AEGvy4efaA4PSYNAJ3gDzxP9xX4b32uQDCfh4b=gtnqJpFw@mail.gmail.com> (raw) In-Reply-To: <CAF6AEGs=m5qZJhBqBa90RS-1aO1zb3w_U=izF3MjyvMOPGSJ+A-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> On Fri, Aug 12, 2016 at 10:40 AM, Rob Clark <robdclark-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wrote: > On Fri, Aug 12, 2016 at 10:32 AM, Rob Clark <robdclark-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wrote: >> On Fri, Aug 12, 2016 at 9:48 AM, Sricharan <sricharan-sgV2jX0FEOL9JmXXK+q4OQ@public.gmane.org> wrote: >>> Hi, >>> >>>>>>>>> btw, the current state, at least on linaro integration branch, fault >>>>>>>>> handling doesn't work so well (ie. device never gets resumed).. which >>>>>>>>> is a bit unfortunate for a gpu (and results in a *lot* of rebooting on >>>>>>>>> my part when debugging userspace). I haven't had time yet to compare >>>>>>>>> to the ancient downstream driver, but not sure if you have any ideas? >>>>>>>>> >>>>>>>>> I guess probably disabling stall on fault would help. But I'm not >>>>>>>>> even getting the "Fault occurred in context.." prints. Seeing the >>>>>>>>> fault iova is pretty useful since that plus gpu cmdstream trace helps >>>>>>>>> me figure out which texture/etc is being accessed out of bounds. >>>>>>>> >>>>>>>>fyi, it looks like it is not getting any fault irq.. it's *possible* >>>>>>>>that I screwed up the irq #'s when translating from downstream, so you >>>>>>>>might want to double check that. I thought I had it right, I assume I >>>>>>>>would have noticed during piglit runs if fault recovery wasn't working >>>>>>>>(since the result is that *everything* after the faulting test would >>>>>>>>have failed since gpu is wedged with no access to memory), but it was >>>>>>>>long enough ago that I can't claim that definitively. >>>>>>>> >>>>>>>>If you need an easy way to trigger a gpu fault, msmtest is a good way, >>>>>>>>change this line: >>>>>>>> >>>>>>>> https://github.com/freedreno/msmtest/blob/master/msmtest.c#L247 >>>>>>>> >>>>>>>>from OUT_RELOC() to OUT_RING(ring, 0x00000000) will trigger a fault. >>>>>>>> >>>>>>> So for the irq to be triggered, 'non-secure' irq line has to be >>>>>>> populated in DT. There is a 'secure'and 'non-secure' irq lines for these iommus >>>>>>> and non-secure irq number is secure + 1. I tested this by having a 'return 0' >>>>>>> from the msm_iommu_map (no mapping), and the faults were getting triggered. >>>>>>> >>>>>>> Can you share me your dts data ? >>>>>>> >>>>>> >>>>>>I think this is what you want: >>>>>> >>>>>>https://github.com/freedreno/kernel-msm/blob/integration-linux-qcomlt/arch/arm/boot/dts/qcom-apq8064.dtsi#L2008 >>>>>> >>>>>>I haven't tested a display fault, so I suppose it is possible that >>>>>>irq's are working for some iommu instances but not others? >>>>> >>>>> So in your DT, for gfx3d, the non-secure line is '70' and not '69' (This is secure) . >>>>> Infact only '70' should be populated. The driver sets the irq line based on resource 0. >>>>> This applies for all iommu nodes in your DT. (only the second irq line is needed). >>>> >>>>ahh, that would explain. >>>> >>>>Is it better to remove the extra entry, or should I just swap them >>>>all? Ie. might there be some point in the future where the driver >>>>would want both? >>> I feel better to have one. Not sure why the secure irq was added in first >>> place in the downstream data and it would setup/handled by the TZ >> >> Ok, getting further.. still seems like the gpu is not getting resumed, >> but at least we are getting a fault interrupt.. > > > ok, seems like we need: > > ------- > diff --git a/drivers/iommu/msm_iommu.c b/drivers/iommu/msm_iommu.c > index b09692b..f6f596f 100644 > --- a/drivers/iommu/msm_iommu.c > +++ b/drivers/iommu/msm_iommu.c > @@ -628,6 +628,7 @@ irqreturn_t msm_iommu_fault_handler(int irq, void *dev_id) > pr_err("Interesting registers:\n"); > print_ctx_regs(iommu->base, i); > SET_FSR(iommu->base, i, 0x4000000F); > + SET_RESUME(iommu->base, i, 1); > } > } > __disable_clocks(iommu); > ------- > > with that it seems like it kinda/mostly recovers, although there is > still a bit of strangeness.. ok, the "still a bit of strangeness" was a false alarm (unrelated problem).. I'll send a patch to add the SET_RESUME(). And also one to wire up fault reporting. When things go wrong we can get thousands of faults from the GPU, so I'd rather it hit my simpler ratelimited print from the fault handler ;-) BR, -R > BR, > -R
WARNING: multiple messages have this Message-ID (diff)
From: robdclark@gmail.com (Rob Clark) To: linux-arm-kernel@lists.infradead.org Subject: [PATCH V6 0/6] iommu/msm: Add DT adaptation and generic bindings support Date: Fri, 12 Aug 2016 11:17:17 -0400 [thread overview] Message-ID: <CAF6AEGvy4efaA4PSYNAJ3gDzxP9xX4b32uQDCfh4b=gtnqJpFw@mail.gmail.com> (raw) In-Reply-To: <CAF6AEGs=m5qZJhBqBa90RS-1aO1zb3w_U=izF3MjyvMOPGSJ+A@mail.gmail.com> On Fri, Aug 12, 2016 at 10:40 AM, Rob Clark <robdclark@gmail.com> wrote: > On Fri, Aug 12, 2016 at 10:32 AM, Rob Clark <robdclark@gmail.com> wrote: >> On Fri, Aug 12, 2016 at 9:48 AM, Sricharan <sricharan@codeaurora.org> wrote: >>> Hi, >>> >>>>>>>>> btw, the current state, at least on linaro integration branch, fault >>>>>>>>> handling doesn't work so well (ie. device never gets resumed).. which >>>>>>>>> is a bit unfortunate for a gpu (and results in a *lot* of rebooting on >>>>>>>>> my part when debugging userspace). I haven't had time yet to compare >>>>>>>>> to the ancient downstream driver, but not sure if you have any ideas? >>>>>>>>> >>>>>>>>> I guess probably disabling stall on fault would help. But I'm not >>>>>>>>> even getting the "Fault occurred in context.." prints. Seeing the >>>>>>>>> fault iova is pretty useful since that plus gpu cmdstream trace helps >>>>>>>>> me figure out which texture/etc is being accessed out of bounds. >>>>>>>> >>>>>>>>fyi, it looks like it is not getting any fault irq.. it's *possible* >>>>>>>>that I screwed up the irq #'s when translating from downstream, so you >>>>>>>>might want to double check that. I thought I had it right, I assume I >>>>>>>>would have noticed during piglit runs if fault recovery wasn't working >>>>>>>>(since the result is that *everything* after the faulting test would >>>>>>>>have failed since gpu is wedged with no access to memory), but it was >>>>>>>>long enough ago that I can't claim that definitively. >>>>>>>> >>>>>>>>If you need an easy way to trigger a gpu fault, msmtest is a good way, >>>>>>>>change this line: >>>>>>>> >>>>>>>> https://github.com/freedreno/msmtest/blob/master/msmtest.c#L247 >>>>>>>> >>>>>>>>from OUT_RELOC() to OUT_RING(ring, 0x00000000) will trigger a fault. >>>>>>>> >>>>>>> So for the irq to be triggered, 'non-secure' irq line has to be >>>>>>> populated in DT. There is a 'secure'and 'non-secure' irq lines for these iommus >>>>>>> and non-secure irq number is secure + 1. I tested this by having a 'return 0' >>>>>>> from the msm_iommu_map (no mapping), and the faults were getting triggered. >>>>>>> >>>>>>> Can you share me your dts data ? >>>>>>> >>>>>> >>>>>>I think this is what you want: >>>>>> >>>>>>https://github.com/freedreno/kernel-msm/blob/integration-linux-qcomlt/arch/arm/boot/dts/qcom-apq8064.dtsi#L2008 >>>>>> >>>>>>I haven't tested a display fault, so I suppose it is possible that >>>>>>irq's are working for some iommu instances but not others? >>>>> >>>>> So in your DT, for gfx3d, the non-secure line is '70' and not '69' (This is secure) . >>>>> Infact only '70' should be populated. The driver sets the irq line based on resource 0. >>>>> This applies for all iommu nodes in your DT. (only the second irq line is needed). >>>> >>>>ahh, that would explain. >>>> >>>>Is it better to remove the extra entry, or should I just swap them >>>>all? Ie. might there be some point in the future where the driver >>>>would want both? >>> I feel better to have one. Not sure why the secure irq was added in first >>> place in the downstream data and it would setup/handled by the TZ >> >> Ok, getting further.. still seems like the gpu is not getting resumed, >> but at least we are getting a fault interrupt.. > > > ok, seems like we need: > > ------- > diff --git a/drivers/iommu/msm_iommu.c b/drivers/iommu/msm_iommu.c > index b09692b..f6f596f 100644 > --- a/drivers/iommu/msm_iommu.c > +++ b/drivers/iommu/msm_iommu.c > @@ -628,6 +628,7 @@ irqreturn_t msm_iommu_fault_handler(int irq, void *dev_id) > pr_err("Interesting registers:\n"); > print_ctx_regs(iommu->base, i); > SET_FSR(iommu->base, i, 0x4000000F); > + SET_RESUME(iommu->base, i, 1); > } > } > __disable_clocks(iommu); > ------- > > with that it seems like it kinda/mostly recovers, although there is > still a bit of strangeness.. ok, the "still a bit of strangeness" was a false alarm (unrelated problem).. I'll send a patch to add the SET_RESUME(). And also one to wire up fault reporting. When things go wrong we can get thousands of faults from the GPU, so I'd rather it hit my simpler ratelimited print from the fault handler ;-) BR, -R > BR, > -R
next prev parent reply other threads:[~2016-08-12 15:17 UTC|newest] Thread overview: 38+ messages / expand[flat|nested] mbox.gz Atom feed top 2016-06-13 11:36 [PATCH V6 0/6] iommu/msm: Add DT adaptation and generic bindings support Sricharan R 2016-06-13 11:36 ` Sricharan R [not found] ` <1465817767-9856-1-git-send-email-sricharan-sgV2jX0FEOL9JmXXK+q4OQ@public.gmane.org> 2016-06-13 11:36 ` [PATCH V6 1/6] iommu/msm: Add DT adaptation Sricharan R 2016-06-13 11:36 ` Sricharan R 2016-06-13 11:36 ` [PATCH V6 2/6] documentation: iommu: Add bindings for msm, iommu-v0 ip Sricharan R 2016-06-13 11:36 ` Sricharan R 2016-06-13 11:36 ` [PATCH V6 3/6] iommu/msm: Move the contents from msm_iommu_dev.c to msm_iommu.c Sricharan R 2016-06-13 11:36 ` Sricharan R 2016-06-13 11:36 ` [PATCH V6 4/6] iommu/msm: Add support for generic master bindings Sricharan R 2016-06-13 11:36 ` Sricharan R 2016-06-13 11:36 ` [PATCH V6 5/6] iommu/msm: use generic ARMV7S short descriptor pagetable ops Sricharan R 2016-06-13 11:36 ` Sricharan R 2016-06-13 11:36 ` [PATCH V6 6/6] iommu/msm: Remove driver BROKEN Sricharan R 2016-06-13 11:36 ` Sricharan R 2016-06-21 12:04 ` [PATCH V6 0/6] iommu/msm: Add DT adaptation and generic bindings support Joerg Roedel 2016-06-21 12:04 ` Joerg Roedel 2016-08-11 20:11 ` Rob Clark 2016-08-11 20:11 ` Rob Clark [not found] ` <CAF6AEGsF5M1sfwaXKW0x=Mu=nQJR-9+xqrtXd=7Vz_+6G_u25A-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2016-08-11 22:06 ` Rob Clark 2016-08-11 22:06 ` Rob Clark 2016-08-12 7:00 ` Sricharan 2016-08-12 7:00 ` Sricharan 2016-08-12 12:13 ` Rob Clark 2016-08-12 12:13 ` Rob Clark [not found] ` <CAF6AEGsRVULNpSe_s460biNhV0dRaKLv6tf2Y244HEjGVOV3bw-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2016-08-12 13:03 ` Sricharan 2016-08-12 13:03 ` Sricharan 2016-08-12 13:28 ` Rob Clark 2016-08-12 13:28 ` Rob Clark [not found] ` <CAF6AEGuitSWt9dhXrMfBDqAkXzpZObuj6A_8XxL+TVSWnbkwaQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2016-08-12 13:48 ` Sricharan 2016-08-12 13:48 ` Sricharan 2016-08-12 14:32 ` Rob Clark 2016-08-12 14:32 ` Rob Clark [not found] ` <CAF6AEGuagnVu_cxkHB4vjBT_=XSpHyWAkKdB8=++z-FBnvMB3A-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2016-08-12 14:40 ` Rob Clark 2016-08-12 14:40 ` Rob Clark 2016-08-12 15:16 ` Sricharan 2016-08-12 15:16 ` Sricharan [not found] ` <CAF6AEGs=m5qZJhBqBa90RS-1aO1zb3w_U=izF3MjyvMOPGSJ+A-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org> 2016-08-12 15:17 ` Rob Clark [this message] 2016-08-12 15:17 ` Rob Clark
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to='CAF6AEGvy4efaA4PSYNAJ3gDzxP9xX4b32uQDCfh4b=gtnqJpFw@mail.gmail.com' \ --to=robdclark-re5jqeeqqe8avxtiumwx3w@public.gmane.org \ --cc=architt-sgV2jX0FEOL9JmXXK+q4OQ@public.gmane.org \ --cc=arnd-r2nGTMty4D4@public.gmane.org \ --cc=devicetree-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \ --cc=iommu-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org \ --cc=laurent.pinchart-ryLnwIuWjnjg/C1BVhZhaw@public.gmane.org \ --cc=linux-arm-kernel-IAPFreCvJWM7uuMidbF8XUB+6BGkLq7r@public.gmane.org \ --cc=linux-arm-msm-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \ --cc=sricharan-sgV2jX0FEOL9JmXXK+q4OQ@public.gmane.org \ --cc=srinivas.kandagatla-QSEj5FYQhm4dnm+yROfE0A@public.gmane.org \ --cc=stepanm-sgV2jX0FEOL9JmXXK+q4OQ@public.gmane.org \ --cc=treding-DDmLM1+adcrQT0dZR+AlfA@public.gmane.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.