From: Tom St Denis <tstdenis-5C7GfCeVMHo@public.gmane.org>
To: Huang Rui <ray.huang-5C7GfCeVMHo@public.gmane.org>,
	"Koenig, Christian" <Christian.Koenig-5C7GfCeVMHo@public.gmane.org>
Cc: "StDenis, Tom" <Tom.StDenis-5C7GfCeVMHo@public.gmane.org>,
	"Deucher, Alexander" <Alexander.Deucher-5C7GfCeVMHo@public.gmane.org>,
	"Daenzer, Michel" <Michel.Daenzer-5C7GfCeVMHo@public.gmane.org>,
	"amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org"
	<amd-gfx-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org>
Subject: Re: two KASANs in TTM logic
Date: Sat, 8 Sep 2018 06:40:03 -0400
Message-ID: <a8a67cc9-5ef8-4f6c-8f67-d5c29908c874@amd.com>
In-Reply-To: <20180908092300.GB25048@hr-amur2>



On 09/08/2018 05:23 AM, Huang Rui wrote:
> On Fri, Sep 07, 2018 at 04:59:11PM +0800, Christian König wrote:
>> Hi Ray,
>>
>> in the meantime can we disable the feature once more in the kernel until
>> we have hammered out all possible corner cases?
> 
> That's fine. For now, we have to disable it again. I will do more testing
> and try to reproduce Tom's issue first.
> 
>>
>> As Tom figured out, commenting out the assignment that sets "bulk_moveable"
>> to true should be enough.
> 
> I saw you already removed the "bulk_moveable = true" in amdgpu_vm_init(). Do
> you mean we should also comment out the one in amdgpu_vm_move_to_lru_tail() to
> disable bulk_move totally for the moment?

Hi Ray,

I just commented out the assignment of true.

Tom


> 
> Thanks,
> Ray
> 
>>
>> Thanks,
>> Christian.
>>
>> On 07.09.2018 at 08:51, Huang, Ray wrote:
>>> Hi Tom,
>>>
>>> Thanks for tracing this issue.  I am trying to reproduce it on amd-staging-drm-next with piglit.
>>> May I know the steps/configurations to reproduce it?
>>>
>>> Thanks,
>>> Ray
>>>
>>> -----Original Message-----
>>> From: amd-gfx <amd-gfx-bounces@lists.freedesktop.org> On Behalf Of Tom St Denis
>>> Sent: Wednesday, September 5, 2018 9:27 PM
>>> To: Koenig, Christian <Christian.Koenig@amd.com>; Daenzer, Michel <Michel.Daenzer@amd.com>; amd-gfx@lists.freedesktop.org; Deucher, Alexander <Alexander.Deucher@amd.com>
>>> Subject: Re: two KASANs in TTM logic
>>>
>>> Logs attached.
>>>
>>> Tom
>>>
>>>
>>>
>>> On 09/05/2018 08:02 AM, Christian König wrote:
>>>> Still not the slightest idea what is causing this and the patch
>>>> definitely fixes things a lot.
>>>>
>>>> Can you try to enable list debugging in your kernel?
>>>>
>>>> Thanks,
>>>> Christian.
>>>>
>>>> On 04.09.2018 at 19:18, Tom St Denis wrote:
>>>>> Sure:
>>>>>
>>>>> d2917f399e0b250f47d07da551a335843a24f835 is the first bad commit
>>>>> commit d2917f399e0b250f47d07da551a335843a24f835
>>>>> Author: Christian König <christian.koenig@amd.com>
>>>>> Date:   Thu Aug 30 10:04:53 2018 +0200
>>>>>
>>>>>       drm/amdgpu: fix "use bulk moves for efficient VM LRU handling" v2
>>>>>
>>>>>       First step to fix the LRU corruption, we accidentially tried to move things
>>>>>       on the LRU after dropping the lock.
>>>>>
>>>>>       Signed-off-by: Christian König <christian.koenig@amd.com>
>>>>>       Tested-by: Michel Dänzer <michel.daenzer@amd.com>
>>>>>
>>>>> :040000 040000 ed5be1ad4da129c4154b2b43acf7ef349a470700 0008c4e2fb56512f41559618dd474c916fc09a37 M      drivers
>>>>>
>>>>>
>>>>> With the commit before that, I can run xonotic-glx and piglit on my
>>>>> Carrizo without a KASAN report.
>>>>>
>>>>> Tom
>>>>>
>>>>> On 09/04/2018 10:05 AM, Christian König wrote:
>>>>>> The first one should already be fixed.
>>>>>>
>>>>>> Not sure where the second comes from. Can you narrow that down further?
>>>>>>
>>>>>> Christian.
>>>>>>
>>>>>> On 04.09.2018 at 15:46, Tom St Denis wrote:
>>>>>>> The first is caused by this commit while running a GL-heavy application.
>>>>>>>
>>>>>>> d78c1fa0c9f815fe951fd57001acca3d35262a17 is the first bad commit
>>>>>>> commit d78c1fa0c9f815fe951fd57001acca3d35262a17
>>>>>>> Author: Michel Dänzer <michel.daenzer@amd.com>
>>>>>>> Date:   Wed Aug 29 11:59:38 2018 +0200
>>>>>>>
>>>>>>>       Revert "drm/amdgpu: move PD/PT bos on LRU again"
>>>>>>>
>>>>>>>       This reverts commit 31625ccae4464b61ec8cdb9740df848bbc857a5b.
>>>>>>>
>>>>>>>       It triggered various badness on my development machine when running the
>>>>>>>       piglit gpu profile with radeonsi on Bonaire, looks like memory
>>>>>>>       corruption due to insufficiently protected list manipulations.
>>>>>>>
>>>>>>>       Signed-off-by: Michel Dänzer <michel.daenzer@amd.com>
>>>>>>>       Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
>>>>>>>
>>>>>>> :040000 040000 b7169f0cf0c7decec631751a9896a92badb67f9d 42ea58f43199d26fc0c7ddcc655e6d0964b81817 M      drivers
>>>>>>>
>>>>>>> The second is caused by something between that and the tip of the
>>>>>>> 4.19-rc1 amd-staging-drm-next (I haven't pinned it down yet) while
>>>>>>> loading GNOME.
>>>>>>>
>>>>>>> Tom
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> _______________________________________________
>>>>>>> amd-gfx mailing list
>>>>>>> amd-gfx@lists.freedesktop.org
>>>>>>> https://lists.freedesktop.org/mailman/listinfo/amd-gfx