From: bugzilla-daemon@freedesktop.org
Subject: [Bug 103100] Image corruptions, instability and performance regression in drm-next-wip Kernel
Date: Wed, 04 Oct 2017 16:38:21 +0000
To: dri-devel@lists.freedesktop.org
List-Id: dri-devel@lists.freedesktop.org

https://bugs.freedesktop.org/show_bug.cgi?id=103100

Bug ID:     103100
Summary:    Image corruptions, instability and performance regression in drm-next-wip Kernel
Product:    Mesa
Version:    git
Hardware:   Other
OS:         All
Status:     NEW
Severity:   normal
Priority:   medium
Component:  Drivers/Gallium/radeonsi
Assignee:   dri-devel@lists.freedesktop.org
Reporter:   gr.muench@gmail.com
QA Contact: dri-devel@lists.freedesktop.org

I'm running the current drm-next-4.15-wip kernel and use AMDGPU with a
Radeon HD 7970, with DC disabled.

The following is wrong:
-Performance in the Shadow of Mordor internal benchmark drops from 68 to 61 fps
-Other games also see a small decrease of 1-2 fps
-I see random screen corruption on my desktop
-After I exit a game, the system is unstable, screen corruption is even
more visible, and the system randomly hangs

I bisected this to:
fd8bf087dffc0bce047c5aea2afcb8f821e48db1 is the first bad commit
commit fd8bf087dffc0bce047c5aea2afcb8f821e48db1
Author: Christian König <christian.koenig@amd.com>
Date:   Tue Aug 29 16:14:32 2017 +0200

    drm/amdgpu: bump version for support of local BOs

    Signed-off-by: Christian König <christian.koenig@amd.com>
    Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

:040000 040000 440c9b026e802e50b6a25ae3b402ea57ef58a891 d31d8e8b93060b11e88f95d4d3bdcf081c77e4e2 M      drivers

This result probably does not make sense; I guess one of the previous
BO-related commits is faulty. To double-check, I used git checkout to move
between those commits and ran make clean at each step. It is still very
unusual, but maybe a developer knows what is going on.

log:

amdgpu 0000:01:00.0: GPU fault detected: 146 0x030f3d14
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0010CD18
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0F03D014
kernel: amdgpu 0000:01:00.0: VM fault (0x14, vmid 7) at page 1101080, write from '' (0x00000000) (61)
kernel: amdgpu 0000:01:00.0: GPU fault detected: 146 0x0f073d14
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0010E178
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0703D014
kernel: amdgpu 0000:01:00.0: VM fault (0x14, vmid 3) at page 1106296, write from '' (0x00000000) (61)
kernel: amdgpu 0000:01:00.0: GPU fault detected: 146 0x0e0a3d0c
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0010E670
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0A03D00C
kernel: amdgpu 0000:01:00.0: VM fault (0x0c, vmid 5) at page 1107568, read from '' (0x00000000) (61)
kernel: amdgpu 0000:01:00.0: GPU fault detected: 146 0x0e0a3d0c
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0010E673
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0A03D00C
kernel: amdgpu 0000:01:00.0: VM fault (0x0c, vmid 5) at page 1107571, read from '' (0x00000000) (61)
kernel: amdgpu 0000:01:00.0: GPU fault detected: 146 0x0e0e440c
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00104670
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E04400C
kernel: amdgpu 0000:01:00.0: VM fault (0x0c, vmid 7) at page 1066608, read from '' (0x00000000) (68)
kernel: amdgpu 0000:01:00.0: GPU fault detected: 146 0x0c0f3d14
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00101960
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0F03D014
kernel: amdgpu 0000:01:00.0: VM fault (0x14, vmid 7) at page 1055072, write from '' (0x00000000) (61)
kernel: amdgpu 0000:01:00.0: GPU fault detected: 146 0x0e0b3d14
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x001017F0
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0B03D014
kernel: amdgpu 0000:01:00.0: VM fault (0x14, vmid 5) at page 1054704, write from '' (0x00000000) (61)
kernel: amdgpu 0000:01:00.0: GPU fault detected: 146 0x02073d14
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00112F90
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0703D014
kernel: amdgpu 0000:01:00.0: VM fault (0x14, vmid 3) at page 1117760, write from '' (0x00000000) (61)
kernel: amdgpu 0000:01:00.0: GPU fault detected: 146 0x08073d14
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00110E40
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0703D014
kernel: amdgpu 0000:01:00.0: VM fault (0x14, vmid 3) at page 1117760, write from '' (0x00000000) (61)
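
For anyone triaging the log above, here is a minimal Python sketch that pairs each FAULT_ADDR with its FAULT_STATUS and decodes the status word. The bit layout is an assumption inferred only from the values in this log (the decoded fields match the "VM fault (...)" lines printed by the kernel), not taken from the register specification. The helper names decode_fault_status and parse_faults are hypothetical. The sketch also assumes each register value sits on a single line.

```python
import re

# Field layout inferred from the log values in this report (an assumption,
# not taken from the hardware register specification):
#   bits 0-7    protection flags (the 0x14 / 0x0c in "VM fault (0x14, ...)")
#   bits 12-19  memory client id (the trailing "(61)" / "(68)")
#   bit  24     1 = write, 0 = read
#   bits 25-28  VMID
def decode_fault_status(status):
    return {
        "protections": status & 0xFF,
        "client_id": (status >> 12) & 0xFF,
        "access": "write" if (status >> 24) & 0x1 else "read",
        "vmid": (status >> 25) & 0xF,
    }

REG_RE = re.compile(
    r"VM_CONTEXT1_PROTECTION_FAULT_(ADDR|STATUS)\s+0x([0-9A-Fa-f]+)"
)

def parse_faults(log_text):
    """Return one dict per ADDR/STATUS pair found in log_text."""
    faults = []
    current = {}
    for name, value in REG_RE.findall(log_text):
        current[name] = int(value, 16)
        # A STATUS line closes the pair opened by the preceding ADDR line.
        if name == "STATUS" and "ADDR" in current:
            fault = {"page": current["ADDR"]}
            fault.update(decode_fault_status(current["STATUS"]))
            faults.append(fault)
            current = {}
    return faults

# Two fault records copied from the log in this report.
SAMPLE = """\
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0010CD18
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0F03D014
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00104670
kernel: amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E04400C
"""

if __name__ == "__main__":
    for fault in parse_faults(SAMPLE):
        print(fault)
```

The page number in each "VM fault" line is simply the FAULT_ADDR value in decimal (0x0010CD18 is 1101080), so grouping the parsed records by vmid or client_id is a quick way to see whether the faults cluster on one client.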


You are receiving this mail because:
  • You are the assignee for the bug.