* [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup
@ 2017-08-31 20:44 bugzilla-daemon
  2017-08-31 21:58 ` bugzilla-daemon
                   ` (23 more replies)
  0 siblings, 24 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-08-31 20:44 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 1671 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

            Bug ID: 102500
           Summary: [polaris10][amd-staging-4.12] GPU fault detected,
                    somethimes lockup
           Product: DRI
           Version: DRI git
          Hardware: x86-64 (AMD64)
                OS: Linux (All)
            Status: NEW
          Severity: normal
          Priority: medium
         Component: DRM/AMDgpu
          Assignee: dri-devel@lists.freedesktop.org
          Reporter: arek.rusi@gmail.com

Created attachment 133914
  --> https://bugs.freedesktop.org/attachment.cgi?id=133914&action=edit
dmesg - start gnome 3 session only

Hi,
After today's kernel update, Witcher 3 hangs. After a restart (with only a Gnome3
session running), I see a lot of the following in the kernel log:
...
amdgpu 0000:01:00.0: GPU fault detected: 146 0x0d023d14
amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x001061A0
amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0903D014
amdgpu 0000:01:00.0: VM fault (0x14, vmid 4) at page 1073568, write from 'SDM1'
(0x53444d31) (61)
...
Downgrading the kernel resolves this issue.
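
A note on reading fault lines like the ones above: the page number in the "VM fault" summary is the decimal form of the VM_CONTEXT1_PROTECTION_FAULT_ADDR value, and the hex number after the client name is that name packed as four ASCII bytes. The following is a minimal Python sketch that checks this for the line quoted in this report. It assumes the line format shown above; the regular expression and the function name are illustrative and are not part of the driver.

```python
import re

# Pattern for the "VM fault" summary line as quoted in this report.
VM_FAULT_RE = re.compile(
    r"VM fault \((?P<status>0x[0-9a-fA-F]+), vmid (?P<vmid>\d+)\) "
    r"at page (?P<page>\d+), (?P<access>read|write) from "
    r"'(?P<client>[^']*)' \((?P<client_hex>0x[0-9a-fA-F]+)\)"
)


def decode_vm_fault(line: str) -> dict:
    """Extract the fields of an amdgpu 'VM fault' summary line."""
    match = VM_FAULT_RE.search(line)
    if match is None:
        raise ValueError("not a VM fault line")
    client_id = int(match["client_hex"], 16)
    return {
        "vmid": int(match["vmid"]),
        "page": int(match["page"]),
        # Same number in hex, for comparison with the FAULT_ADDR register dump.
        "page_hex": hex(int(match["page"])),
        "access": match["access"],
        "client": match["client"],
        # The hex value holds the client tag as four big-endian ASCII bytes.
        "client_from_hex": client_id.to_bytes(4, "big").decode("ascii"),
    }


line = ("amdgpu 0000:01:00.0: VM fault (0x14, vmid 4) at page 1073568, "
        "write from 'SDM1' (0x53444d31) (61)")
fault = decode_vm_fault(line)
print(fault["page_hex"], fault["client_from_hex"])  # 0x1061a0 SDM1
```

The printed address matches the 0x001061A0 in the register dump, so the two lines describe the same faulting page.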

I use a binary kernel from an unofficial Arch repo:
linux-amd-staging 4.12.0.680862.7be0a528b097 - the bad one
linux-amd-staging-4.12.0.680853.eeb9985d7228 - works great

These kernels are built from Alex's git tree, branch amd-staging-4.12.

OpenGL renderer string: AMD Radeon (TM) RX 470 Graphics (POLARIS10 / DRM 3.18.0
/ 4.13.0-rc7-mainline, LLVM 6.0.0)
OpenGL core profile version string: 4.5 (Core Profile) Mesa 17.3.0-devel
(git-2d93b462b4)

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 3125 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
@ 2017-08-31 21:58 ` bugzilla-daemon
  2017-09-01  9:55 ` bugzilla-daemon
                   ` (22 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-08-31 21:58 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 212 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #1 from Alex Deucher <alexdeucher@gmail.com> ---
can you bisect?


* [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
  2017-08-31 21:58 ` bugzilla-daemon
@ 2017-09-01  9:55 ` bugzilla-daemon
  2017-09-01 10:04 ` bugzilla-daemon
                   ` (21 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-01  9:55 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 795 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #2 from Arek Ruśniak <arek.rusi@gmail.com> ---
OK, sometimes the GNOME session refuses to work, or it even crashes (the LEDs on
my keyboard blink).

1753d85bc82849deeb68cb5d7883207f0acbddc4 is the first bad commit
commit 1753d85bc82849deeb68cb5d7883207f0acbddc4
Author: Christian König <christian.koenig@amd.com>
Date:   Tue Aug 29 16:14:32 2017 +0200

    drm/amdgpu: bump version for support of local BOs

    Signed-off-by: Christian König <christian.koenig@amd.com>
    Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>

:040000 040000 fb4af6a5aa54bac7afddeb83db09105ce7dab3e5
b30f9c48abef6ddd2bd3a34dd447c0543ca1b29e M      drivers
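
For context on how a bisect arrives at a "first bad commit" line like the one above, here is a minimal Python sketch of the underlying binary search. It is not how git implements `git bisect`; it assumes a linear history in which every commit after the first bad one is also bad. The history list is made up apart from the three identifiers that appear in this thread, and `c1`, `c2`, `c4` are hypothetical placeholders.

```python
from typing import Callable, Sequence


def first_bad(commits: Sequence[str], is_bad: Callable[[str], bool]) -> str:
    """Return the first bad commit.

    Assumes commits[0] is known good, commits[-1] is known bad, and the
    history is linear with a single good-to-bad transition.
    """
    good, bad = 0, len(commits) - 1
    while bad - good > 1:
        mid = (good + bad) // 2
        if is_bad(commits[mid]):
            bad = mid   # the transition is at or before mid
        else:
            good = mid  # the transition is after mid
    return commits[bad]


# Hypothetical history: only the endpoints and the culprit come from the thread.
history = ["eeb9985d7228", "c1", "c2", "1753d85bc828", "c4", "7be0a528b097"]
broken_since = history.index("1753d85bc828")
tested = []


def is_bad(commit: str) -> bool:
    """Stand-in for 'build this kernel, boot it, and look for VM faults'."""
    tested.append(commit)
    return history.index(commit) >= broken_since


culprit = first_bad(history, is_bad)
print(culprit, tested)
```

Each call to `is_bad` corresponds to one build-and-boot cycle, which is why the number of test steps grows only logarithmically with the number of commits.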


* [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
  2017-08-31 21:58 ` bugzilla-daemon
  2017-09-01  9:55 ` bugzilla-daemon
@ 2017-09-01 10:04 ` bugzilla-daemon
  2017-09-01 15:37 ` bugzilla-daemon
                   ` (20 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-01 10:04 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 452 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #3 from Arek Ruśniak <arek.rusi@gmail.com> ---
Created attachment 133923
  --> https://bugs.freedesktop.org/attachment.cgi?id=133923&action=edit
dmesg for first bad commit

Autostart through gdm into gnome3-session failed, so I booted into multi-user
mode (no VM faults yet) and then started fluxbox from a tty.


* [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (2 preceding siblings ...)
  2017-09-01 10:04 ` bugzilla-daemon
@ 2017-09-01 15:37 ` bugzilla-daemon
  2017-09-01 21:36 ` bugzilla-daemon
                   ` (19 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-01 15:37 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 785 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #4 from Dieter Nützel <Dieter@nuetzel-hh.de> ---
(In reply to Alex Deucher from comment #1)
> can you bisect?

Hello all,

I get the same on 'amd-staging-drm-next' since the 1 Sep update (kernel build
time: 1 Sep 02:14 CEST), too. I will bisect in the evening.

[  262.462941] amdgpu 0000:01:00.0: GPU fault detected: 146 0x0a023d14
[  262.462946] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00101540
[  262.462949] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0903D014
[  262.462952] amdgpu 0000:01:00.0: VM fault (0x14, vmid 4) at page 1054016, write from 'SDM1' (0x53444d31) (61)


* [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (3 preceding siblings ...)
  2017-09-01 15:37 ` bugzilla-daemon
@ 2017-09-01 21:36 ` bugzilla-daemon
  2017-09-02  7:37 ` bugzilla-daemon
                   ` (18 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-01 21:36 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 1588 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

Dieter Nützel <Dieter@nuetzel-hh.de> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |deathsimple@vodafone.de

--- Comment #5 from Dieter Nützel <Dieter@nuetzel-hh.de> ---
(In reply to Dieter Nützel from comment #4)
> (In reply to Alex Deucher from comment #1)
> > can you bisect?
> 
> Hello all,
> 
> I get the same on 'amd-staging-drm-next' since 1. of Sep (kernel build time:
> 1. Sep 02:14 CEST) update, too. Will go to bisect in the evening.
> 
> [  262.462941] amdgpu 0000:01:00.0: GPU fault detected: 146 0x0a023d14
> [  262.462946] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00101540
> [  262.462949] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0903D014
> [  262.462952] amdgpu 0000:01:00.0: VM fault (0x14, vmid 4) at page 1054016, write from 'SDM1' (0x53444d31) (61)

Yes,

git revert fd8bf087dffc

commit fd8bf087dffc0bce047c5aea2afcb8f821e48db1
Author: Christian König <christian.koenig@amd.com>
Date:   Tue Aug 29 16:14:32 2017 +0200

    drm/amdgpu: bump version for support of local BOs

    Signed-off-by: Christian König <christian.koenig@amd.com>
    Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

Solves it on 'amd-staging-drm-next', too.


* [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (4 preceding siblings ...)
  2017-09-01 21:36 ` bugzilla-daemon
@ 2017-09-02  7:37 ` bugzilla-daemon
  2017-09-02  7:38 ` bugzilla-daemon
                   ` (17 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-02  7:37 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 315 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #6 from Arek Ruśniak <arek.rusi@gmail.com> ---
Today's build is fine. There are no VM faults anymore.
Thanks for the fix.
Dieter, could you confirm that for the staging-next tree?


* [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (5 preceding siblings ...)
  2017-09-02  7:37 ` bugzilla-daemon
@ 2017-09-02  7:38 ` bugzilla-daemon
  2017-09-02 10:30 ` bugzilla-daemon
                   ` (16 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-02  7:38 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 276 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #7 from Christian König <deathsimple@vodafone.de> ---
Does patch "drm/amdgpu: fix moved list handling in the VM" fix the issue?


* [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (6 preceding siblings ...)
  2017-09-02  7:38 ` bugzilla-daemon
@ 2017-09-02 10:30 ` bugzilla-daemon
  2017-09-03 16:34 ` bugzilla-daemon
                   ` (15 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-02 10:30 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 368 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #8 from Arek Ruśniak <arek.rusi@gmail.com> ---
Something went wrong in my build environment, or I just got lucky with the
earlier test. There is no fix.
"GPU fault detected" is still happening.

Sorry for the inconvenience.


* [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (7 preceding siblings ...)
  2017-09-02 10:30 ` bugzilla-daemon
@ 2017-09-03 16:34 ` bugzilla-daemon
  2017-09-04 19:31 ` [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] " bugzilla-daemon
                   ` (14 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-03 16:34 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 1487 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #9 from Vedran Miletić <vedran@miletic.net> ---
(In reply to Dieter Nützel from comment #5)
> (In reply to Dieter Nützel from comment #4)
> > (In reply to Alex Deucher from comment #1)
> > > can you bisect?
> > 
> > Hello all,
> > 
> > I get the same on 'amd-staging-drm-next' since 1. of Sep (kernel build time:
> > 1. Sep 02:14 CEST) update, too. Will go to bisect in the evening.
> > 
> > [  262.462941] amdgpu 0000:01:00.0: GPU fault detected: 146 0x0a023d14
> > [  262.462946] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00101540
> > [  262.462949] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0903D014
> > [  262.462952] amdgpu 0000:01:00.0: VM fault (0x14, vmid 4) at page 1054016, write from 'SDM1' (0x53444d31) (61)
> 
> Yes,
> 
> git revert fd8bf087dffc
> 
> commit fd8bf087dffc0bce047c5aea2afcb8f821e48db1
> Author: Christian König <christian.koenig@amd.com>
> Date:   Tue Aug 29 16:14:32 2017 +0200
> 
>     drm/amdgpu: bump version for support of local BOs
>     
>     Signed-off-by: Christian König <christian.koenig@amd.com>
>     Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
>     Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
> 
> Solve it on 'amd-staging-drm-next', too.

Confirmed fixed by reverting on Vega 10.


* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (8 preceding siblings ...)
  2017-09-03 16:34 ` bugzilla-daemon
@ 2017-09-04 19:31 ` bugzilla-daemon
  2017-09-07 20:13 ` bugzilla-daemon
                   ` (13 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-04 19:31 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 733 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

Vedran Miletić <vedran@miletic.net> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |vedran@miletic.net
            Summary|[polaris10][amd-staging-4.1 |[polaris10,
                   |2] GPU fault detected,      |vega10][amd-staging-4.12,
                   |somethimes lockup           |amd-staging-drm-next] GPU
                   |                            |fault detected, somethimes
                   |                            |lockup


* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (9 preceding siblings ...)
  2017-09-04 19:31 ` [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] " bugzilla-daemon
@ 2017-09-07 20:13 ` bugzilla-daemon
  2017-09-07 23:23 ` bugzilla-daemon
                   ` (12 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-07 20:13 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 516 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #10 from Arek Ruśniak <arek.rusi@gmail.com> ---
Additional info:
I tried to figure out why everything went OK in my earlier test, and Mesa is
probably the trigger:

Linux-amd-staging + Mesa-git + LLVM-svn - failure
Linux-amd-staging + Mesa-git + LLVM 4.0.1 - failure
Linux-amd-staging + Mesa 17.1.8 + LLVM 4.0.1 - works OK.
I will try some bisecting later; we will see.
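
The reasoning behind "Mesa is probably the trigger" can be spelled out as a small Python sketch: look for two runs that differ in exactly one component and also differ in outcome. The three result rows are taken from the list above; the function name and the dictionary keys are illustrative.

```python
# (configuration, worked?) pairs as reported in this comment.
results = [
    ({"kernel": "Linux-amd-staging", "mesa": "Mesa-git", "llvm": "LLVM-svn"}, False),
    ({"kernel": "Linux-amd-staging", "mesa": "Mesa-git", "llvm": "LLVM 4.0.1"}, False),
    ({"kernel": "Linux-amd-staging", "mesa": "Mesa 17.1.8", "llvm": "LLVM 4.0.1"}, True),
]


def single_component_flips(runs):
    """Return components whose change alone flipped the outcome between two runs."""
    suspects = []
    for i, (cfg_a, ok_a) in enumerate(runs):
        for cfg_b, ok_b in runs[i + 1:]:
            changed = [key for key in cfg_a if cfg_a[key] != cfg_b[key]]
            if len(changed) == 1 and ok_a != ok_b:
                suspects.append(changed[0])
    return suspects


suspects = single_component_flips(results)
print(suspects)  # ['mesa']
```

Swapping only LLVM (rows 1 and 2) does not change the outcome, while swapping only Mesa (rows 2 and 3) does, which is what points the next bisect at Mesa.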


* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (10 preceding siblings ...)
  2017-09-07 20:13 ` bugzilla-daemon
@ 2017-09-07 23:23 ` bugzilla-daemon
  2017-09-08  8:18 ` bugzilla-daemon
                   ` (11 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-07 23:23 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 894 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #11 from Arek Ruśniak <arek.rusi@gmail.com> ---
On the Mesa side, this looks like the culprit:

214b565bc28bc4419f3eec29ab7bbe34080459fe is the first bad commit
commit 214b565bc28bc4419f3eec29ab7bbe34080459fe
Author: Christian König <christian.koenig@amd.com>
Date:   Tue Aug 29 16:45:46 2017 +0200

    winsys/amdgpu: set AMDGPU_GEM_CREATE_VM_ALWAYS_VALID if possible v2

    When the kernel supports it set the local flag and
    stop adding those BOs to the BO list.

    Can probably be optimized much more.

    v2: rename new flag to AMDGPU_GEM_CREATE_VM_ALWAYS_VALID

    Reviewed-by: Marek Olšák <marek.olsak@amd.com>

:040000 040000 2e4b2737f37ede2bbdbbe6815fe0fa562177c2b7
3482c86ed92116adff7ab12b2d4de870746a1df6 M      src
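
The commit message above describes a capability-gated code path: when the kernel supports it, buffers get the always-valid flag and are no longer added to the BO list. The toy Python model below sketches only that gating idea. It is not Mesa's code; the class and attribute names are hypothetical, and the flag is represented by its name rather than its real value. The actual condition ("if possible") is more involved than a single boolean.

```python
from dataclasses import dataclass, field

# Flag name taken from the commit message; stored as a string on purpose,
# since the numeric value is not given in this thread.
VM_ALWAYS_VALID = "AMDGPU_GEM_CREATE_VM_ALWAYS_VALID"


@dataclass
class Buffer:
    name: str
    flags: set = field(default_factory=set)


@dataclass
class Winsys:
    """Toy model of the userspace side (hypothetical, for illustration only)."""
    kernel_supports_local_bos: bool
    bo_list: list = field(default_factory=list)

    def create_buffer(self, name: str) -> Buffer:
        bo = Buffer(name)
        if self.kernel_supports_local_bos:
            # New path: mark the BO as always valid in this VM and
            # keep it off the per-submission BO list.
            bo.flags.add(VM_ALWAYS_VALID)
        else:
            # Old path: every BO is listed explicitly for each submission.
            self.bo_list.append(bo)
        return bo


old = Winsys(kernel_supports_local_bos=False)
new = Winsys(kernel_supports_local_bos=True)
old_bo = old.create_buffer("vertex")
new_bo = new.create_buffer("vertex")
print(len(old.bo_list), len(new.bo_list))  # 1 0
```

A gate of this kind would be consistent with both bisect results in this thread: the kernel bisect lands on the commit that bumps the driver version, and the Mesa bisect lands on the commit that starts using the new path.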


* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (11 preceding siblings ...)
  2017-09-07 23:23 ` bugzilla-daemon
@ 2017-09-08  8:18 ` bugzilla-daemon
  2017-09-08 10:13 ` bugzilla-daemon
                   ` (10 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-08  8:18 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 386 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #12 from Christian König <deathsimple@vodafone.de> ---
To repeat my question: Does patch "drm/amdgpu: fix moved list handling in the
VM" fix the issue?

Do you guys have this in your kernel branch yet? If not that lockup is
expected.


* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (12 preceding siblings ...)
  2017-09-08  8:18 ` bugzilla-daemon
@ 2017-09-08 10:13 ` bugzilla-daemon
  2017-09-08 11:08 ` bugzilla-daemon
                   ` (9 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-08 10:13 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 515 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #13 from Arek Ruśniak <arek.rusi@gmail.com> ---
Christian, sorry, I thought that was clear.
Yes, I updated ASAP, so my kernel contains:
https://cgit.freedesktop.org/~agd5f/linux/commit/?h=amd-staging-4.12&id=8bd2cc0ab44b00346cc41f3ac828cbf992f6bc61
It doesn't help with the VM faults.

Every test right before and after your comment was run on:
linux-amd-staging-4.12-c5def4cbdb61


* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (13 preceding siblings ...)
  2017-09-08 10:13 ` bugzilla-daemon
@ 2017-09-08 11:08 ` bugzilla-daemon
  2017-09-08 12:19 ` bugzilla-daemon
                   ` (8 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-08 11:08 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 377 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #14 from Christian König <deathsimple@vodafone.de> ---
(In reply to Arek Ruśniak from comment #13)
> Christian sorry, I thought that was clear. 

No problem, that just means that this is the same issue I'm still hunting for.


* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (14 preceding siblings ...)
  2017-09-08 11:08 ` bugzilla-daemon
@ 2017-09-08 12:19 ` bugzilla-daemon
  2017-09-08 15:54 ` bugzilla-daemon
                   ` (7 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-08 12:19 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 357 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #15 from Christian König <deathsimple@vodafone.de> ---
Created attachment 134082
  --> https://bugs.freedesktop.org/attachment.cgi?id=134082&action=edit
Possible fix

Please try the attached kernel patch.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 1447 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (15 preceding siblings ...)
  2017-09-08 12:19 ` bugzilla-daemon
@ 2017-09-08 15:54 ` bugzilla-daemon
  2017-09-09  1:49 ` bugzilla-daemon
                   ` (6 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-08 15:54 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 374 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #16 from Arek Ruśniak <arek.rusi@gmail.com> ---
The patch fixes the issue.
I've tried both the staging-4.12 and staging-drm-next branches.
Thanks, Christian.

PS. It would be nice if Vedran could confirm this for Vega before we close.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 1255 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (16 preceding siblings ...)
  2017-09-08 15:54 ` bugzilla-daemon
@ 2017-09-09  1:49 ` bugzilla-daemon
  2017-09-09 19:11 ` bugzilla-daemon
                   ` (5 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-09  1:49 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 1215 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #17 from Dieter Nützel <Dieter@nuetzel-hh.de> ---
(In reply to Christian König from comment #15)
> Created attachment 134082 [details] [review]
> Possible fix
> 
> Please try the attached kernel patch.

Hello Christian,

you've done your 'homework'... ;-)

> To repeat my question: Does patch "drm/amdgpu: fix moved list handling in
> the VM" fix the issue?
>
> Do you guys have this in your kernel branch yet? If not that lockup is
> expected.

No, I haven't.
It fell through the cracks during the repeated DC rebases of Alex's
'amd-staging-drm-next' tree (I didn't notice it for the last 7 days, during
Alex's vacation).
I'll make it short: no, that didn't solve it for me either.

But _this_ patch is GOLD:
drm-amdgpu-fix-VM-sync-with-always-valid-BOs.mbox

Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>

Best 'glmark2' Score I've ever seen.
RX580, 8 GB
Xeon X3470, 4/8, 3 GHz
24 GB

glmark2 Score: 6428

With additional load on the gfx cores from running
'opencl-example/run_tests.sh' in parallel, I got

glmark2 Score: 7574

Good job!


* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
  2017-08-31 20:44 [Bug 102500] [polaris10][amd-staging-4.12] GPU fault detected, somethimes lockup bugzilla-daemon
                   ` (17 preceding siblings ...)
  2017-09-09  1:49 ` bugzilla-daemon
@ 2017-09-09 19:11 ` bugzilla-daemon
  2017-09-09 19:52 ` bugzilla-daemon
                   ` (4 subsequent siblings)
  23 siblings, 0 replies; 25+ messages in thread
From: bugzilla-daemon @ 2017-09-09 19:11 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 648 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #18 from Vedran Miletić <vedran@miletic.net> ---
(In reply to Arek Ruśniak from comment #16)
> Patch fixes issue.
> I've tried both staging-4.12 and staging-drm-next branches.
> Thanks Christian
> 
> PS. It will be nice if Vedran could confirmed this for Vega before we close.

I can confirm that after applying the patch the issue doesn't occur for me. (I
hope that's enough, I can't claim more than that since I have done 2-3 upgrades
of mesa/llvm since I last tested the broken kernel.)


* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
From: bugzilla-daemon @ 2017-09-09 19:52 UTC (permalink / raw)
  To: dri-devel


https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #19 from charlie <bug0xa3d2@hushmail.com> ---
Bug 102500 might be related to bug 102598.

I tried to apply the patch (attachment 134082) to the amd-staging-4.12
(~agd5f/linux) kernel and to drm-next-4.15-wip, but it does not apply cleanly.
I then applied it manually to drm-next-4.15-wip, and that kernel would not
finish compiling.


* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
From: bugzilla-daemon @ 2017-09-10  4:14 UTC (permalink / raw)
  To: dri-devel


https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #20 from charlie <bug0xa3d2@hushmail.com> ---
I confirm that bug 102500 and bug 102598 are the same.

I split up the patch into 3 parts and they applied cleanly with offsets to
drm-next-4.15-wip.

I then reverted mesa to commit 214b565bc28bc4419f3eec29ab7bbe34080459fe
(winsys/amdgpu: set AMDGPU_GEM_CREATE_VM_ALWAYS_VALID if possible v2),
compiled it, and started X; the corruption and lockups are gone.
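
The comment does not say how the patch was split into 3 parts. As a rough
sketch only, assuming a git-style patch that is split at each "diff --git"
header (one part per touched file), the splitting step could look like this.
The function name split_diff_by_file is hypothetical.

```python
from typing import List


def split_diff_by_file(diff_text: str) -> List[str]:
    """Split a git-style patch into one chunk per 'diff --git' section.

    Assumption: per-file splitting. The thread does not state how the
    original patch was actually divided.
    """
    chunks: List[List[str]] = []
    for line in diff_text.splitlines(keepends=True):
        if line.startswith("diff --git "):
            chunks.append([])  # a new per-file section starts here
        if chunks:  # skip mail headers / commit message before the first diff
            chunks[-1].append(line)
    return ["".join(chunk) for chunk in chunks]
```

Each resulting part could then be tried separately, for example with
"git apply --check", before applying it to the tree. "Applied cleanly with
offsets" means the hunks matched, but at line numbers shifted from those
recorded in the patch.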


* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
From: bugzilla-daemon @ 2017-09-10  4:21 UTC (permalink / raw)
  To: dri-devel


https://bugs.freedesktop.org/show_bug.cgi?id=102500

charlie <bug0xa3d2@hushmail.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |bug0xa3d2@hushmail.com

--- Comment #21 from charlie <bug0xa3d2@hushmail.com> ---
*** Bug 102598 has been marked as a duplicate of this bug. ***


* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
From: bugzilla-daemon @ 2017-10-03 21:36 UTC (permalink / raw)
  To: dri-devel


https://bugs.freedesktop.org/show_bug.cgi?id=102500

--- Comment #22 from Vedran Miletić <vedran@miletic.net> ---
The patch has been included in amd-staging-drm-next for a while. Should this
bug be closed?


* [Bug 102500] [polaris10, vega10][amd-staging-4.12, amd-staging-drm-next] GPU fault detected, somethimes lockup
From: bugzilla-daemon @ 2017-10-03 23:13 UTC (permalink / raw)
  To: dri-devel


https://bugs.freedesktop.org/show_bug.cgi?id=102500

Arek Ruśniak <arek.rusi@gmail.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|---                         |FIXED

