All of lore.kernel.org
 help / color / mirror / Atom feed
* [Bug 103736] Sudden system freezes, dmesg errors
@ 2017-11-14 15:12 bugzilla-daemon
  2017-11-14 15:53 ` bugzilla-daemon
                   ` (15 more replies)
  0 siblings, 16 replies; 17+ messages in thread
From: bugzilla-daemon @ 2017-11-14 15:12 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 1623 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

            Bug ID: 103736
           Summary: Sudden system freezes, dmesg errors
           Product: DRI
           Version: XOrg git
          Hardware: x86-64 (AMD64)
                OS: Linux (All)
            Status: NEW
          Severity: normal
          Priority: medium
         Component: DRM/AMDgpu
          Assignee: dri-devel@lists.freedesktop.org
          Reporter: shiverly@mt2015.com

Created attachment 135450
  --> https://bugs.freedesktop.org/attachment.cgi?id=135450&action=edit
dmesg errors

I installed Ubuntu Mate 17.10 and M-bab drivers
(https://github.com/M-Bab/linux-kernel-amdgpu-binaries, without them one
monitor is always black but powered on). 

Almost every day system freezes suddenly after random amount of time, which can
be from 5 minutes to 3+ hours. Only power button helps, no logs are saved but
dmesg has errors. 

I think this is either AMDGPU bug or something ryzen related (most likely not,
because they manifest as sudden reboots, never as system freezes. And last bios
update stopped them 2 months ago).

Graphics:  Card: Advanced Micro Devices [AMD/ATI] Tonga PRO [Radeon R9 285/380]
           Display Server: x11 (X.Org 1.19.5 )
           drivers: ati,amdgpu (unloaded: modesetting,fbdev,vesa,radeon)
           Resolution: 1920x1080@60.00hz, 1920x1080@60.00hz
           OpenGL: renderer: AMD Radeon R9 200 Series (TONGA / DRM 3.23.0 /
4.13.11+, LLVM 5.0.1)
           version: 4.5 Mesa 17.4.0-devel

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 3184 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, dmesg errors
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
@ 2017-11-14 15:53 ` bugzilla-daemon
  2017-11-14 18:37 ` bugzilla-daemon
                   ` (14 subsequent siblings)
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2017-11-14 15:53 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 974 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

Michel Dänzer <michel@daenzer.net> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |andresx7@gmail.com

--- Comment #1 from Michel Dänzer <michel@daenzer.net> ---
(In reply to Shiverly from comment #0)
> [...] dmesg has errors. 

I only see messages about failing to allocate a larger BAR, which is harmless.


> I think this is either AMDGPU bug or something ryzen related (most likely
> not, because they manifest as sudden reboots, never as system freezes. And
> last bios update stopped them 2 months ago).

FWIW, Andres Rodriguez reported similar symptoms with a Ryzen system on IRC,
and raising voltages / disabling Cool'n'Quiet / disabling C6 states fixed them
for him.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 2371 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, dmesg errors
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
  2017-11-14 15:53 ` bugzilla-daemon
@ 2017-11-14 18:37 ` bugzilla-daemon
  2017-11-15 17:55 ` bugzilla-daemon
                   ` (13 subsequent siblings)
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2017-11-14 18:37 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 481 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

--- Comment #2 from Andres Rodriguez <andresx7@gmail.com> ---

> FWIW, Andres Rodriguez reported similar symptoms with a Ryzen system on IRC,
> and raising voltages / disabling Cool'n'Quiet / disabling C6 states fixed
> them for him.

I raised the memory and the core voltages specifically. The other voltages like
SoC were left untouched.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 1276 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, dmesg errors
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
  2017-11-14 15:53 ` bugzilla-daemon
  2017-11-14 18:37 ` bugzilla-daemon
@ 2017-11-15 17:55 ` bugzilla-daemon
  2017-11-15 18:29 ` bugzilla-daemon
                   ` (12 subsequent siblings)
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2017-11-15 17:55 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 752 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

--- Comment #3 from Shiverly <shiverly@mt2015.com> ---
(In reply to Andres Rodriguez from comment #2)
> > FWIW, Andres Rodriguez reported similar symptoms with a Ryzen system on IRC,
> > and raising voltages / disabling Cool'n'Quiet / disabling C6 states fixed
> > them for him.
> 
> I raised the memory and the core voltages specifically. The other voltages
> like SoC were left untouched.

I didn't have these symptoms in arch or ubuntu 16.04 LTS, only when using this
driver/kernel combination (which is only one that keeps both monitors usable).
Long compilation jobs don't cause system freezes either.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 1598 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, dmesg errors
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
                   ` (2 preceding siblings ...)
  2017-11-15 17:55 ` bugzilla-daemon
@ 2017-11-15 18:29 ` bugzilla-daemon
  2017-11-15 18:32 ` [Bug 103736] Sudden system freezes, GPU fault detected bugzilla-daemon
                   ` (11 subsequent siblings)
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2017-11-15 18:29 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 5825 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

--- Comment #4 from Shiverly <shiverly@mt2015.com> ---
I got some logs. Maybe they are related (found them in journalctl)

Nov 15 20:09:06 tibu-pc kernel: gmc_v8_0_process_interrupt: 626 callbacks
suppressed
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: VM fault (0x01, vmid 5) at
page 154068154, read from 'TC5' (0x54433500) (192)
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0A0C0001
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x092EE4BA
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: GPU fault detected: 147
0x05d0c001
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: VM fault (0x02, vmid 5) at
page 5545728, read from 'TC7' (0x54433700) (68)
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0A044002
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00549F00
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: GPU fault detected: 147
0x05d00001
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: VM fault (0x01, vmid 5) at
page 154068154, read from 'TC0' (0x54433000) (8)
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0A008001
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x092EE4BA
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: GPU fault detected: 147
0x05d00801
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: VM fault (0x02, vmid 5) at
page 5541638, read from 'TC9' (0x54433900) (136)
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0A088002
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00548F06
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: GPU fault detected: 147
0x06500001
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: VM fault (0x01, vmid 5) at
page 154068170, read from 'TC0' (0x54433000) (8)
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0A008001
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x092EE4CA
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: GPU fault detected: 147
0x06500801
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: VM fault (0x02, vmid 5) at
page 5541634, read from 'TC8' (0x54433800) (64)
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0A040002
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00548F02
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: GPU fault detected: 147
0x06504001
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: VM fault (0x01, vmid 5) at
page 154068170, read from 'TC7' (0x54433700) (68)
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0A044001
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x092EE4CA
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: GPU fault detected: 147
0x06504401
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: VM fault (0x01, vmid 5) at
page 154068170, read from 'TC2' (0x54433200) (0)
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0A000001
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x092EE4CA
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: GPU fault detected: 147
0x06500001
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: VM fault (0x01, vmid 5) at
page 154068171, read from 'TC7' (0x54433700) (68)
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0A044001
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x092EE4CB
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: GPU fault detected: 147
0x06584401
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: VM fault (0x01, vmid 5) at
page 154068170, read from 'TC7' (0x54433700) (68)
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0A044001
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x092EE4CA
Nov 15 20:08:20 tibu-pc kernel: amdgpu 0000:22:00.0: GPU fault detected: 147
0x06504401
Nov 15 20:08:20 tibu-pc kernel: gmc_v8_0_process_interrupt: 1830 callbacks
suppressed
Nov 15 20:08:03 tibu-pc kernel: amdgpu 0000:22:00.0: VM fault (0x01, vmid 1) at
page 154054911, read from 'TC5' (0x54433500) (192)
Nov 15 20:08:03 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x020C0001
Nov 15 20:08:03 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x092EB0FF
Nov 15 20:08:03 tibu-pc kernel: amdgpu 0000:22:00.0: GPU fault detected: 147
0x07f8c001
Nov 15 20:08:03 tibu-pc kernel: amdgpu 0000:22:00.0: VM fault (0x01, vmid 1) at
page 154054911, read from 'TC0' (0x54433000) (8)
Nov 15 20:08:03 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x02008001
Nov 15 20:08:03 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x092EB0FF
Nov 15 20:08:03 tibu-pc kernel: amdgpu 0000:22:00.0: GPU fault detected: 147
0x07f80801
Nov 15 20:08:03 tibu-pc kernel: amdgpu 0000:22:00.0: VM fault (0x01, vmid 1) at
page 154054911, read from 'TC11' (0x54433131) (128)
Nov 15 20:08:03 tibu-pc kernel: amdgpu 0000:22:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x02080001

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 6576 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, GPU fault detected
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
                   ` (3 preceding siblings ...)
  2017-11-15 18:29 ` bugzilla-daemon
@ 2017-11-15 18:32 ` bugzilla-daemon
  2017-11-18  9:10 ` bugzilla-daemon
                   ` (10 subsequent siblings)
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2017-11-15 18:32 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 452 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

Shiverly <shiverly@mt2015.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
            Summary|Sudden system freezes,      |Sudden system freezes, GPU
                   |dmesg errors                |fault detected

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 1111 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, GPU fault detected
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
                   ` (4 preceding siblings ...)
  2017-11-15 18:32 ` [Bug 103736] Sudden system freezes, GPU fault detected bugzilla-daemon
@ 2017-11-18  9:10 ` bugzilla-daemon
  2018-01-28 11:04 ` bugzilla-daemon
                   ` (9 subsequent siblings)
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2017-11-18  9:10 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 563 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

--- Comment #5 from Shiverly <shiverly@mt2015.com> ---
One way to get crash quickly is to play Overpass map in CS:GO in terrorist
spawn. Textures near the stairs show corrupted, and system always hangs in
first 5 minutes of gameplay. I think it's 3D related, because just using simple
text editor or being in Ctrl-Alt-Fx terminal never hangs, but browser can cause
hang but it's less quick to manifest than playing 3D game.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 1326 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, GPU fault detected
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
                   ` (5 preceding siblings ...)
  2017-11-18  9:10 ` bugzilla-daemon
@ 2018-01-28 11:04 ` bugzilla-daemon
  2018-01-28 11:13 ` bugzilla-daemon
                   ` (8 subsequent siblings)
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2018-01-28 11:04 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 681 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

--- Comment #6 from Lennart Sauerbeck <fdobugs@lennart.sauerbeck.org> ---
Created attachment 137005
  --> https://bugs.freedesktop.org/attachment.cgi?id=137005&action=edit
Crash while playing Counter-Strike: Global Offensive

I think I'm running into the same issues. Attached is the kernel output while
playing Counter-Strike: Global Offensive. It worked during the warmup, but
froze in the first round, so I'd say about 3-5 minutes after starting the game.

I'm running an up-to-date Debian unstable with Linux 4.14.13 and Mesa 17.3.3.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 1659 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, GPU fault detected
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
                   ` (6 preceding siblings ...)
  2018-01-28 11:04 ` bugzilla-daemon
@ 2018-01-28 11:13 ` bugzilla-daemon
  2018-02-11 10:24 ` bugzilla-daemon
                   ` (7 subsequent siblings)
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2018-01-28 11:13 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 1418 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

--- Comment #7 from Lennart Sauerbeck <fdobugs@lennart.sauerbeck.org> ---
Created attachment 137006
  --> https://bugs.freedesktop.org/attachment.cgi?id=137006&action=edit
Errors while playing CS:GO, crash and reboot after opening VLC

Another crash pretty much right after the one from my previous comment. After
rebooting the system to continue playing Counter-Strike: Global Offensive the
errors kept coming, though the system did not freeze (note the timestamps in
the error log).

After shutting down the game, I started VLC to watch a stream and the system
froze immediately. After a short while (<5 minutes) I used Magic SysReq keys to
reboot the system safely, which can also be seen in the log.

A possibly important detail: My system doesn't freeze entirely, only the
graphics output does. Sound still works for a time, even voice chatting
continues to work. However, all X output freezes (e.g. conky on desktop).

I haven't tried going to a virtual console, so do not know whether that still
works.

I also had the same issue while playing Euro Truck Simulator 2, but it never
happened while playing Dota 2. Given this, it seems like some illegal
instruction is passed to the graphics driver. Would an ApiTrace help? If so, I
can try to record one.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 2419 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, GPU fault detected
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
                   ` (7 preceding siblings ...)
  2018-01-28 11:13 ` bugzilla-daemon
@ 2018-02-11 10:24 ` bugzilla-daemon
  2018-02-12 19:22 ` bugzilla-daemon
                   ` (6 subsequent siblings)
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2018-02-11 10:24 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 685 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

--- Comment #8 from Lennart Sauerbeck <fdobugs@lennart.sauerbeck.org> ---
I was able to record an ApiTrace which shows the problem consistently. However,
it's 2.5 gigabytes and contains personal information I'd rather not share on a
public bugtracker -- I think a trace can only be truncated, removing stuff from
the beginning messes up the OpenGL context?

I cannot switch to the virtual console when the freeze is triggered.

I also built radeonsi from current Mesa git
(9b9a89cd795fda462a6ee898ef6e5135ca79d94e) but the problem persisted.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 1467 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, GPU fault detected
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
                   ` (8 preceding siblings ...)
  2018-02-11 10:24 ` bugzilla-daemon
@ 2018-02-12 19:22 ` bugzilla-daemon
  2018-02-13 21:30 ` bugzilla-daemon
                   ` (5 subsequent siblings)
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2018-02-12 19:22 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 1203 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

--- Comment #9 from Ernst Sjöstrand <ernstp@gmail.com> ---
I get

[  133.978908] amdgpu 0000:09:00.0: GPU fault detected: 147 0x00198802
[  133.978911] amdgpu 0000:09:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00500003
[  133.978912] amdgpu 0000:09:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x02188002
[  133.978914] amdgpu 0000:09:00.0: VM fault (0x02, vmid 1) at page 5242883,
read from 'TC4' (0x54433400) (392)

or from another boot

[  204.841497] amdgpu 0000:09:00.0: GPU fault detected: 147 0x00188402
[  204.841501] amdgpu 0000:09:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00500003
[  204.841502] amdgpu 0000:09:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x0A084002
[  204.841504] amdgpu 0000:09:00.0: VM fault (0x02, vmid 5) at page 5242883,
read from '' (0x00000000) (132)


When I try to launch steam. It never gets to draw any UI, the computer just
freezes.
This happens with both 4.13(-ubuntu33) and 4.15.2 kernel with Mesa/LLVM from
git (padoka).
When I reverted to Mesa 17.2.8 + LLVM 5.0.0 I could launch steam again.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 1971 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, GPU fault detected
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
                   ` (9 preceding siblings ...)
  2018-02-12 19:22 ` bugzilla-daemon
@ 2018-02-13 21:30 ` bugzilla-daemon
  2018-02-13 22:18 ` bugzilla-daemon
                   ` (4 subsequent siblings)
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2018-02-13 21:30 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 305 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

--- Comment #10 from Ernst Sjöstrand <ernstp@gmail.com> ---
The Vehicle Game demo seem to trigger this quite reliably for me:
https://wiki.unrealengine.com/Linux_Demos

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 1130 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, GPU fault detected
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
                   ` (10 preceding siblings ...)
  2018-02-13 21:30 ` bugzilla-daemon
@ 2018-02-13 22:18 ` bugzilla-daemon
  2018-03-31 22:41 ` bugzilla-daemon
                   ` (3 subsequent siblings)
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2018-02-13 22:18 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 575 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

--- Comment #11 from Ernst Sjöstrand <ernstp@gmail.com> ---
Ok, the vm faults I see are caused by using Padoka ppa which currently has
https://cgit.freedesktop.org/mesa/mesa/commit/?id=847d0a393d7f0f967f39302900d5330f32b804c8
but not
https://reviews.llvm.org/D41663

That means it can't be the same as the original issue, and also that the
solution for me is just to update to more recent versions. Sorry for the noise
in this bug.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 1495 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, GPU fault detected
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
                   ` (11 preceding siblings ...)
  2018-02-13 22:18 ` bugzilla-daemon
@ 2018-03-31 22:41 ` bugzilla-daemon
  2018-04-01  8:58 ` bugzilla-daemon
                   ` (2 subsequent siblings)
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2018-03-31 22:41 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 482 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

--- Comment #12 from aceman <acelists@atlas.sk> ---
Ernst, I have also traced the error you have to usage of OpenCL in the Mesa
clover driver on RX560 with LLVM upgraded from 5.0.1 to 6.0. What do you say is
the solution? Is Mesa using intrinsics that are only in LLVM git? Or is that
LLVM changeset you posted already in the release LLVM 6.0?

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 1242 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, GPU fault detected
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
                   ` (12 preceding siblings ...)
  2018-03-31 22:41 ` bugzilla-daemon
@ 2018-04-01  8:58 ` bugzilla-daemon
  2018-04-01 17:28 ` bugzilla-daemon
  2019-11-19  8:26 ` bugzilla-daemon
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2018-04-01  8:58 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 314 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

--- Comment #13 from Ernst Sjöstrand <ernstp@gmail.com> ---
aceman: the problem was mismatching development snapshots, couldn't happen if
you have any real releases in the mix.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 1083 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, GPU fault detected
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
                   ` (13 preceding siblings ...)
  2018-04-01  8:58 ` bugzilla-daemon
@ 2018-04-01 17:28 ` bugzilla-daemon
  2019-11-19  8:26 ` bugzilla-daemon
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2018-04-01 17:28 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 262 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

--- Comment #14 from aceman <acelists@atlas.sk> ---
I'm using Mesa git, but LLVM 6.0 release. Is that fine wrt. this mismatch?

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 1022 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

* [Bug 103736] Sudden system freezes, GPU fault detected
  2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
                   ` (14 preceding siblings ...)
  2018-04-01 17:28 ` bugzilla-daemon
@ 2019-11-19  8:26 ` bugzilla-daemon
  15 siblings, 0 replies; 17+ messages in thread
From: bugzilla-daemon @ 2019-11-19  8:26 UTC (permalink / raw)
  To: dri-devel


[-- Attachment #1.1: Type: text/plain, Size: 806 bytes --]

https://bugs.freedesktop.org/show_bug.cgi?id=103736

Martin Peres <martin.peres@free.fr> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         Resolution|---                         |MOVED
             Status|NEW                         |RESOLVED

--- Comment #15 from Martin Peres <martin.peres@free.fr> ---
-- GitLab Migration Automatic Message --

This bug has been migrated to freedesktop.org's GitLab instance and has been
closed from further activity.

You can subscribe and participate further through the new bug through this link
to our GitLab instance: https://gitlab.freedesktop.org/drm/amd/issues/258.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[-- Attachment #1.2: Type: text/html, Size: 2349 bytes --]

[-- Attachment #2: Type: text/plain, Size: 159 bytes --]

_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel

^ permalink raw reply	[flat|nested] 17+ messages in thread

end of thread, other threads:[~2019-11-19  8:26 UTC | newest]

Thread overview: 17+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-11-14 15:12 [Bug 103736] Sudden system freezes, dmesg errors bugzilla-daemon
2017-11-14 15:53 ` bugzilla-daemon
2017-11-14 18:37 ` bugzilla-daemon
2017-11-15 17:55 ` bugzilla-daemon
2017-11-15 18:29 ` bugzilla-daemon
2017-11-15 18:32 ` [Bug 103736] Sudden system freezes, GPU fault detected bugzilla-daemon
2017-11-18  9:10 ` bugzilla-daemon
2018-01-28 11:04 ` bugzilla-daemon
2018-01-28 11:13 ` bugzilla-daemon
2018-02-11 10:24 ` bugzilla-daemon
2018-02-12 19:22 ` bugzilla-daemon
2018-02-13 21:30 ` bugzilla-daemon
2018-02-13 22:18 ` bugzilla-daemon
2018-03-31 22:41 ` bugzilla-daemon
2018-04-01  8:58 ` bugzilla-daemon
2018-04-01 17:28 ` bugzilla-daemon
2019-11-19  8:26 ` bugzilla-daemon

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.