From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Thu, 09 Jul 2015 10:10:12 +0000 Message-ID: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1175985225==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 449BA6ECAB for ; Thu, 9 Jul 2015 03:10:12 -0700 (PDT) List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1175985225== Content-Type: multipart/alternative; boundary="1436436612.AcF4b40.22595"; charset="UTF-8" --1436436612.AcF4b40.22595 Date: Thu, 9 Jul 2015 10:10:12 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 Bug ID: 91278 Summary: Tonga GPU lock/reset fail with Unigine Valley Product: DRI Version: XOrg git Hardware: Other OS: All Status: NEW Severity: normal Priority: medium Component: DRM/AMDgpu Assignee: dri-devel@lists.freedesktop.org Reporter: adf.lists@gmail.com R9 285 kernel agd5f amdgpu with or without patches from https://bugs.freedesktop.org/show_bug.cgi?id=91141 mesa is agd5f with a few patches from mainline to build with current llvm. ddx is git against older xorg. Simpler games like openarena don't lock. Valley settings ultra, 8xAA, fullscreen 1920x1080. Doesn't show in this log but I've also seen some [drm:amdgpu_gem_va_ioctl [amdgpu]] *ERROR* Couldn't update BO_VA (-35) around the lock/reset on previous tests. -- You are receiving this mail because: You are the assignee for the bug. --1436436612.AcF4b40.22595 Date: Thu, 9 Jul 2015 10:10:12 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"
Bug ID 91278
Summary Tonga GPU lock/reset fail with Unigine Valley
Product DRI
Version XOrg git
Hardware Other
OS All
Status NEW
Severity normal
Priority medium
Component DRM/AMDgpu
Assignee dri-devel@lists.freedesktop.org
Reporter adf.lists@gmail.com

R9 285 kernel agd5f amdgpu with or without patches from 

https://bugs.freedesktop.org/show_bug.cgi?id=91141 

mesa is agd5f with a few patches from mainline to build with current llvm.

ddx is git against older xorg.

Simpler games like openarena don't lock.

Valley settings ultra, 8xAA, fullscreen 1920x1080.

Doesn't show in this log but I've also seen some

 [drm:amdgpu_gem_va_ioctl [amdgpu]] *ERROR* Couldn't update BO_VA (-35)

around the lock/reset on previous tests.


You are receiving this mail because:
  • You are the assignee for the bug.
--1436436612.AcF4b40.22595-- --===============1175985225== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1175985225==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Thu, 09 Jul 2015 10:11:09 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1193666980==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 66A196ECB4 for ; Thu, 9 Jul 2015 03:11:09 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1193666980== Content-Type: multipart/alternative; boundary="1436436669.7D1a46b0.22854"; charset="UTF-8" --1436436669.7D1a46b0.22854 Date: Thu, 9 Jul 2015 10:11:09 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #1 from Andy Furniss --- Created attachment 117012 --> https://bugs.freedesktop.org/attachment.cgi?id=117012&action=edit dmesg with lock -- You are receiving this mail because: You are the assignee for the bug. --1436436669.7D1a46b0.22854 Date: Thu, 9 Jul 2015 10:11:09 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"


You are receiving this mail because:
  • You are the assignee for the bug.
--1436436669.7D1a46b0.22854-- --===============1193666980== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1193666980==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Thu, 09 Jul 2015 10:12:56 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1738419276==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 1A7676E1AA for ; Thu, 9 Jul 2015 03:12:56 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1738419276== Content-Type: multipart/alternative; boundary="1436436775.26cD0.23447"; charset="UTF-8" --1436436775.26cD0.23447 Date: Thu, 9 Jul 2015 10:12:55 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #2 from Andy Furniss --- Created attachment 117013 --> https://bugs.freedesktop.org/attachment.cgi?id=117013&action=edit xorg-log for ref (not from lock) -- You are receiving this mail because: You are the assignee for the bug. --1436436775.26cD0.23447 Date: Thu, 9 Jul 2015 10:12:55 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 2 on bug 91278 from
Created attachment 117013 [details]
xorg-log for ref (not from lock)


You are receiving this mail because:
  • You are the assignee for the bug.
--1436436775.26cD0.23447-- --===============1738419276== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1738419276==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Fri, 17 Jul 2015 09:30:42 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1141075797==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 20C5C6E182 for ; Fri, 17 Jul 2015 02:30:42 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1141075797== Content-Type: multipart/alternative; boundary="1437125441.b4ddac2a0.26386"; charset="UTF-8" --1437125441.b4ddac2a0.26386 Date: Fri, 17 Jul 2015 09:30:41 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #3 from Andy Furniss --- I've of course tried various things sine reporting - Valley doesn't always instantly lock. Unreal 4.5 Elemental got half way before locking. Perhaps more interesting I managed to reset/fail resume just browsing - of course I've done a lot of browsing without issue so far. The difference this time was I had a huge ffmpeg/x265 encode going - it was using all my memory (8 Gig and swap had been used a bit), so it's possible memory pressure plays a role - or maybe just a red herring :-) I haven't managed to get a reset running timedemos on openarena or xonotic so far - will try with memory pressure as time allows. The reset when browsing - -rw-rw-r-- 1 andy andy 153K Jun 13 00:04 hacky-fix.jpeg [ 8052.101670] amdgpu 0000:01:00.0: GPU lockup (waiting for 0x000000000000f019 last fence id 0x000000000000f018 on ring 9) [ 8052.101672] amdgpu 0000:01:00.0: failed to sync rings (-35) [ 8052.108912] amdgpu 0000:01:00.0: Saved 9216 dwords of commands on ring 9. [ 8052.108929] amdgpu 0000:01:00.0: GPU softreset: 0x00000100 [ 8052.108930] amdgpu 0000:01:00.0: GRBM_STATUS=0x00003028 [ 8052.108932] amdgpu 0000:01:00.0: GRBM_STATUS2=0x00000008 [ 8052.108934] amdgpu 0000:01:00.0: GRBM_STATUS_SE0=0x00000006 [ 8052.108935] amdgpu 0000:01:00.0: GRBM_STATUS_SE1=0x00000006 [ 8052.108937] amdgpu 0000:01:00.0: GRBM_STATUS_SE2=0x00000006 [ 8052.108938] amdgpu 0000:01:00.0: GRBM_STATUS_SE3=0x00000006 [ 8052.108940] amdgpu 0000:01:00.0: SRBM_STATUS=0x20020240 [ 8052.108941] amdgpu 0000:01:00.0: SRBM_STATUS2=0x00000080 [ 8052.108943] amdgpu 0000:01:00.0: SDMA0_STATUS_REG = 0x76DEED57 [ 8052.108945] amdgpu 0000:01:00.0: SDMA1_STATUS_REG = 0x46DEED57 [ 8052.108946] amdgpu 0000:01:00.0: CP_STAT = 0x00000000 [ 8052.108948] amdgpu 0000:01:00.0: CP_STALLED_STAT1 = 0x00000c00 [ 8052.108949] amdgpu 0000:01:00.0: CP_STALLED_STAT2 = 0x00000000 [ 8052.108951] amdgpu 0000:01:00.0: CP_STALLED_STAT3 = 0x00000000 [ 8052.108953] amdgpu 0000:01:00.0: CP_CPF_BUSY_STAT = 0x00000000 [ 8052.108954] amdgpu 0000:01:00.0: CP_CPF_STALLED_STAT1 = 0x00000000 [ 8052.108956] amdgpu 0000:01:00.0: CP_CPF_STATUS = 0x00000000 [ 8052.108957] amdgpu 0000:01:00.0: CP_CPC_BUSY_STAT = 0x00000000 [ 8052.108959] amdgpu 0000:01:00.0: CP_CPC_STALLED_STAT1 = 0x00000000 [ 8052.108961] amdgpu 0000:01:00.0: CP_CPC_STATUS = 0x00000000 [ 8052.108962] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000 [ 8052.108964] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000 [ 8052.109078] amdgpu 0000:01:00.0: SRBM_SOFT_RESET=0x00000400 [ 8052.110233] amdgpu 0000:01:00.0: GRBM_STATUS=0x00003028 [ 8052.110235] amdgpu 0000:01:00.0: GRBM_STATUS2=0x00000008 [ 8052.110236] amdgpu 0000:01:00.0: GRBM_STATUS_SE0=0x00000006 [ 8052.110238] amdgpu 0000:01:00.0: GRBM_STATUS_SE1=0x00000006 [ 8052.110239] amdgpu 0000:01:00.0: GRBM_STATUS_SE2=0x00000006 [ 8052.110241] amdgpu 0000:01:00.0: GRBM_STATUS_SE3=0x00000006 [ 8052.110242] amdgpu 0000:01:00.0: SRBM_STATUS=0x20020040 [ 8052.110244] amdgpu 0000:01:00.0: SRBM_STATUS2=0x00000080 [ 8052.110245] amdgpu 0000:01:00.0: SDMA0_STATUS_REG = 0x76DEED57 [ 8052.110247] amdgpu 0000:01:00.0: SDMA1_STATUS_REG = 0x46DEED57 [ 8052.110248] amdgpu 0000:01:00.0: CP_STAT = 0x00000000 [ 8052.110250] amdgpu 0000:01:00.0: CP_STALLED_STAT1 = 0x00000c00 [ 8052.110252] amdgpu 0000:01:00.0: CP_STALLED_STAT2 = 0x00000000 [ 8052.110253] amdgpu 0000:01:00.0: CP_STALLED_STAT3 = 0x00000000 [ 8052.110255] amdgpu 0000:01:00.0: CP_CPF_BUSY_STAT = 0x00000000 [ 8052.110256] amdgpu 0000:01:00.0: CP_CPF_STALLED_STAT1 = 0x00000000 [ 8052.110258] amdgpu 0000:01:00.0: CP_CPF_STATUS = 0x00000000 [ 8052.110259] amdgpu 0000:01:00.0: CP_CPC_BUSY_STAT = 0x00000000 [ 8052.110261] amdgpu 0000:01:00.0: CP_CPC_STALLED_STAT1 = 0x00000000 [ 8052.110262] amdgpu 0000:01:00.0: CP_CPC_STATUS = 0x00000000 [ 8052.110282] amdgpu 0000:01:00.0: GPU reset succeeded, trying to resume [ 8052.110289] [drm] probing gen 2 caps for device 1002:5a16 = 31cd02/0 [ 8052.111446] [drm] PCIE GART of 2048M enabled (table at 0x0000000000040000). [ 8052.113940] [drm] ring test on 0 succeeded in 10 usecs [ 8053.856277] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 1 test failed (scratch(0xC040)=0xCAFEDEAD) [ 8054.049187] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 2 test failed (scratch(0xC040)=0xCAFEDEAD) [ 8054.242101] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 3 test failed (scratch(0xC040)=0xCAFEDEAD) [ 8054.435020] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 4 test failed (scratch(0xC040)=0xCAFEDEAD) [ 8054.627925] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 5 test failed (scratch(0xC040)=0xCAFEDEAD) [ 8054.820839] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 6 test failed (scratch(0xC040)=0xCAFEDEAD) [ 8055.013737] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 7 test failed (scratch(0xC040)=0xCAFEDEAD) [ 8055.206669] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 8 test failed (scratch(0xC040)=0xCAFEDEAD) [ 8055.313826] [drm:sdma_v3_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 9 test failed (0xCAFEDEAD) [ 8055.319862] amdgpu 0000:01:00.0: GPU reset failed [ 8055.320787] amdgpu 0000:01:00.0: couldn't schedule ib [ 8055.320806] [drm:amdgpu_gem_va_ioctl [amdgpu]] *ERROR* Couldn't update BO_VA (-22) [ 8055.320831] amdgpu 0000:01:00.0: couldn't schedule ib [ 8055.320841] [drm:amdgpu_gem_va_ioctl [amdgpu]] *ERROR* Couldn't update BO_VA (-22) [ 8055.320854] amdgpu 0000:01:00.0: couldn't schedule ib [ 8055.320863] [drm:amdgpu_gem_va_ioctl [amdgpu]] *ERROR* Couldn't update BO_VA (-22) -- You are receiving this mail because: You are the assignee for the bug. --1437125441.b4ddac2a0.26386 Date: Fri, 17 Jul 2015 09:30:41 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 3 on bug 91278 from
I've of course tried various things sine reporting - Valley doesn't always
instantly lock. Unreal 4.5 Elemental got half way before locking.

Perhaps more interesting I managed to reset/fail resume just browsing - of
course I've done a lot of browsing without issue so far. The difference this
time was I had a huge ffmpeg/x265 encode going - it was using all my memory (8
Gig and swap had been used a bit), so it's possible memory pressure plays a
role - or maybe just a red herring :-)

I haven't managed to get a reset running timedemos on openarena or xonotic so
far - will try with memory pressure as time allows.

The reset when browsing -

-rw-rw-r--  1 andy andy 153K Jun 13 00:04 hacky-fix.jpeg
[ 8052.101670] amdgpu 0000:01:00.0: GPU lockup (waiting for 0x000000000000f019
last fence id 0x000000000000f018 on ring 9)
[ 8052.101672] amdgpu 0000:01:00.0: failed to sync rings (-35)
[ 8052.108912] amdgpu 0000:01:00.0: Saved 9216 dwords of commands on ring 9.
[ 8052.108929] amdgpu 0000:01:00.0: GPU softreset: 0x00000100
[ 8052.108930] amdgpu 0000:01:00.0:   GRBM_STATUS=0x00003028
[ 8052.108932] amdgpu 0000:01:00.0:   GRBM_STATUS2=0x00000008
[ 8052.108934] amdgpu 0000:01:00.0:   GRBM_STATUS_SE0=0x00000006
[ 8052.108935] amdgpu 0000:01:00.0:   GRBM_STATUS_SE1=0x00000006
[ 8052.108937] amdgpu 0000:01:00.0:   GRBM_STATUS_SE2=0x00000006
[ 8052.108938] amdgpu 0000:01:00.0:   GRBM_STATUS_SE3=0x00000006
[ 8052.108940] amdgpu 0000:01:00.0:   SRBM_STATUS=0x20020240
[ 8052.108941] amdgpu 0000:01:00.0:   SRBM_STATUS2=0x00000080
[ 8052.108943] amdgpu 0000:01:00.0:   SDMA0_STATUS_REG   = 0x76DEED57
[ 8052.108945] amdgpu 0000:01:00.0:   SDMA1_STATUS_REG   = 0x46DEED57
[ 8052.108946] amdgpu 0000:01:00.0:   CP_STAT = 0x00000000
[ 8052.108948] amdgpu 0000:01:00.0:   CP_STALLED_STAT1 = 0x00000c00
[ 8052.108949] amdgpu 0000:01:00.0:   CP_STALLED_STAT2 = 0x00000000
[ 8052.108951] amdgpu 0000:01:00.0:   CP_STALLED_STAT3 = 0x00000000
[ 8052.108953] amdgpu 0000:01:00.0:   CP_CPF_BUSY_STAT = 0x00000000
[ 8052.108954] amdgpu 0000:01:00.0:   CP_CPF_STALLED_STAT1 = 0x00000000
[ 8052.108956] amdgpu 0000:01:00.0:   CP_CPF_STATUS = 0x00000000
[ 8052.108957] amdgpu 0000:01:00.0:   CP_CPC_BUSY_STAT = 0x00000000
[ 8052.108959] amdgpu 0000:01:00.0:   CP_CPC_STALLED_STAT1 = 0x00000000
[ 8052.108961] amdgpu 0000:01:00.0:   CP_CPC_STATUS = 0x00000000
[ 8052.108962] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 8052.108964] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 8052.109078] amdgpu 0000:01:00.0: SRBM_SOFT_RESET=0x00000400
[ 8052.110233] amdgpu 0000:01:00.0:   GRBM_STATUS=0x00003028
[ 8052.110235] amdgpu 0000:01:00.0:   GRBM_STATUS2=0x00000008
[ 8052.110236] amdgpu 0000:01:00.0:   GRBM_STATUS_SE0=0x00000006
[ 8052.110238] amdgpu 0000:01:00.0:   GRBM_STATUS_SE1=0x00000006
[ 8052.110239] amdgpu 0000:01:00.0:   GRBM_STATUS_SE2=0x00000006
[ 8052.110241] amdgpu 0000:01:00.0:   GRBM_STATUS_SE3=0x00000006
[ 8052.110242] amdgpu 0000:01:00.0:   SRBM_STATUS=0x20020040
[ 8052.110244] amdgpu 0000:01:00.0:   SRBM_STATUS2=0x00000080
[ 8052.110245] amdgpu 0000:01:00.0:   SDMA0_STATUS_REG   = 0x76DEED57
[ 8052.110247] amdgpu 0000:01:00.0:   SDMA1_STATUS_REG   = 0x46DEED57
[ 8052.110248] amdgpu 0000:01:00.0:   CP_STAT = 0x00000000
[ 8052.110250] amdgpu 0000:01:00.0:   CP_STALLED_STAT1 = 0x00000c00
[ 8052.110252] amdgpu 0000:01:00.0:   CP_STALLED_STAT2 = 0x00000000
[ 8052.110253] amdgpu 0000:01:00.0:   CP_STALLED_STAT3 = 0x00000000
[ 8052.110255] amdgpu 0000:01:00.0:   CP_CPF_BUSY_STAT = 0x00000000
[ 8052.110256] amdgpu 0000:01:00.0:   CP_CPF_STALLED_STAT1 = 0x00000000
[ 8052.110258] amdgpu 0000:01:00.0:   CP_CPF_STATUS = 0x00000000
[ 8052.110259] amdgpu 0000:01:00.0:   CP_CPC_BUSY_STAT = 0x00000000
[ 8052.110261] amdgpu 0000:01:00.0:   CP_CPC_STALLED_STAT1 = 0x00000000
[ 8052.110262] amdgpu 0000:01:00.0:   CP_CPC_STATUS = 0x00000000
[ 8052.110282] amdgpu 0000:01:00.0: GPU reset succeeded, trying to resume
[ 8052.110289] [drm] probing gen 2 caps for device 1002:5a16 = 31cd02/0
[ 8052.111446] [drm] PCIE GART of 2048M enabled (table at 0x0000000000040000).
[ 8052.113940] [drm] ring test on 0 succeeded in 10 usecs
[ 8053.856277] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 1
test failed (scratch(0xC040)=0xCAFEDEAD)
[ 8054.049187] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 2
test failed (scratch(0xC040)=0xCAFEDEAD)
[ 8054.242101] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 3
test failed (scratch(0xC040)=0xCAFEDEAD)
[ 8054.435020] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 4
test failed (scratch(0xC040)=0xCAFEDEAD)
[ 8054.627925] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 5
test failed (scratch(0xC040)=0xCAFEDEAD)
[ 8054.820839] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 6
test failed (scratch(0xC040)=0xCAFEDEAD)
[ 8055.013737] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 7
test failed (scratch(0xC040)=0xCAFEDEAD)
[ 8055.206669] [drm:gfx_v8_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 8
test failed (scratch(0xC040)=0xCAFEDEAD)
[ 8055.313826] [drm:sdma_v3_0_ring_test_ring [amdgpu]] *ERROR* amdgpu: ring 9
test failed (0xCAFEDEAD)
[ 8055.319862] amdgpu 0000:01:00.0: GPU reset failed
[ 8055.320787] amdgpu 0000:01:00.0: couldn't schedule ib
[ 8055.320806] [drm:amdgpu_gem_va_ioctl [amdgpu]] *ERROR* Couldn't update BO_VA
(-22)
[ 8055.320831] amdgpu 0000:01:00.0: couldn't schedule ib
[ 8055.320841] [drm:amdgpu_gem_va_ioctl [amdgpu]] *ERROR* Couldn't update BO_VA
(-22)
[ 8055.320854] amdgpu 0000:01:00.0: couldn't schedule ib
[ 8055.320863] [drm:amdgpu_gem_va_ioctl [amdgpu]] *ERROR* Couldn't update BO_VA
(-22)


You are receiving this mail because:
  • You are the assignee for the bug.
--1437125441.b4ddac2a0.26386-- --===============1141075797== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1141075797==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Mon, 20 Jul 2015 23:25:28 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0299288782==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 003526E874 for ; Mon, 20 Jul 2015 16:25:27 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0299288782== Content-Type: multipart/alternative; boundary="1437434727.7a0Eba0.2399"; charset="UTF-8" --1437434727.7a0Eba0.2399 Date: Mon, 20 Jul 2015 23:25:27 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #4 from Andy Furniss --- I tested Xonotic with load over the weekend and it did lock after about 10 minutes - but I then tested without any memory pressure and still managed to lock, it did take longer. I got another lock ehile browsing today - there was nothing else happening at the time, but I had a few minutes earlier compiled llvm and mesa. -- You are receiving this mail because: You are the assignee for the bug. --1437434727.7a0Eba0.2399 Date: Mon, 20 Jul 2015 23:25:27 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 4 on bug 91278 from
I tested Xonotic with load over the weekend and it did lock after about 10
minutes - but I then tested without any memory pressure and still managed to
lock, it did take longer.

I got another lock ehile browsing today - there was nothing else happening at
the time, but I had a few minutes earlier compiled llvm and mesa.


You are receiving this mail because:
  • You are the assignee for the bug.
--1437434727.7a0Eba0.2399-- --===============0299288782== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0299288782==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Mon, 17 Aug 2015 19:25:51 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1927030282==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 462D47204F for ; Mon, 17 Aug 2015 12:25:51 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1927030282== Content-Type: multipart/alternative; boundary="1439839551.bD68EdDE1.18605"; charset="UTF-8" --1439839551.bD68EdDE1.18605 Date: Mon, 17 Aug 2015 19:25:51 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 Mathias Tillman changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |master.homer@gmail.com --- Comment #5 from Mathias Tillman --- Created attachment 117739 --> https://bugs.freedesktop.org/attachment.cgi?id=117739&action=edit Kernel log of hang Have been getting the same hangs, though I get it while just using the computer normally, or even while it was idle. Using Ubuntu vivid with kernel 4.2-rc7 from Ubuntu mainline with the oibaf ppa and a self-compiled xf86-video-amdgpu module. -- You are receiving this mail because: You are the assignee for the bug. --1439839551.bD68EdDE1.18605 Date: Mon, 17 Aug 2015 19:25:51 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" changed bug 91278
What Removed Added
CC   master.homer@gmail.com

Comment # 5 on bug 91278 from
Created attachment 117739 [details]
Kernel log of hang

Have been getting the same hangs, though I get it while just using the computer
normally, or even while it was idle.

Using Ubuntu vivid with kernel 4.2-rc7 from Ubuntu mainline with the oibaf ppa
and a self-compiled xf86-video-amdgpu module.


You are receiving this mail because:
  • You are the assignee for the bug.
--1439839551.bD68EdDE1.18605-- --===============1927030282== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1927030282==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Tue, 18 Aug 2015 06:41:45 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0579490008==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 457AD6EA28 for ; Mon, 17 Aug 2015 23:41:45 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0579490008== Content-Type: multipart/alternative; boundary="1439880105.fD46f731.12951"; charset="UTF-8" --1439880105.fD46f731.12951 Date: Tue, 18 Aug 2015 06:41:45 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable https://bugs.freedesktop.org/show_bug.cgi?id=3D91278 --- Comment #6 from Michel D=C3=A4nzer --- (In reply to Mathias Tillman from comment #5) > Have been getting the same hangs, though I get it while just using the > computer normally, or even while it was idle. The symptoms may be similar, but since the circumstances differ, please file your own report. --=20 You are receiving this mail because: You are the assignee for the bug. --1439880105.fD46f731.12951 Date: Tue, 18 Aug 2015 06:41:45 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable

Comment= # 6 on bug 91278<= /a> from Michel D=C3=A4nzer
(In reply to Mathias Tillman from comment #5)
> Have been getting the same hangs, though I get i=
t while just using the
> computer normally, or even while it was idle.

The symptoms may be similar, but since the circumstances differ, please file
your own report.


You are receiving this mail because: =20=20=20=20=20=20
  • You are the assignee for the bug.
--1439880105.fD46f731.12951-- --===============0579490008== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0579490008==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Fri, 28 Aug 2015 10:45:34 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0239636779==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 392436E0D2 for ; Fri, 28 Aug 2015 03:45:34 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0239636779== Content-Type: multipart/alternative; boundary="1440758734.A5Bf2D0.19284"; charset="UTF-8" --1440758734.A5Bf2D0.19284 Date: Fri, 28 Aug 2015 10:45:34 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #7 from Andy Furniss --- Created attachment 117964 --> https://bugs.freedesktop.org/attachment.cgi?id=117964&action=edit hung tast with current agd5f drm-next-4.3 Valley does sometimes get further with newer gits - I have recently got all the way through the scenes. It does still lock though. Attached is a hung task trace with current agd5f drm-next-4.3, libdrm, mesa and a recentish llvm. -- You are receiving this mail because: You are the assignee for the bug. --1440758734.A5Bf2D0.19284 Date: Fri, 28 Aug 2015 10:45:34 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 7 on bug 91278 from
Created attachment 117964 [details]
hung tast with current agd5f drm-next-4.3

Valley does sometimes get further with newer gits - I have recently got all the
way through the scenes. It does still lock though. 

Attached is a hung task trace with current agd5f drm-next-4.3, libdrm, mesa and
a recentish llvm.


You are receiving this mail because:
  • You are the assignee for the bug.
--1440758734.A5Bf2D0.19284-- --===============0239636779== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0239636779==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Thu, 24 Sep 2015 15:43:44 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============2147111659==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 068567A112 for ; Thu, 24 Sep 2015 08:43:45 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============2147111659== Content-Type: multipart/alternative; boundary="1443109424.387373.30128"; charset="UTF-8" --1443109424.387373.30128 Date: Thu, 24 Sep 2015 15:43:44 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 Alex Deucher changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |edward.ocallaghan@koparo.co | |m --- Comment #8 from Alex Deucher --- *** Bug 92087 has been marked as a duplicate of this bug. *** -- You are receiving this mail because: You are the assignee for the bug. --1443109424.387373.30128 Date: Thu, 24 Sep 2015 15:43:44 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" changed bug 91278
What Removed Added
CC   edward.ocallaghan@koparo.com

Comment # 8 on bug 91278 from
*** Bug 92087 has been marked as a duplicate of this bug. ***


You are receiving this mail because:
  • You are the assignee for the bug.
--1443109424.387373.30128-- --===============2147111659== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============2147111659==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Fri, 25 Sep 2015 17:30:51 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1790747768==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 797256E398 for ; Fri, 25 Sep 2015 10:30:51 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1790747768== Content-Type: multipart/alternative; boundary="1443202251.6B0DDD1.7048"; charset="UTF-8" --1443202251.6B0DDD1.7048 Date: Fri, 25 Sep 2015 17:30:51 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #9 from Mathias Tillman --- Created attachment 118448 --> https://bugs.freedesktop.org/attachment.cgi?id=118448&action=edit apitrace of hang Not exactly sure how useful it is, but I have attached an excerpt of an apitrace of the unigine valley demo. I had to cut out most of it, due to the size of it - I've run it several times, and the size has always been >500MB. The last thing in the trace was always glXSwapBuffers, so the excerpt consists of the contents between the last and the next to last glXSwapBuffers lines. -- You are receiving this mail because: You are the assignee for the bug. --1443202251.6B0DDD1.7048 Date: Fri, 25 Sep 2015 17:30:51 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 9 on bug 91278 from
Created attachment 118448 [details]
apitrace of hang

Not exactly sure how useful it is, but I have attached an excerpt of an
apitrace of the unigine valley demo. I had to cut out most of it, due to the
size of it - I've run it several times, and the size has always been >500MB.
The last thing in the trace was always glXSwapBuffers, so the excerpt consists
of the contents between the last and the next to last glXSwapBuffers lines.


You are receiving this mail because:
  • You are the assignee for the bug.
--1443202251.6B0DDD1.7048-- --===============1790747768== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1790747768==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Fri, 25 Sep 2015 21:52:45 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1642815958==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 5154C6E473 for ; Fri, 25 Sep 2015 14:52:45 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1642815958== Content-Type: multipart/alternative; boundary="1443217965.16ca31.15064"; charset="UTF-8" --1443217965.16ca31.15064 Date: Fri, 25 Sep 2015 21:52:45 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable https://bugs.freedesktop.org/show_bug.cgi?id=3D91278 --- Comment #10 from Michel D=C3=A4nzer --- Mathias, so you can reliably reproduce the hang by replaying that apitrace? --=20 You are receiving this mail because: You are the assignee for the bug. --1443217965.16ca31.15064 Date: Fri, 25 Sep 2015 21:52:45 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable

Commen= t # 10 on bug 91278<= /a> from Michel D=C3=A4nzer
Mathias, so you can reliably reproduce the hang by replaying t=
hat apitrace?


You are receiving this mail because: =20=20=20=20=20=20
  • You are the assignee for the bug.
--1443217965.16ca31.15064-- --===============1642815958== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1642815958==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Mon, 28 Sep 2015 09:08:52 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0619419631==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 10A946E3C7 for ; Mon, 28 Sep 2015 02:08:52 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0619419631== Content-Type: multipart/alternative; boundary="1443431332.B86ce1.9139"; charset="UTF-8" --1443431332.B86ce1.9139 Date: Mon, 28 Sep 2015 09:08:52 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable https://bugs.freedesktop.org/show_bug.cgi?id=3D91278 --- Comment #11 from Mathias Tillman --- (In reply to Michel D=C3=A4nzer from comment #10) > Mathias, so you can reliably reproduce the hang by replaying that apitrac= e? Unfortunately I haven't been able to replay the apitrace properly - I just = get a black screen with a bunch of errors in the output about not supporting GL= SL 1.50, program not supported etc. I will keep trying though. --=20 You are receiving this mail because: You are the assignee for the bug. --1443431332.B86ce1.9139 Date: Mon, 28 Sep 2015 09:08:52 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable

Commen= t # 11 on bug 91278<= /a> from Mathias Tillman
(In reply to Michel D=C3=A4nzer from comment #10)
> Mathias, so you can reliably reproduce the hang =
by replaying that apitrace?

Unfortunately I haven't been able to replay the apitrace properly - I just =
get
a black screen with a bunch of errors in the output about not supporting GL=
SL
1.50, program not supported etc.
I will keep trying though.


You are receiving this mail because: =20=20=20=20=20=20
  • You are the assignee for the bug.
--1443431332.B86ce1.9139-- --===============0619419631== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0619419631==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Mon, 28 Sep 2015 15:35:38 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1499115749==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id B342F6EA04 for ; Mon, 28 Sep 2015 08:35:38 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1499115749== Content-Type: multipart/alternative; boundary="1443454538.e8371.14002"; charset="UTF-8" --1443454538.e8371.14002 Date: Mon, 28 Sep 2015 15:35:38 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable https://bugs.freedesktop.org/show_bug.cgi?id=3D91278 --- Comment #12 from Mathias Tillman --- (In reply to Michel D=C3=A4nzer from comment #10) > Mathias, so you can reliably reproduce the hang by replaying that apitrac= e? Okay, I have been able to replay the trace by compiling apitrace from sourc= e, and renaming glretrace to valley_x64. I can reproduce the hang by replaying, but not in a very useful way as the = hang happens on different frames on each replay, so I wouldn't really call it reproducible at this point. I will see what different options glretrace gives me, to see if I can find = some kind of common denominator between the hangs. --=20 You are receiving this mail because: You are the assignee for the bug. --1443454538.e8371.14002 Date: Mon, 28 Sep 2015 15:35:38 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable

Commen= t # 12 on bug 91278<= /a> from Mathias Tillman
(In reply to Michel D=C3=A4nzer from comment #10)
> Mathias, so you can reliably reproduce the hang =
by replaying that apitrace?

Okay, I have been able to replay the trace by compiling apitrace from sourc=
e,
and renaming glretrace to valley_x64.
I can reproduce the hang by replaying, but not in a very useful way as the =
hang
happens on different frames on each replay, so I wouldn't really call it
reproducible at this point.
I will see what different options glretrace gives me, to see if I can find =
some
kind of common denominator between the hangs.


You are receiving this mail because: =20=20=20=20=20=20
  • You are the assignee for the bug.
--1443454538.e8371.14002-- --===============1499115749== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1499115749==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Mon, 28 Sep 2015 21:27:45 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1130345211==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 4F9236E412 for ; Mon, 28 Sep 2015 14:27:45 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1130345211== Content-Type: multipart/alternative; boundary="1443475665.DF5328D1.30512"; charset="UTF-8" --1443475665.DF5328D1.30512 Date: Mon, 28 Sep 2015 21:27:45 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #13 from Mathias Tillman --- Sorry for triple posting, but I have some more info that may or may not be useful. When enabling verbose output from glretrace, I can see that the next to last operation before a hang is always glBindBuffer (I've run it 10 times) even though the rest of the output is very different. This, however, only happens when double buffer visuals is enabled, if I disable it using --sb the output is the same as above, ending with glXSwapBuffers. Not sure if this is a coincidence or not, but it seemed interesting to me. -- You are receiving this mail because: You are the assignee for the bug. --1443475665.DF5328D1.30512 Date: Mon, 28 Sep 2015 21:27:45 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 13 on bug 91278 from
Sorry for triple posting, but I have some more info that may or may not be
useful.
When enabling verbose output from glretrace, I can see that the next to last
operation before a hang is always glBindBuffer (I've run it 10 times) even
though the rest of the output is very different. This, however, only happens
when double buffer visuals is enabled, if I disable it using --sb the output is
the same as above, ending with glXSwapBuffers.
Not sure if this is a coincidence or not, but it seemed interesting to me.


You are receiving this mail because:
  • You are the assignee for the bug.
--1443475665.DF5328D1.30512-- --===============1130345211== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1130345211==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Tue, 29 Sep 2015 10:32:21 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0519956768==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id D60836E570 for ; Tue, 29 Sep 2015 03:32:20 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0519956768== Content-Type: multipart/alternative; boundary="1443522740.cA4c7Ac0.31466"; charset="UTF-8" --1443522740.cA4c7Ac0.31466 Date: Tue, 29 Sep 2015 10:32:20 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #14 from Andy Furniss --- FWIW I don't think the Valley code/shaders/whatever its self triggers this. I can run valley for > an hour depending on luck/state of my box. One way to get a long run for me is to go into mem sleep, come out and while nothing else is running run valley. Based on only a few runs like this, I haven't locked it yet. Randomly starting valley after my PC has been in use all day may lock withing 10 seconds. I am not saying mem sleep cures all locks, just seems to make it hard. After running valley for an hour one time, I mover onto Unreal elemental, several runs, no lock. I then tried Unreal Atlantis and eventually got it to hang with that, though it ran through the scripted bit and I had to start flying around interactively before the hang. Unreal Atlantis is a bit different/annoying. Different in that it requests more vram than I have and annoying as it makes a 1.8 gig cache file under $HOME/.config. This mem sleep observation may be luck. I also haven't tested some variants yet like with vblank_mode=0 or cpufreq_ondemand settings. I run valley all maxed fullscreen - so yet more variables verses what others may be running it with. -- You are receiving this mail because: You are the assignee for the bug. --1443522740.cA4c7Ac0.31466 Date: Tue, 29 Sep 2015 10:32:20 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 14 on bug 91278 from
FWIW I don't think the Valley code/shaders/whatever its self triggers this.

I can run valley for > an hour depending on luck/state of my box.

One way to get a long run for me is to go into mem sleep, come out and while
nothing else is running run valley. Based on only a few runs like this, I
haven't locked it yet. Randomly starting valley after my PC has been in use all
day may lock withing 10 seconds.

I am not saying mem sleep cures all locks, just seems to make it hard. After
running valley for an hour one time, I mover onto Unreal elemental, several
runs, no lock. I then tried Unreal Atlantis and eventually got it to hang with
that, though it ran through the scripted bit and I had to start flying around
interactively before the hang.

Unreal Atlantis is a bit different/annoying. Different in that it requests more
vram than I have and annoying as it makes a 1.8 gig cache file under
$HOME/.config.

This mem sleep observation may be luck. I also haven't tested some variants yet
like with vblank_mode=0 or cpufreq_ondemand settings. I run valley all maxed
fullscreen - so yet more variables verses what others may be running it with.


You are receiving this mail because:
  • You are the assignee for the bug.
--1443522740.cA4c7Ac0.31466-- --===============0519956768== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0519956768==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Tue, 29 Sep 2015 15:47:26 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1224162349==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 7A1016EB0C for ; Tue, 29 Sep 2015 08:47:26 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1224162349== Content-Type: multipart/alternative; boundary="1443541646.10f6Be111.1058"; charset="UTF-8" --1443541646.10f6Be111.1058 Date: Tue, 29 Sep 2015 15:47:26 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #15 from Alex Deucher --- These patches may help: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20150928/302380.html http://lists.freedesktop.org/archives/mesa-dev/2015-September/095718.html -- You are receiving this mail because: You are the assignee for the bug. --1443541646.10f6Be111.1058 Date: Tue, 29 Sep 2015 15:47:26 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"


You are receiving this mail because:
  • You are the assignee for the bug.
--1443541646.10f6Be111.1058-- --===============1224162349== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1224162349==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Tue, 29 Sep 2015 20:33:30 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0667098820==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 0BFA16E128 for ; Tue, 29 Sep 2015 13:33:30 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0667098820== Content-Type: multipart/alternative; boundary="1443558809.F70EC81.7623"; charset="UTF-8" --1443558809.F70EC81.7623 Date: Tue, 29 Sep 2015 20:33:29 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #16 from Mathias Tillman --- (In reply to Alex Deucher from comment #15) > These patches may help: > http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20150928/302380.html > http://lists.freedesktop.org/archives/mesa-dev/2015-September/095718.html Afraid not, applied both of them and it still hangs. One interesting thing is that I was using mesa from the oibaf ppa which compiles against llvm 3.6. While using that I haven't been able to replay my apitrace of valley once - it always hangs before it finishes. However, I compiled mesa against llvm 3.7 (one compiled from source with the patch, and one from llvm's apt repository) and 3.8, and it gets much further now - I've been able to replay the trace three times without a hang, though it does ultimately hang unfortunately. -- You are receiving this mail because: You are the assignee for the bug. --1443558809.F70EC81.7623 Date: Tue, 29 Sep 2015 20:33:29 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 16 on bug 91278 from
(In reply to Alex Deucher from comment #15)
> These patches may help:
> http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20150928/302380.html
> http://lists.freedesktop.org/archives/mesa-dev/2015-September/095718.html
Afraid not, applied both of them and it still hangs. One interesting thing is
that I was using mesa from the oibaf ppa which compiles against llvm 3.6. While
using that I haven't been able to replay my apitrace of valley once - it always
hangs before it finishes. However, I compiled mesa against llvm 3.7 (one
compiled from source with the patch, and one from llvm's apt repository) and
3.8, and it gets much further now - I've been able to replay the trace three
times without a hang, though it does ultimately hang unfortunately.


You are receiving this mail because:
  • You are the assignee for the bug.
--1443558809.F70EC81.7623-- --===============0667098820== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0667098820==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Wed, 30 Sep 2015 09:38:20 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0198230952==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 5E4E26E3B7 for ; Wed, 30 Sep 2015 02:38:20 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0198230952== Content-Type: multipart/alternative; boundary="1443605900.6fB0ec0.17026"; charset="UTF-8" --1443605900.6fB0ec0.17026 Date: Wed, 30 Sep 2015 09:38:20 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #17 from Andy Furniss --- Haven't had time yet to hang with the patches. Yesterday without I them I hung, rebooted, did the memsleep, then tested the rest of the day trying to lock valley and unreal but couldn't. For the whole day, the only logging I got was a few hundred - Sep 29 18:10:47 ph4 kernel: VM fault (0x04, vmid 4) at page 1529213, read from 'TC6' (0x54433600) (72) Sep 29 18:10:49 ph4 kernel: amdgpu 0000:01:00.0: GPU fault detected: 146 0x0be84804 Sep 29 18:10:49 ph4 kernel: amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0017557D Sep 29 18:10:49 ph4 kernel: amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x08048004 Sep 29 18:10:49 ph4 kernel: VM fault (0x04, vmid 4) at page 1529213, read from 'TC6' (0x54433600) (72) Last thing I applied the patches to couple of days old llvm and mesa gits. This morning ran valley from power off boot after a bit of browsing/mail (yesterday this hung). Only a quick test which I stopped, looked OK but in dmesg I have >10k of - [ 1792.292640] amdgpu 0000:01:00.0: GPU fault detected: 146 0x0918c404 [ 1792.292643] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00136F23 [ 1792.292644] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x060C4004 [ 1792.292646] VM fault (0x04, vmid 3) at page 1273635, read from 'TC4' (0x54433400) (196) [ 1792.292650] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09184404 [ 1792.292651] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000 [ 1792.292652] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000 [ 1792.292654] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0) [ 1792.292658] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09188404 [ 1792.292659] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000 [ 1792.292660] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000 [ 1792.292661] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0) [ 1792.292666] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09180404 [ 1792.292667] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000 [ 1792.292668] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000 [ 1792.292669] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0) [ 1792.375515] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09188404 [ 1792.375518] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00136F23 [ 1792.375519] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x06084004 [ 1792.375521] VM fault (0x04, vmid 3) at page 1273635, read from 'TC10' (0x54433130) (132) [ 1792.375526] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09184404 [ 1792.375527] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000 [ 1792.375528] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000 [ 1792.375530] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0) [ 1792.375534] amdgpu 0000:01:00.0: GPU fault detected: 146 0x0918c404 [ 1792.375535] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000 [ 1792.375536] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000 [ 1792.375538] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0) [ 1792.375542] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09180404 [ 1792.375543] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000 [ 1792.375544] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000 [ 1792.375546] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0) [ 1792.432272] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09184404 [ 1792.432276] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00136F23 [ 1792.432277] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x06044004 [ 1792.432280] VM fault (0x04, vmid 3) at page 1273635, read from 'TC7' (0x54433700) (68) -- You are receiving this mail because: You are the assignee for the bug. --1443605900.6fB0ec0.17026 Date: Wed, 30 Sep 2015 09:38:20 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 17 on bug 91278 from
Haven't had time yet to hang with the patches.

Yesterday without I them I hung, rebooted, did the memsleep, then tested the
rest of the day trying to lock valley and unreal but couldn't. For the whole
day, the only logging I got was a few hundred -

Sep 29 18:10:47 ph4 kernel: VM fault (0x04, vmid 4) at page 1529213, read from
'TC6' (0x54433600) (72)
Sep 29 18:10:49 ph4 kernel: amdgpu 0000:01:00.0: GPU fault detected: 146
0x0be84804
Sep 29 18:10:49 ph4 kernel: amdgpu 0000:01:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x0017557D
Sep 29 18:10:49 ph4 kernel: amdgpu 0000:01:00.0:  
VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x08048004
Sep 29 18:10:49 ph4 kernel: VM fault (0x04, vmid 4) at page 1529213, read from
'TC6' (0x54433600) (72)

Last thing I applied the patches to couple of days old llvm and mesa gits.

This morning ran valley from power off boot after a bit of browsing/mail
(yesterday this hung).

Only a quick test which I stopped, looked OK but in dmesg I have >10k of -

[ 1792.292640] amdgpu 0000:01:00.0: GPU fault detected: 146 0x0918c404
[ 1792.292643] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00136F23
[ 1792.292644] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x060C4004
[ 1792.292646] VM fault (0x04, vmid 3) at page 1273635, read from 'TC4'
(0x54433400) (196)
[ 1792.292650] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09184404
[ 1792.292651] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 1792.292652] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.292654] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.292658] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09188404
[ 1792.292659] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 1792.292660] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.292661] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.292666] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09180404
[ 1792.292667] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 1792.292668] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.292669] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.375515] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09188404
[ 1792.375518] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00136F23
[ 1792.375519] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x06084004
[ 1792.375521] VM fault (0x04, vmid 3) at page 1273635, read from 'TC10'
(0x54433130) (132)
[ 1792.375526] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09184404
[ 1792.375527] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 1792.375528] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.375530] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.375534] amdgpu 0000:01:00.0: GPU fault detected: 146 0x0918c404
[ 1792.375535] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 1792.375536] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.375538] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.375542] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09180404
[ 1792.375543] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 1792.375544] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1792.375546] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1792.432272] amdgpu 0000:01:00.0: GPU fault detected: 146 0x09184404
[ 1792.432276] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00136F23
[ 1792.432277] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x06044004
[ 1792.432280] VM fault (0x04, vmid 3) at page 1273635, read from 'TC7'
(0x54433700) (68)


You are receiving this mail because:
  • You are the assignee for the bug.
--1443605900.6fB0ec0.17026-- --===============0198230952== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0198230952==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Wed, 30 Sep 2015 10:08:57 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0874845094==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id E81916E86F for ; Wed, 30 Sep 2015 03:08:56 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0874845094== Content-Type: multipart/alternative; boundary="1443607736.62D0A7C0.32535"; charset="UTF-8" --1443607736.62D0A7C0.32535 Date: Wed, 30 Sep 2015 10:08:56 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #18 from Andy Furniss --- So after last update I ran valley again briefly and saw a few vmfaults, then a longer run and got thousands again. without touching anything else did echo mem >/sys/power/state and then woke up. 10 minute run of valley has produced zero faults. -- You are receiving this mail because: You are the assignee for the bug. --1443607736.62D0A7C0.32535 Date: Wed, 30 Sep 2015 10:08:56 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 18 on bug 91278 from
So after last update I ran valley again briefly and saw a few vmfaults, then a
longer run and got thousands again.

without touching anything else did echo mem >/sys/power/state and then woke up.

10 minute run of valley has produced zero faults.


You are receiving this mail because:
  • You are the assignee for the bug.
--1443607736.62D0A7C0.32535-- --===============0874845094== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0874845094==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Wed, 30 Sep 2015 19:59:53 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0049498727==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 9E7B26E8C5 for ; Wed, 30 Sep 2015 12:59:53 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0049498727== Content-Type: multipart/alternative; boundary="1443643193.D5Cebf0.24469"; charset="UTF-8" --1443643193.D5Cebf0.24469 Date: Wed, 30 Sep 2015 19:59:53 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #19 from Andy Furniss --- (In reply to Andy Furniss from comment #18) > So after last update I ran valley again briefly and saw a few vmfaults, then > a longer run and got thousands again. > > without touching anything else did echo mem >/sys/power/state and then woke > up. > > 10 minute run of valley has produced zero faults. Further test from power off, nothing else running apart from X/fluxox short run of valley no faults. Reran valley for a bit longer and got thousands. Did memsleep ran valley no faults but after about 10 minutes it hung. -- You are receiving this mail because: You are the assignee for the bug. --1443643193.D5Cebf0.24469 Date: Wed, 30 Sep 2015 19:59:53 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 19 on bug 91278 from
(In reply to Andy Furniss from comment #18)
> So after last update I ran valley again briefly and saw a few vmfaults, then
> a longer run and got thousands again.
> 
> without touching anything else did echo mem >/sys/power/state and then woke
> up.
> 
> 10 minute run of valley has produced zero faults.

Further test from power off, nothing else running apart from X/fluxox short run
of valley no faults. Reran valley for a bit longer and got thousands. Did
memsleep ran valley no faults but after about 10 minutes it hung.


You are receiving this mail because:
  • You are the assignee for the bug.
--1443643193.D5Cebf0.24469-- --===============0049498727== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0049498727==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Wed, 30 Sep 2015 20:42:20 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1573068851==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 070246E8D8 for ; Wed, 30 Sep 2015 13:42:20 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1573068851== Content-Type: multipart/alternative; boundary="1443645739.84DC01.16527"; charset="UTF-8" --1443645739.84DC01.16527 Date: Wed, 30 Sep 2015 20:42:19 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #20 from Mathias Tillman --- (In reply to Andy Furniss from comment #19) > (In reply to Andy Furniss from comment #18) > > So after last update I ran valley again briefly and saw a few vmfaults, then > > a longer run and got thousands again. > > > > without touching anything else did echo mem >/sys/power/state and then woke > > up. > > > > 10 minute run of valley has produced zero faults. > > Further test from power off, nothing else running apart from X/fluxox short > run of valley no faults. Reran valley for a bit longer and got thousands. > Did memsleep ran valley no faults but after about 10 minutes it hung. Do you get those GPU faults in the log even when there's no hang? I haven't checked dmesg while running valley myself, but I do know they always appear when a hang has happened (I'm using ssh to grab dmesg while it's hung). Dmesg is sometimes completely filled with GPU faults, other times it's just a few. I ran it a few minutes ago and only got this: [ 1737.984328] amdgpu 0000:01:00.0: GPU fault detected: 146 0x08804804 [ 1737.984338] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00100110 [ 1737.984343] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0A048004 [ 1737.984348] VM fault (0x04, vmid 5) at page 1048848, read from 'TC6' (0x54433600) (72) [ 1737.984355] amdgpu 0000:01:00.0: GPU fault detected: 146 0x08804004 [ 1737.984359] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000 [ 1737.984363] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000 [ 1737.984366] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0) [ 1737.984374] amdgpu 0000:01:00.0: GPU fault detected: 146 0x08800804 [ 1737.984378] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000 [ 1737.984381] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000 [ 1737.984384] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0) -- You are receiving this mail because: You are the assignee for the bug. --1443645739.84DC01.16527 Date: Wed, 30 Sep 2015 20:42:19 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 20 on bug 91278 from
(In reply to Andy Furniss from comment #19)
> (In reply to Andy Furniss from comment #18)
> > So after last update I ran valley again briefly and saw a few vmfaults, then
> > a longer run and got thousands again.
> > 
> > without touching anything else did echo mem >/sys/power/state and then woke
> > up.
> > 
> > 10 minute run of valley has produced zero faults.
> 
> Further test from power off, nothing else running apart from X/fluxox short
> run of valley no faults. Reran valley for a bit longer and got thousands.
> Did memsleep ran valley no faults but after about 10 minutes it hung.

Do you get those GPU faults in the log even when there's no hang? I haven't
checked dmesg while running valley myself, but I do know they always appear
when a hang has happened (I'm using ssh to grab dmesg while it's hung).

Dmesg is sometimes completely filled with GPU faults, other times it's just a
few. I ran it a few minutes ago and only got this:

[ 1737.984328] amdgpu 0000:01:00.0: GPU fault detected: 146 0x08804804
[ 1737.984338] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00100110
[ 1737.984343] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x0A048004
[ 1737.984348] VM fault (0x04, vmid 5) at page 1048848, read from 'TC6'
(0x54433600) (72)
[ 1737.984355] amdgpu 0000:01:00.0: GPU fault detected: 146 0x08804004
[ 1737.984359] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 1737.984363] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1737.984366] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)
[ 1737.984374] amdgpu 0000:01:00.0: GPU fault detected: 146 0x08800804
[ 1737.984378] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x00000000
[ 1737.984381] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x00000000
[ 1737.984384] VM fault (0x00, vmid 0) at page 0, read from '' (0x00000000) (0)


You are receiving this mail because:
  • You are the assignee for the bug.
--1443645739.84DC01.16527-- --===============1573068851== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1573068851==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Wed, 30 Sep 2015 21:15:08 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0847955190==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 075A472180 for ; Wed, 30 Sep 2015 14:15:08 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0847955190== Content-Type: multipart/alternative; boundary="1443647707.A2cA260A0.791"; charset="UTF-8" --1443647707.A2cA260A0.791 Date: Wed, 30 Sep 2015 21:15:07 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #21 from Andy Furniss --- (In reply to Mathias Tillman from comment #20) > Do you get those GPU faults in the log even when there's no hang? Yes and I can also hang without getting any. -- You are receiving this mail because: You are the assignee for the bug. --1443647707.A2cA260A0.791 Date: Wed, 30 Sep 2015 21:15:07 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 21 on bug 91278 from
(In reply to Mathias Tillman from comment #20)

> Do you get those GPU faults in the log even when there's no hang?

Yes and I can also hang without getting any.


You are receiving this mail because:
  • You are the assignee for the bug.
--1443647707.A2cA260A0.791-- --===============0847955190== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0847955190==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Wed, 30 Sep 2015 21:19:52 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0318733290==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id A03436EC4F for ; Wed, 30 Sep 2015 14:19:52 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0318733290== Content-Type: multipart/alternative; boundary="1443647992.6C001.3251"; charset="UTF-8" --1443647992.6C001.3251 Date: Wed, 30 Sep 2015 21:19:52 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #22 from Mathias Tillman --- (In reply to Andy Furniss from comment #21) > (In reply to Mathias Tillman from comment #20) > > > Do you get those GPU faults in the log even when there's no hang? > > Yes and I can also hang without getting any. Actually, I think I've seen hangs without the GPU faults too now that I think about it. Makes you wonder if the GPU faults are related at all, or if this is something else entirely. -- You are receiving this mail because: You are the assignee for the bug. --1443647992.6C001.3251 Date: Wed, 30 Sep 2015 21:19:52 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 22 on bug 91278 from
(In reply to Andy Furniss from comment #21)
> (In reply to Mathias Tillman from comment #20)
> 
> > Do you get those GPU faults in the log even when there's no hang?
> 
> Yes and I can also hang without getting any.

Actually, I think I've seen hangs without the GPU faults too now that I think
about it. Makes you wonder if the GPU faults are related at all, or if this is
something else entirely.


You are receiving this mail because:
  • You are the assignee for the bug.
--1443647992.6C001.3251-- --===============0318733290== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0318733290==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Wed, 30 Sep 2015 21:51:07 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1637576438==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 0CBD86EC52 for ; Wed, 30 Sep 2015 14:51:07 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1637576438== Content-Type: multipart/alternative; boundary="1443649866.fefcd0.20731"; charset="UTF-8" --1443649866.fefcd0.20731 Date: Wed, 30 Sep 2015 21:51:06 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #23 from Andy Furniss --- (In reply to Mathias Tillman from comment #22) > (In reply to Andy Furniss from comment #21) > > (In reply to Mathias Tillman from comment #20) > > > > > Do you get those GPU faults in the log even when there's no hang? > > > > Yes and I can also hang without getting any. > > Actually, I think I've seen hangs without the GPU faults too now that I > think about it. Makes you wonder if the GPU faults are related at all, or if > this is something else entirely. Yea, could be unrelated. I haven't seen thousands until Today so that aspect could be to do with the patches. -- You are receiving this mail because: You are the assignee for the bug. --1443649866.fefcd0.20731 Date: Wed, 30 Sep 2015 21:51:06 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 23 on bug 91278 from
(In reply to Mathias Tillman from comment #22)
> (In reply to Andy Furniss from comment #21)
> > (In reply to Mathias Tillman from comment #20)
> > 
> > > Do you get those GPU faults in the log even when there's no hang?
> > 
> > Yes and I can also hang without getting any.
> 
> Actually, I think I've seen hangs without the GPU faults too now that I
> think about it. Makes you wonder if the GPU faults are related at all, or if
> this is something else entirely.

Yea, could be unrelated.

I haven't seen thousands until Today so that aspect could be to do with the
patches.


You are receiving this mail because:
  • You are the assignee for the bug.
--1443649866.fefcd0.20731-- --===============1637576438== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1637576438==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Wed, 30 Sep 2015 21:55:38 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0914351150==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 3B6BF6E3EF for ; Wed, 30 Sep 2015 14:55:38 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0914351150== Content-Type: multipart/alternative; boundary="1443650138.58080.23274"; charset="UTF-8" --1443650138.58080.23274 Date: Wed, 30 Sep 2015 21:55:38 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #24 from Andy Furniss --- (In reply to Andy Furniss from comment #23) > I haven't seen thousands until Today so that aspect could be to do with the > patches. Ignore that, more grepping of kernel log does show I have got thousands before today. -- You are receiving this mail because: You are the assignee for the bug. --1443650138.58080.23274 Date: Wed, 30 Sep 2015 21:55:38 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 24 on bug 91278 from
(In reply to Andy Furniss from comment #23)

> I haven't seen thousands until Today so that aspect could be to do with the
> patches.

Ignore that, more grepping of kernel log does show I have got thousands before
today.


You are receiving this mail because:
  • You are the assignee for the bug.
--1443650138.58080.23274-- --===============0914351150== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0914351150==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Sun, 04 Oct 2015 20:34:21 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1990365474==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 612F66E1E2 for ; Sun, 4 Oct 2015 13:34:21 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1990365474== Content-Type: multipart/alternative; boundary="1443990861.e3D4f0.24540"; charset="UTF-8" --1443990861.e3D4f0.24540 Date: Sun, 4 Oct 2015 20:34:21 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #25 from Andy Furniss --- (In reply to Andy Furniss from comment #24) > (In reply to Andy Furniss from comment #23) > > > I haven't seen thousands until Today so that aspect could be to do with the > > patches. > > Ignore that, more grepping of kernel log does show I have got thousands > before today. Haven't had time to test thoroughly, but I see the latest updated agd5f fixes has a commit to reduce vm faults and I haven't got thousands since changing to that. I also see a new R600_DEBUG=check_vm in mesa. I don't know what, if any, extra info is expected from that, but testing valley with it caused it to quit when it hit a fault after a about minute of running - Detected a VM fault, exiting... in dmesg - [ 261.017278] amdgpu 0000:01:00.0: GPU fault detected: 146 0x01e84804 [ 261.017290] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0015703D [ 261.017296] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x06048004 [ 261.017302] VM fault (0x04, vmid 3) at page 1404989, read from 'TC6' (0x54433600) (72) -- You are receiving this mail because: You are the assignee for the bug. --1443990861.e3D4f0.24540 Date: Sun, 4 Oct 2015 20:34:21 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 25 on bug 91278 from
(In reply to Andy Furniss from comment #24)
> (In reply to Andy Furniss from comment #23)
> 
> > I haven't seen thousands until Today so that aspect could be to do with the
> > patches.
> 
> Ignore that, more grepping of kernel log does show I have got thousands
> before today.

Haven't had time to test thoroughly, but I see the latest updated agd5f fixes
has a commit to reduce vm faults and I haven't got thousands since changing to
that.

I also see a new R600_DEBUG=check_vm in mesa. I don't know what, if any, extra
info is expected from that, but testing valley with it caused it to quit when
it hit a fault after a about minute of running -

Detected a VM fault, exiting...

in dmesg -

[  261.017278] amdgpu 0000:01:00.0: GPU fault detected: 146 0x01e84804
[  261.017290] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR  
0x0015703D
[  261.017296] amdgpu 0000:01:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS
0x06048004
[  261.017302] VM fault (0x04, vmid 3) at page 1404989, read from 'TC6'
(0x54433600) (72)


You are receiving this mail because:
  • You are the assignee for the bug.
--1443990861.e3D4f0.24540-- --===============1990365474== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1990365474==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Mon, 05 Oct 2015 06:23:35 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============2084105078==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id F1AFD6E560 for ; Sun, 4 Oct 2015 23:23:34 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============2084105078== Content-Type: multipart/alternative; boundary="1444026214.1F8A1.12544"; charset="UTF-8" --1444026214.1F8A1.12544 Date: Mon, 5 Oct 2015 06:23:34 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable https://bugs.freedesktop.org/show_bug.cgi?id=3D91278 --- Comment #26 from Michel D=C3=A4nzer --- (In reply to Andy Furniss from comment #25) > I also see a new R600_DEBUG=3Dcheck_vm in mesa. I don't know what, if any, > extra info is expected from that, but testing valley with it caused it to > quit when it hit a fault after a about minute of running - It should generate a file in ~/ddebug_dumps/ with more information about th= e VM fault. --=20 You are receiving this mail because: You are the assignee for the bug. --1444026214.1F8A1.12544 Date: Mon, 5 Oct 2015 06:23:34 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable

Commen= t # 26 on bug 91278<= /a> from Michel D=C3=A4nzer
(In reply to Andy Furniss from comment #25)
> I also see a new R600_DEBUG=3Dcheck_vm in mesa. =
I don't know what, if any,
> extra info is expected from that, but testing valley with it caused it=
 to
> quit when it hit a fault after a about minute of running -

It should generate a file in ~/ddebug_dumps/ with more information about th=
e VM
fault.


You are receiving this mail because: =20=20=20=20=20=20
  • You are the assignee for the bug.
--1444026214.1F8A1.12544-- --===============2084105078== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============2084105078==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Mon, 05 Oct 2015 08:54:37 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0645989308==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id CC7586E311 for ; Mon, 5 Oct 2015 01:54:37 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0645989308== Content-Type: multipart/alternative; boundary="1444035277.6471B0.6885"; charset="UTF-8" --1444035277.6471B0.6885 Date: Mon, 5 Oct 2015 08:54:37 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #27 from Andy Furniss --- Created attachment 118665 --> https://bugs.freedesktop.org/attachment.cgi?id=118665&action=edit valley vm fault dump Aha, here it is. -- You are receiving this mail because: You are the assignee for the bug. --1444035277.6471B0.6885 Date: Mon, 5 Oct 2015 08:54:37 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 27 on bug 91278 from
Created attachment 118665 [details]
valley vm fault dump

Aha, here it is.


You are receiving this mail because:
  • You are the assignee for the bug.
--1444035277.6471B0.6885-- --===============0645989308== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0645989308==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Mon, 05 Oct 2015 22:51:56 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0183431337==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 72C8A6E022 for ; Mon, 5 Oct 2015 15:51:56 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0183431337== Content-Type: multipart/alternative; boundary="1444085516.4Bf58dc1.14765"; charset="UTF-8" --1444085516.4Bf58dc1.14765 Date: Mon, 5 Oct 2015 22:51:56 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #28 from Mathias Tillman --- (In reply to Andy Furniss from comment #27) > Created attachment 118665 [details] > valley vm fault dump > > Aha, here it is. I am getting similar results using check_vm. However, since the hang usually doesn't happen after the first vm fail (it did actually for me a few times, but most of the time it didn't) I modified the mesa code to disable closing the program once a vm fault happened. Unfortunately this doesn't really provide me with any interesting information - they have all complained about the paging fault happening in a VERTEX_BUFFER with an identical, or close to identical address. -- You are receiving this mail because: You are the assignee for the bug. --1444085516.4Bf58dc1.14765 Date: Mon, 5 Oct 2015 22:51:56 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 28 on bug 91278 from
(In reply to Andy Furniss from comment #27)
> Created attachment 118665 [details]
> valley vm fault dump
> 
> Aha, here it is.

I am getting similar results using check_vm. However, since the hang usually
doesn't happen after the first vm fail (it did actually for me a few times, but
most of the time it didn't) I modified the mesa code to disable closing the
program once a vm fault happened. Unfortunately this doesn't really provide me
with any interesting information - they have all complained about the paging
fault happening in a VERTEX_BUFFER with an identical, or close to identical
address.


You are receiving this mail because:
  • You are the assignee for the bug.
--1444085516.4Bf58dc1.14765-- --===============0183431337== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0183431337==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Tue, 06 Oct 2015 01:25:30 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0001451104==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id A435B6E42E for ; Mon, 5 Oct 2015 18:25:30 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0001451104== Content-Type: multipart/alternative; boundary="1444094730.5Bbd61.6399"; charset="UTF-8" --1444094730.5Bbd61.6399 Date: Tue, 6 Oct 2015 01:25:30 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable https://bugs.freedesktop.org/show_bug.cgi?id=3D91278 --- Comment #29 from Michel D=C3=A4nzer --- That is interesting, though; the radeonsi driver seems to think there shoul= d be something mapped at the faulting address. This indicates that either the ke= rnel driver fails to handle the mapping properly, or maybe there's a problem with communicating the buffer mapping information from userspace to the kernel driver. --=20 You are receiving this mail because: You are the assignee for the bug. --1444094730.5Bbd61.6399 Date: Tue, 6 Oct 2015 01:25:30 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable

Commen= t # 29 on bug 91278<= /a> from Michel D=C3=A4nzer
That is interesting, though; the radeonsi driver seems to thin=
k there should be
something mapped at the faulting address. This indicates that either the ke=
rnel
driver fails to handle the mapping properly, or maybe there's a problem with
communicating the buffer mapping information from userspace to the kernel
driver.


You are receiving this mail because: =20=20=20=20=20=20
  • You are the assignee for the bug.
--1444094730.5Bbd61.6399-- --===============0001451104== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0001451104==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Fri, 09 Oct 2015 16:08:23 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0132041894==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 90CCE6E04C for ; Fri, 9 Oct 2015 09:08:23 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0132041894== Content-Type: multipart/alternative; boundary="1444406903.86eb201.21454"; charset="UTF-8" --1444406903.86eb201.21454 Date: Fri, 9 Oct 2015 16:08:23 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #30 from Mathias Tillman --- Just updated to latest drm-next-4.4-wip and mesa and reran the valley test. Been running for two hours now without a hang (although there have been vm faults). Will test more over the weekend, but it's looking good so far :) -- You are receiving this mail because: You are the assignee for the bug. --1444406903.86eb201.21454 Date: Fri, 9 Oct 2015 16:08:23 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 30 on bug 91278 from
Just updated to latest drm-next-4.4-wip and mesa and reran the valley test.
Been running for two hours now without a hang (although there have been vm
faults). Will test more over the weekend, but it's looking good so far :)


You are receiving this mail because:
  • You are the assignee for the bug.
--1444406903.86eb201.21454-- --===============0132041894== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0132041894==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Sat, 10 Oct 2015 07:49:20 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1871144475==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 432156E2CF for ; Sat, 10 Oct 2015 00:49:20 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1871144475== Content-Type: multipart/alternative; boundary="1444463360.AC67E71A1.27983"; charset="UTF-8" --1444463360.AC67E71A1.27983 Date: Sat, 10 Oct 2015 07:49:20 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #31 from Mathias Tillman --- Small update: I have been able to run it for a total of 7 hours (2 + 2 + 3) without a hang on the latest drm-next-4.4-wip. Tried the latest drm-fixes-4.3 and it hung after about 30 minutes, so I definitely think there's something in 4.4 that fixes it. I will do a bisect to see if I can figure out what, more exactly, is the fix. -- You are receiving this mail because: You are the assignee for the bug. --1444463360.AC67E71A1.27983 Date: Sat, 10 Oct 2015 07:49:20 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 31 on bug 91278 from
Small update: I have been able to run it for a total of 7 hours (2 + 2 + 3)
without a hang on the latest drm-next-4.4-wip. Tried the latest drm-fixes-4.3
and it hung after about 30 minutes, so I definitely think there's something in
4.4 that fixes it. I will do a bisect to see if I can figure out what, more
exactly, is the fix.


You are receiving this mail because:
  • You are the assignee for the bug.
--1444463360.AC67E71A1.27983-- --===============1871144475== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1871144475==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Sat, 10 Oct 2015 08:59:03 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1972566863==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id C7FBF6E30F for ; Sat, 10 Oct 2015 01:59:02 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1972566863== Content-Type: multipart/alternative; boundary="1444467542.077Cd5e0.14026"; charset="UTF-8" --1444467542.077Cd5e0.14026 Date: Sat, 10 Oct 2015 08:59:02 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #32 from Andy Furniss --- (In reply to Mathias Tillman from comment #31) > Small update: I have been able to run it for a total of 7 hours (2 + 2 + 3) > without a hang on the latest drm-next-4.4-wip. Tried the latest > drm-fixes-4.3 and it hung after about 30 minutes, so I definitely think > there's something in 4.4 that fixes it. I will do a bisect to see if I can > figure out what, more exactly, is the fix. I'll try over the weekend - no locks so far. Bisecting would be a pain for me as I managed to very long lucky runs previously, so wouldn't be able to easily call good. I notice that enable_scheduler is now on by default, maybe flipping that would be quicker than bisect. Long ago I did try some older kernel with that enabled and though it didn't fix IIRC it took longer to lock than was normal at that time. -- You are receiving this mail because: You are the assignee for the bug. --1444467542.077Cd5e0.14026 Date: Sat, 10 Oct 2015 08:59:02 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 32 on bug 91278 from
(In reply to Mathias Tillman from comment #31)
> Small update: I have been able to run it for a total of 7 hours (2 + 2 + 3)
> without a hang on the latest drm-next-4.4-wip. Tried the latest
> drm-fixes-4.3 and it hung after about 30 minutes, so I definitely think
> there's something in 4.4 that fixes it. I will do a bisect to see if I can
> figure out what, more exactly, is the fix.

I'll try over the weekend - no locks so far.

Bisecting would be a pain for me as I managed to very long lucky runs
previously, so wouldn't be able to easily call good.

I notice that enable_scheduler is now on by default, maybe flipping that would
be quicker than bisect. Long ago I did try some older kernel with that enabled
and though it didn't fix IIRC it took longer to lock than was normal at that
time.


You are receiving this mail because:
  • You are the assignee for the bug.
--1444467542.077Cd5e0.14026-- --===============1972566863== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1972566863==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Sat, 10 Oct 2015 10:33:39 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0861362247==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 77C956E334 for ; Sat, 10 Oct 2015 03:33:39 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0861362247== Content-Type: multipart/alternative; boundary="1444473219.6e511.10084"; charset="UTF-8" --1444473219.6e511.10084 Date: Sat, 10 Oct 2015 10:33:39 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #33 from Mathias Tillman --- (In reply to Andy Furniss from comment #32) > (In reply to Mathias Tillman from comment #31) > > Small update: I have been able to run it for a total of 7 hours (2 + 2 + 3) > > without a hang on the latest drm-next-4.4-wip. Tried the latest > > drm-fixes-4.3 and it hung after about 30 minutes, so I definitely think > > there's something in 4.4 that fixes it. I will do a bisect to see if I can > > figure out what, more exactly, is the fix. > > I'll try over the weekend - no locks so far. > > Bisecting would be a pain for me as I managed to very long lucky runs > previously, so wouldn't be able to easily call good. > > I notice that enable_scheduler is now on by default, maybe flipping that > would be quicker than bisect. Long ago I did try some older kernel with that > enabled and though it didn't fix IIRC it took longer to lock than was normal > at that time. Just ran it again with the scheduler disabled (from code, not through the module parameter) on 4.4-wip, and sure enough, after about 30 minutes it hung. So it looks like there's something in the scheduler that either makes it not happen as often, or not at all (will need to confirm this). -- You are receiving this mail because: You are the assignee for the bug. --1444473219.6e511.10084 Date: Sat, 10 Oct 2015 10:33:39 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 33 on bug 91278 from
(In reply to Andy Furniss from comment #32)
> (In reply to Mathias Tillman from comment #31)
> > Small update: I have been able to run it for a total of 7 hours (2 + 2 + 3)
> > without a hang on the latest drm-next-4.4-wip. Tried the latest
> > drm-fixes-4.3 and it hung after about 30 minutes, so I definitely think
> > there's something in 4.4 that fixes it. I will do a bisect to see if I can
> > figure out what, more exactly, is the fix.
> 
> I'll try over the weekend - no locks so far.
> 
> Bisecting would be a pain for me as I managed to very long lucky runs
> previously, so wouldn't be able to easily call good.
> 
> I notice that enable_scheduler is now on by default, maybe flipping that
> would be quicker than bisect. Long ago I did try some older kernel with that
> enabled and though it didn't fix IIRC it took longer to lock than was normal
> at that time.

Just ran it again with the scheduler disabled (from code, not through the
module parameter) on 4.4-wip, and sure enough, after about 30 minutes it hung.
So it looks like there's something in the scheduler that either makes it not
happen as often, or not at all (will need to confirm this).


You are receiving this mail because:
  • You are the assignee for the bug.
--1444473219.6e511.10084-- --===============0861362247== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0861362247==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Sun, 11 Oct 2015 16:14:11 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1713792135==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 0FF8C6E1F5 for ; Sun, 11 Oct 2015 09:14:12 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1713792135== Content-Type: multipart/alternative; boundary="1444580051.dbcbFb1a1.4150"; charset="UTF-8" --1444580051.dbcbFb1a1.4150 Date: Sun, 11 Oct 2015 16:14:11 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable https://bugs.freedesktop.org/show_bug.cgi?id=3D91278 Grazvydas Ignotas changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |notasas@gmail.com --- Comment #34 from Grazvydas Ignotas --- Created attachment 118824 --> https://bugs.freedesktop.org/attachment.cgi?id=3D118824&action=3Dedit test kernel patch (In reply to Michel D=C3=A4nzer from comment #29) > That is interesting, though; the radeonsi driver seems to think there sho= uld > be something mapped at the faulting address. This indicates that either t= he > kernel driver fails to handle the mapping properly, or maybe there's a > problem with communicating the buffer mapping information from userspace = to > the kernel driver. Judging by the symptoms it feels like some caching/buffering problem somewh= ere.=20 If I understand the code right, most of things are mapped write-combine, wh= ich means the CPU is allowed to write data it any order it likes. Looking at amdgpu/radeon code, there is surprising lack of barriers, basically it's ju= st amdgpu_ring_commit()/radeon_ring_commit() and that's it. But mb() doesn't guarantee that the writes will arrive in program order, it just ensures that all the writes are finished after that mb() statement. So the question is, is it ok for the hardware if in something like amdgpu_ib_schedule() the writes to the ring arrive before the writes to IB?= I do admit I don't understand how the hardware works, like what triggers the hardware to start processing the ring contents, perhaps the write to the la= st word in the ring? If so you clearly need a wmb() before the write which triggers the hardware so that everything is ready before the GPU kicks in. Attached is a debug kernel patch to test if my guess is correct. It's way overkill and will trash performance, but it should show if this is a problem related to CPU caching/buffering. I don't have the hardware to test this myself. --=20 You are receiving this mail because: You are the assignee for the bug. --1444580051.dbcbFb1a1.4150 Date: Sun, 11 Oct 2015 16:14:11 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Grazvydas Ignotas changed bug 91278<= /a>
What Removed Added
CC   notasas@gmail.com

Commen= t # 34 on bug 91278<= /a> from Grazvydas Ignotas
Created attachment 118824 =
[details] [review]
test kernel patch

(In reply to Michel D=C3=A4nzer from comment #29)
> That is interesting, though; the radeonsi driver=
 seems to think there should
> be something mapped at the faulting address. This indicates that eithe=
r the
> kernel driver fails to handle the mapping properly, or maybe there's a
> problem with communicating the buffer mapping information from userspa=
ce to
> the kernel driver.

Judging by the symptoms it feels like some caching/buffering problem somewh=
ere.=20

If I understand the code right, most of things are mapped write-combine, wh=
ich
means the CPU is allowed to write data it any order it likes. Looking at
amdgpu/radeon code, there is surprising lack of barriers, basically it's ju=
st
amdgpu_ring_commit()/radeon_ring_commit() and that's it. But mb() doesn't
guarantee that the writes will arrive in program order, it just ensures that
all the writes are finished after that mb() statement.

So the question is, is it ok for the hardware if in something like
amdgpu_ib_schedule() the writes to the ring arrive before the writes to IB?=
 I
do admit I don't understand how the hardware works, like what triggers the
hardware to start processing the ring contents, perhaps the write to the la=
st
word in the ring? If so you clearly need a wmb() before the write which
triggers the hardware so that everything is ready before the GPU kicks in.

Attached is a debug kernel patch to test if my guess is correct. It's way
overkill and will trash performance, but it should show if this is a problem
related to CPU caching/buffering. I don't have the hardware to test this
myself.


You are receiving this mail because: =20=20=20=20=20=20
  • You are the assignee for the bug.
--1444580051.dbcbFb1a1.4150-- --===============1713792135== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1713792135==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Sun, 11 Oct 2015 16:26:58 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0953242431==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 242526E1F5 for ; Sun, 11 Oct 2015 09:26:58 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0953242431== Content-Type: multipart/alternative; boundary="1444580818.be7d82.8122"; charset="UTF-8" --1444580818.be7d82.8122 Date: Sun, 11 Oct 2015 16:26:58 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable https://bugs.freedesktop.org/show_bug.cgi?id=3D91278 --- Comment #35 from Christian K=C3=B6nig --- The ring and IB are just normal system memory made accessible to the GPU. So it's perfectly fine that it's mapped WC by the CPU. The processing of the commands written into the ring and IB are triggered by writing the wptr register. See radeon_ring_set_wptr(). --=20 You are receiving this mail because: You are the assignee for the bug. --1444580818.be7d82.8122 Date: Sun, 11 Oct 2015 16:26:58 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable

Commen= t # 35 on bug 91278<= /a> from Christian K=C3=B6nig
The ring and IB are just normal system memory made accessible =
to the GPU. So
it's perfectly fine that it's mapped WC by the CPU.

The processing of the commands written into the ring and IB are triggered by
writing the wptr register.

See radeon_ring_set_wptr().


You are receiving this mail because: =20=20=20=20=20=20
  • You are the assignee for the bug.
--1444580818.be7d82.8122-- --===============0953242431== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0953242431==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Sun, 11 Oct 2015 19:37:00 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0301873341==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 98E5C6E40F for ; Sun, 11 Oct 2015 12:37:00 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0301873341== Content-Type: multipart/alternative; boundary="1444592220.d6dCCECe2.25404"; charset="UTF-8" --1444592220.d6dCCECe2.25404 Date: Sun, 11 Oct 2015 19:37:00 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #36 from Mathias Tillman --- I have run the valley test the entire day (10 hours or so) without a single VM fault or hang with the GPU scheduler enabled on 4.4. Re-ran the test with your (Grazvydas) patch and it's showing the same symptoms as before when not using the schedluer - that is, a bunch of VM faults, which lead to a hang after around 30 minutes or so. -- You are receiving this mail because: You are the assignee for the bug. --1444592220.d6dCCECe2.25404 Date: Sun, 11 Oct 2015 19:37:00 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 36 on bug 91278 from
I have run the valley test the entire day (10 hours or so) without a single VM
fault or hang with the GPU scheduler enabled on 4.4.

Re-ran the test with your (Grazvydas) patch and it's showing the same symptoms
as before when not using the schedluer - that is, a bunch of VM faults, which
lead to a hang after around 30 minutes or so.


You are receiving this mail because:
  • You are the assignee for the bug.
--1444592220.d6dCCECe2.25404-- --===============0301873341== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0301873341==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Sun, 11 Oct 2015 20:50:44 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0160429071==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 562E16E471 for ; Sun, 11 Oct 2015 13:50:44 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0160429071== Content-Type: multipart/alternative; boundary="1444596644.C570b1.12211"; charset="UTF-8" --1444596644.C570b1.12211 Date: Sun, 11 Oct 2015 20:50:44 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #37 from Grazvydas Ignotas --- ok then my guess is wrong. -- You are receiving this mail because: You are the assignee for the bug. --1444596644.C570b1.12211 Date: Sun, 11 Oct 2015 20:50:44 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 37 on bug 91278 from
ok then my guess is wrong.


You are receiving this mail because:
  • You are the assignee for the bug.
--1444596644.C570b1.12211-- --===============0160429071== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0160429071==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Sun, 11 Oct 2015 21:44:45 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1330840960==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 847776E4A9 for ; Sun, 11 Oct 2015 14:44:45 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1330840960== Content-Type: multipart/alternative; boundary="1444599885.D3C71.27849"; charset="UTF-8" --1444599885.D3C71.27849 Date: Sun, 11 Oct 2015 21:44:45 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #38 from Andy Furniss --- (In reply to Andy Furniss from comment #32) > I'll try over the weekend - no locks so far. Still good, I've also tried to hang with Unreal demos and mplayer+uvd, no hangs and no vm faults so far. -- You are receiving this mail because: You are the assignee for the bug. --1444599885.D3C71.27849 Date: Sun, 11 Oct 2015 21:44:45 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 38 on bug 91278 from
(In reply to Andy Furniss from comment #32)

> I'll try over the weekend - no locks so far.

Still good, I've also tried to hang with Unreal demos and mplayer+uvd, no hangs
and no vm faults so far.


You are receiving this mail because:
  • You are the assignee for the bug.
--1444599885.D3C71.27849-- --===============1330840960== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1330840960==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Mon, 12 Oct 2015 11:58:59 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0861940076==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id E0E5C6E92E for ; Mon, 12 Oct 2015 04:58:58 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0861940076== Content-Type: multipart/alternative; boundary="1444651138.47dEdBA2.12022"; charset="UTF-8" --1444651138.47dEdBA2.12022 Date: Mon, 12 Oct 2015 11:58:58 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #39 from Mathias Tillman --- (In reply to Andy Furniss from comment #38) > (In reply to Andy Furniss from comment #32) > > > I'll try over the weekend - no locks so far. > > Still good, I've also tried to hang with Unreal demos and mplayer+uvd, no > hangs and no vm faults so far. Assuming you tried it with the scheduler enabled? Just tried it with the scheduler enabled and semaphores disabled on drm-fixes-4.3, and it also seems to work. Is the scheduler going to be left enabled from now on? If so we could probably say that this issue has been solved (though more testing is probably required). However, something is causing all of those vm faults when the scheduler is disabled, so it might still be worth looking into if it's going to be possible to disable it using a module parameter. -- You are receiving this mail because: You are the assignee for the bug. --1444651138.47dEdBA2.12022 Date: Mon, 12 Oct 2015 11:58:58 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 39 on bug 91278 from
(In reply to Andy Furniss from comment #38)
> (In reply to Andy Furniss from comment #32)
> 
> > I'll try over the weekend - no locks so far.
> 
> Still good, I've also tried to hang with Unreal demos and mplayer+uvd, no
> hangs and no vm faults so far.

Assuming you tried it with the scheduler enabled? Just tried it with the
scheduler enabled and semaphores disabled on drm-fixes-4.3, and it also seems
to work.

Is the scheduler going to be left enabled from now on? If so we could probably
say that this issue has been solved (though more testing is probably required).
However, something is causing all of those vm faults when the scheduler is
disabled, so it might still be worth looking into if it's going to be possible
to disable it using a module parameter.


You are receiving this mail because:
  • You are the assignee for the bug.
--1444651138.47dEdBA2.12022-- --===============0861940076== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0861940076==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Tue, 13 Oct 2015 18:24:07 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0204430105==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 9E9CE6E5F5 for ; Tue, 13 Oct 2015 11:24:07 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0204430105== Content-Type: multipart/alternative; boundary="1444760647.Dfab1.12086"; charset="UTF-8" --1444760647.Dfab1.12086 Date: Tue, 13 Oct 2015 18:24:07 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #40 from Andy Furniss --- (In reply to Mathias Tillman from comment #39) > (In reply to Andy Furniss from comment #38) > > (In reply to Andy Furniss from comment #32) > > > > > I'll try over the weekend - no locks so far. > > > > Still good, I've also tried to hang with Unreal demos and mplayer+uvd, no > > hangs and no vm faults so far. > > Assuming you tried it with the scheduler enabled? Yea, I am running with the default for drm-next-4.4-wip = enabled. -- You are receiving this mail because: You are the assignee for the bug. --1444760647.Dfab1.12086 Date: Tue, 13 Oct 2015 18:24:07 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 40 on bug 91278 from
(In reply to Mathias Tillman from comment #39)
> (In reply to Andy Furniss from comment #38)
> > (In reply to Andy Furniss from comment #32)
> > 
> > > I'll try over the weekend - no locks so far.
> > 
> > Still good, I've also tried to hang with Unreal demos and mplayer+uvd, no
> > hangs and no vm faults so far.
> 
> Assuming you tried it with the scheduler enabled?

Yea, I am running with the default for drm-next-4.4-wip = enabled.


You are receiving this mail because:
  • You are the assignee for the bug.
--1444760647.Dfab1.12086-- --===============0204430105== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0204430105==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Thu, 15 Oct 2015 09:45:23 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1302214870==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 9657E6EE38 for ; Thu, 15 Oct 2015 02:45:23 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1302214870== Content-Type: multipart/alternative; boundary="1444902323.f3A8D2.23799"; charset="UTF-8" --1444902323.f3A8D2.23799 Date: Thu, 15 Oct 2015 09:45:23 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #41 from Mathias Tillman --- Just tried it again with the latest drm-next-4.4 and the kernel parameters amdgpu.enable_scheduler=0 amdgpu.vm_debug=1 amdgpu.enable_semaphores=1. Can't even make it to the beginning of the test (it hangs on the loading screen, if not before that - I have OpenGL acceleration enabled in kwin) when vm_debug=1 before hanging with a bunch of VM faults. If I set vm_debug to 0 it at least starts properly. Not sure if that was the intended behaviour, but that's what happens in any case. -- You are receiving this mail because: You are the assignee for the bug. --1444902323.f3A8D2.23799 Date: Thu, 15 Oct 2015 09:45:23 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 41 on bug 91278 from
Just tried it again with the latest drm-next-4.4 and the kernel parameters
amdgpu.enable_scheduler=0 amdgpu.vm_debug=1 amdgpu.enable_semaphores=1. Can't
even make it to the beginning of the test (it hangs on the loading screen, if
not before that - I have OpenGL acceleration enabled in kwin) when vm_debug=1
before hanging with a bunch of VM faults. If I set vm_debug to 0 it at least
starts properly.
Not sure if that was the intended behaviour, but that's what happens in any
case.


You are receiving this mail because:
  • You are the assignee for the bug.
--1444902323.f3A8D2.23799-- --===============1302214870== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1302214870==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Thu, 15 Oct 2015 13:28:22 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0031622464==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 6501E6EE83 for ; Thu, 15 Oct 2015 06:28:22 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0031622464== Content-Type: multipart/alternative; boundary="1444915702.FEDa0f8f2.27625"; charset="UTF-8" --1444915702.FEDa0f8f2.27625 Date: Thu, 15 Oct 2015 13:28:22 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #42 from Alex Deucher --- (In reply to Mathias Tillman from comment #41) > Just tried it again with the latest drm-next-4.4 and the kernel parameters > amdgpu.enable_scheduler=0 amdgpu.vm_debug=1 amdgpu.enable_semaphores=1. I don't think forcing semaphores will work on tonga/fiji since there were disabled in the code a while ago due to hw bugs. -- You are receiving this mail because: You are the assignee for the bug. --1444915702.FEDa0f8f2.27625 Date: Thu, 15 Oct 2015 13:28:22 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 42 on bug 91278 from
(In reply to Mathias Tillman from comment #41)
> Just tried it again with the latest drm-next-4.4 and the kernel parameters
> amdgpu.enable_scheduler=0 amdgpu.vm_debug=1 amdgpu.enable_semaphores=1.

I don't think forcing semaphores will work on tonga/fiji since there were
disabled in the code a while ago due to hw bugs.


You are receiving this mail because:
  • You are the assignee for the bug.
--1444915702.FEDa0f8f2.27625-- --===============0031622464== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0031622464==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Thu, 15 Oct 2015 13:51:02 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============2037788707==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 5C8766EE89 for ; Thu, 15 Oct 2015 06:51:02 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============2037788707== Content-Type: multipart/alternative; boundary="1444917062.ec60D8D2.1309"; charset="UTF-8" --1444917062.ec60D8D2.1309 Date: Thu, 15 Oct 2015 13:51:02 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable https://bugs.freedesktop.org/show_bug.cgi?id=3D91278 --- Comment #43 from Christian K=C3=B6nig --- (In reply to Alex Deucher from comment #42) > (In reply to Mathias Tillman from comment #41) > > Just tried it again with the latest drm-next-4.4 and the kernel paramet= ers > > amdgpu.enable_scheduler=3D0 amdgpu.vm_debug=3D1 amdgpu.enable_semaphore= s=3D1. >=20 > I don't think forcing semaphores will work on tonga/fiji since there were > disabled in the code a while ago due to hw bugs. Yeah, forcefully enabling semaphores on Tonga will crash rather fast. The purpose of vm_debug !=3D 0 is to stop after the first VM fault. So if you got a VM fault from time to time which was just ignored than sett= ing vm_debug will certainly crash the system. On the other hand you shouldn't get VM faults in the first place. --=20 You are receiving this mail because: You are the assignee for the bug. --1444917062.ec60D8D2.1309 Date: Thu, 15 Oct 2015 13:51:02 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable

Commen= t # 43 on bug 91278<= /a> from Christian K=C3=B6nig
(In reply to Alex Deucher from comment #42)
> (In reply to Mathias Tillman from comment #41)
> > Just tried it again with the latest drm-next-4.4 and the kernel p=
arameters
> > amdgpu.enable_scheduler=3D0 amdgpu.vm_debug=3D1 amdgpu.enable_sem=
aphores=3D1.
>=20
> I don't think forcing semaphores will work on tonga/fiji since there w=
ere
> disabled in the code a while ago due to hw bugs.

Yeah, forcefully enabling semaphores on Tonga will crash rather fast.

The purpose of vm_debug !=3D 0 is to stop after the first VM fault.

So if you got a VM fault from time to time which was just ignored than sett=
ing
vm_debug will certainly crash the system.

On the other hand you shouldn't get VM faults in the first place.


You are receiving this mail because: =20=20=20=20=20=20
  • You are the assignee for the bug.
--1444917062.ec60D8D2.1309-- --===============2037788707== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============2037788707==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Thu, 15 Oct 2015 15:51:30 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0928048273==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 07D456EC55 for ; Thu, 15 Oct 2015 08:51:30 -0700 (PDT) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0928048273== Content-Type: multipart/alternative; boundary="1444924289.0ACa3faa2.15867"; charset="UTF-8" --1444924289.0ACa3faa2.15867 Date: Thu, 15 Oct 2015 15:51:29 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #44 from Mathias Tillman --- Just done some tests: enable_scheduler=0, enable_semaphores=1, vm_fault_stop=0, vm_debug=1 Hang after a few minutes, normal desktop use or running the valley test. Many VM faults in dmesg. enable_scheduler=0, enable_semaphores=0, vm_fault_stop=1, vm_debug=1 Hang after a few minutes, normal desktop use or running the valley test. Only one VM fault visible in dmesg, that would be due to vm_fault_stop=1 I'm guessing? enable_scheduler=0, enable_semaphores=0, vm_fault_stop=0, vm_debug=1 Same as above, but with several VM faults in dmesg. enable_scheduler=1, enable_semaphores=0, vm_fault_stop=0, vm_debug=1 All good. -- You are receiving this mail because: You are the assignee for the bug. --1444924289.0ACa3faa2.15867 Date: Thu, 15 Oct 2015 15:51:29 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 44 on bug 91278 from
Just done some tests:

enable_scheduler=0, enable_semaphores=1, vm_fault_stop=0, vm_debug=1
Hang after a few  minutes, normal desktop use or running the valley test. Many
VM faults in dmesg.

enable_scheduler=0, enable_semaphores=0, vm_fault_stop=1, vm_debug=1
Hang after a few  minutes, normal desktop use or running the valley test. Only
one VM fault visible in dmesg, that would be due to vm_fault_stop=1 I'm
guessing?

enable_scheduler=0, enable_semaphores=0, vm_fault_stop=0, vm_debug=1
Same as above, but with several VM faults in dmesg.

enable_scheduler=1, enable_semaphores=0, vm_fault_stop=0, vm_debug=1
All good.


You are receiving this mail because:
  • You are the assignee for the bug.
--1444924289.0ACa3faa2.15867-- --===============0928048273== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0928048273==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Sun, 15 Nov 2015 11:24:45 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0175991321==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 8113E6E679 for ; Sun, 15 Nov 2015 03:24:45 -0800 (PST) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0175991321== Content-Type: multipart/alternative; boundary="1447586685.eFC52.11556"; charset="UTF-8" --1447586685.eFC52.11556 Date: Sun, 15 Nov 2015 11:24:45 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #45 from Daniele Ruffini --- Created attachment 119679 --> https://bugs.freedesktop.org/attachment.cgi?id=119679&action=edit journalctl | grep amdgpu Went on for a while, cut short because the error was just repeating -- You are receiving this mail because: You are the assignee for the bug. --1447586685.eFC52.11556 Date: Sun, 15 Nov 2015 11:24:45 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 45 on bug 91278 from
Created attachment 119679 [details]
journalctl | grep amdgpu

Went on for a while, cut short because the error was just repeating


You are receiving this mail because:
  • You are the assignee for the bug.
--1447586685.eFC52.11556-- --===============0175991321== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0175991321==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Sun, 15 Nov 2015 11:27:51 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0902376382==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 572F56E67A for ; Sun, 15 Nov 2015 03:27:51 -0800 (PST) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0902376382== Content-Type: multipart/alternative; boundary="1447586871.8288f2.12754"; charset="UTF-8" --1447586871.8288f2.12754 Date: Sun, 15 Nov 2015 11:27:51 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #46 from Daniele Ruffini --- Just forgot to mention that the bug for me happens totally randomly. No Unigine Heave. No intensive graphic usage. It seems to happen when something new is displayed tough but i wasn't able to recreate it metodically. Amd Radeon HD380 4GB. -- You are receiving this mail because: You are the assignee for the bug. --1447586871.8288f2.12754 Date: Sun, 15 Nov 2015 11:27:51 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 46 on bug 91278 from
Just forgot to mention that the bug for me happens totally randomly.
No Unigine Heave. No intensive graphic usage.
It seems to happen when something new is displayed tough but i wasn't able to
recreate it metodically.

Amd Radeon HD380 4GB.


You are receiving this mail because:
  • You are the assignee for the bug.
--1447586871.8288f2.12754-- --===============0902376382== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0902376382==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Sun, 15 Nov 2015 15:11:54 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1080143275==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 93CAC6E6F5 for ; Sun, 15 Nov 2015 07:11:54 -0800 (PST) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1080143275== Content-Type: multipart/alternative; boundary="1447600314.c7b4861.26268"; charset="UTF-8" --1447600314.c7b4861.26268 Date: Sun, 15 Nov 2015 15:11:54 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 --- Comment #47 from Andy Furniss --- Seems like you are using an old kernel - though what I call old may be current for what's released. The hanging for me stopped with 4.4, but 4.3 is release. If you are using 4.3 maybe booting with the option - amdgpu.enable_scheduler=1 will help. If you are used to compiling your own kernels then using one from - http://cgit.freedesktop.org/~agd5f/linux/ like drm-next-4.4 should be stable. Or if you want to test/use the new powerplay try amdgpu-powerplay -- You are receiving this mail because: You are the assignee for the bug. --1447600314.c7b4861.26268 Date: Sun, 15 Nov 2015 15:11:54 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8"

Comment # 47 on bug 91278 from
Seems like you are using an old kernel - though what I call old may be current
for what's released.

The hanging for me stopped with 4.4, but 4.3 is release.

If you are using 4.3 maybe booting with the option -

amdgpu.enable_scheduler=1

will help.

If you are used to compiling your own kernels then using one from -

http://cgit.freedesktop.org/~agd5f/linux/

like drm-next-4.4 should be stable.

Or if you want to test/use the new powerplay try amdgpu-powerplay


You are receiving this mail because:
  • You are the assignee for the bug.
--1447600314.c7b4861.26268-- --===============1080143275== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============1080143275==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Mon, 28 Dec 2015 14:24:04 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0893267481==" Return-path: Received: from culpepper.freedesktop.org (unknown [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 70E76896B0 for ; Mon, 28 Dec 2015 06:24:04 -0800 (PST) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0893267481== Content-Type: multipart/alternative; boundary="1451312644.5FF1c2.22738"; charset="UTF-8" --1451312644.5FF1c2.22738 Date: Mon, 28 Dec 2015 14:24:04 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" https://bugs.freedesktop.org/show_bug.cgi?id=91278 EoD changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |EoD@xmw.de --- Comment #48 from EoD --- I don't get any lockups with kernel 4.4-rc6 and current mesa git on my R380X. -- You are receiving this mail because: You are the assignee for the bug. --1451312644.5FF1c2.22738 Date: Mon, 28 Dec 2015 14:24:04 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" changed bug 91278
What Removed Added
CC   EoD@xmw.de

Comment # 48 on bug 91278 from
I don't get any lockups with kernel 4.4-rc6 and current mesa git on my R380X.


You are receiving this mail because:
  • You are the assignee for the bug.
--1451312644.5FF1c2.22738-- --===============0893267481== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHA6Ly9saXN0 cy5mcmVlZGVza3RvcC5vcmcvbWFpbG1hbi9saXN0aW5mby9kcmktZGV2ZWwK --===============0893267481==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Thu, 15 Sep 2016 17:04:53 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0200562810==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 0F58D6E8EA for ; Thu, 15 Sep 2016 17:04:53 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============0200562810== Content-Type: multipart/alternative; boundary="14739590930.B5FF8DDFb.4343"; charset="UTF-8" --14739590930.B5FF8DDFb.4343 Date: Thu, 15 Sep 2016 17:04:53 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D91278 Andy Furniss changed: What |Removed |Added ---------------------------------------------------------------------------- Resolution|--- |FIXED Status|NEW |RESOLVED --- Comment #49 from Andy Furniss --- I can't speak for everyone on the cc with different h/w, but on Tonga valley has been stable for a long time, so closing. If it's still an issue for anyone you can reopen. --=20 You are receiving this mail because: You are the assignee for the bug.= --14739590930.B5FF8DDFb.4343 Date: Thu, 15 Sep 2016 17:04:53 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated Andy Furniss changed bug 91278<= /a>
What Removed Added
Resolution --- FIXED
Status NEW RESOLVED

Commen= t # 49 on bug 91278<= /a> from Andy Furniss
I can't speak for everyone on the cc with different h/w, but o=
n Tonga valley
has been stable for a long time, so closing.

If it's still an issue for anyone you can reopen.


You are receiving this mail because:
  • You are the assignee for the bug.
= --14739590930.B5FF8DDFb.4343-- --===============0200562810== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============0200562810==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 91278] Tonga GPU lock/reset fail with Unigine Valley Date: Fri, 16 Sep 2016 01:45:18 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============2075272674==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 706AF6E97A for ; Fri, 16 Sep 2016 01:45:18 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============2075272674== Content-Type: multipart/alternative; boundary="14739903181.F339.28287"; charset="UTF-8" --14739903181.F339.28287 Date: Fri, 16 Sep 2016 01:45:18 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D91278 --- Comment #50 from Michel D=C3=A4nzer --- (In reply to Andy Furniss from comment #49) > If it's still an issue for anyone you can reopen. This is your report, so anyone else please file their own instead. --=20 You are receiving this mail because: You are the assignee for the bug.= --14739903181.F339.28287 Date: Fri, 16 Sep 2016 01:45:18 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Commen= t # 50 on bug 91278<= /a> from Michel D=C3=A4nzer
(In reply to Andy Furniss from comment #49)
> If it's still an issue for anyone you can reopen=
.

This is your report, so anyone else please file their own instead.


You are receiving this mail because:
  • You are the assignee for the bug.
= --14739903181.F339.28287-- --===============2075272674== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============2075272674==--