https://bugs.freedesktop.org/show_bug.cgi?id=105113

Bug ID: 105113
Summary: [hawaii] Running Piglit cl/program/execute/calls-struct.cl causes GPU VM error and ring stalled GPU lockup
Product: Mesa
Version: git
Hardware: Other
OS: All
Status: NEW
Severity: normal
Priority: medium
Component: Drivers/Gallium/radeonsi
Assignee: dri-devel@lists.freedesktop.org
Reporter: vedran@miletic.net
QA Contact: dri-devel@lists.freedesktop.org

On kernel 4.16.0-0.rc1.git0.1.fc28.x86_64, with

01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Hawaii XT GL [FirePro W9100]

upon running Piglit's cl/program/execute/calls-struct.cl test I get:

[ 1574.837119] radeon 0000:01:00.0: GPU fault detected: 147 0x080a8401
[ 1574.837124] radeon 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00400840
[ 1574.837126] radeon 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0A084001
[ 1574.837128] VM fault (0x01, vmid 5) at page 4196416, read from 'TC5' (0x54433500) (132)
[ 1585.420894] radeon 0000:01:00.0: ring 0 stalled for more than 10080msec
[ 1585.420901] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1585.924885] radeon 0000:01:00.0: ring 0 stalled for more than 10584msec
[ 1585.924892] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1586.428890] radeon 0000:01:00.0: ring 0 stalled for more than 11088msec
[ 1586.428897] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1586.932902] radeon 0000:01:00.0: ring 0 stalled for more than 11592msec
[ 1586.932911] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1587.436903] radeon 0000:01:00.0: ring 0 stalled for more than 12096msec
[ 1587.436909] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1587.940855] radeon 0000:01:00.0: ring 0 stalled for more than 12600msec
[ 1587.940859] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1588.444913] radeon 0000:01:00.0: ring 0 stalled for more than 13104msec
[ 1588.444922] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1588.948909] radeon 0000:01:00.0: ring 0 stalled for more than 13608msec
[ 1588.948918] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1589.452909] radeon 0000:01:00.0: ring 0 stalled for more than 14112msec
[ 1589.452916] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1589.956912] radeon 0000:01:00.0: ring 0 stalled for more than 14616msec
[ 1589.956920] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1590.460913] radeon 0000:01:00.0: ring 0 stalled for more than 15120msec
[ 1590.460920] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1590.964927] radeon 0000:01:00.0: ring 0 stalled for more than 15624msec
[ 1590.964934] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1591.468898] radeon 0000:01:00.0: ring 0 stalled for more than 16128msec
[ 1591.468905] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1591.972882] radeon 0000:01:00.0: ring 0 stalled for more than 16632msec
[ 1591.972887] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1592.476903] radeon 0000:01:00.0: ring 0 stalled for more than 17136msec
[ 1592.476908] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1592.980928] radeon 0000:01:00.0: ring 0 stalled for more than 17640msec
[ 1592.980936] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1593.484931] radeon 0000:01:00.0: ring 0 stalled for more than 18144msec
[ 1593.484939] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1593.988933] radeon 0000:01:00.0: ring 0 stalled for more than 18648msec
[ 1593.988941] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1594.492935] radeon 0000:01:00.0: ring 0 stalled for more than 19152msec
[ 1594.492943] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1594.996951] radeon 0000:01:00.0: ring 0 stalled for more than 19656msec
[ 1594.996962] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1595.500953] radeon 0000:01:00.0: ring 0 stalled for more than 20160msec
[ 1595.500963] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1596.004957] radeon 0000:01:00.0: ring 0 stalled for more than 20664msec
[ 1596.004967] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1596.508970] radeon 0000:01:00.0: ring 0 stalled for more than 21168msec
[ 1596.508983] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1597.012966] radeon 0000:01:00.0: ring 0 stalled for more than 21672msec
[ 1597.012982] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1597.516969] radeon 0000:01:00.0: ring 0 stalled for more than 22176msec
[ 1597.516984] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1598.020970] radeon 0000:01:00.0: ring 0 stalled for more than 22680msec
[ 1598.020985] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1598.524974] radeon 0000:01:00.0: ring 0 stalled for more than 23184msec
[ 1598.524989] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1599.028975] radeon 0000:01:00.0: ring 0 stalled for more than 23688msec
[ 1599.028990] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1599.532977] radeon 0000:01:00.0: ring 0 stalled for more than 24192msec
[ 1599.532992] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1600.036981] radeon 0000:01:00.0: ring 0 stalled for more than 24696msec
[ 1600.036997] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1600.540984] radeon 0000:01:00.0: ring 0 stalled for more than 25200msec
[ 1600.540999] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1601.044948] radeon 0000:01:00.0: ring 0 stalled for more than 25704msec
[ 1601.044963] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1601.548986] radeon 0000:01:00.0: ring 0 stalled for more than 26208msec
[ 1601.549002] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1602.052966] radeon 0000:01:00.0: ring 0 stalled for more than 26712msec
[ 1602.052981] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1602.556999] radeon 0000:01:00.0: ring 0 stalled for more than 27216msec
[ 1602.557014] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1603.060934] radeon 0000:01:00.0: ring 0 stalled for more than 27720msec
[ 1603.060938] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1603.564977] radeon 0000:01:00.0: ring 0 stalled for more than 28224msec
[ 1603.564981] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1604.068965] radeon 0000:01:00.0: ring 0 stalled for more than 28728msec
[ 1604.068969] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1604.572967] radeon 0000:01:00.0: ring 0 stalled for more than 29232msec
[ 1604.572971] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1605.076978] radeon 0000:01:00.0: ring 0 stalled for more than 29736msec
[ 1605.076984] radeon 0000:01:00.0: GPU lockup (current fence id 0x000000000000002b last fence id 0x000000000000002c on ring 0)
[ 1605.136370] radeon 0000:01:00.0: Saved 24 dwords of commands on ring 0.
[ 1605.136381] radeon 0000:01:00.0: GPU softreset: 0x00000009
[ 1605.136382] radeon 0000:01:00.0: GRBM_STATUS=0xA0403028
[ 1605.136383] radeon 0000:01:00.0: GRBM_STATUS2=0x50000008
[ 1605.136385] radeon 0000:01:00.0: GRBM_STATUS_SE0=0x08000006
[ 1605.136386] radeon 0000:01:00.0: GRBM_STATUS_SE1=0x08000006
[ 1605.136387] radeon 0000:01:00.0: GRBM_STATUS_SE2=0x08000006
[ 1605.136388] radeon 0000:01:00.0: GRBM_STATUS_SE3=0x08000006
[ 1605.136389] radeon 0000:01:00.0: SRBM_STATUS=0x20000040
[ 1605.136390] radeon 0000:01:00.0: SRBM_STATUS2=0x00000000
[ 1605.136391] radeon 0000:01:00.0: SDMA0_STATUS_REG = 0x46CEE557
[ 1605.136392] radeon 0000:01:00.0: SDMA1_STATUS_REG = 0x46CEE557
[ 1605.136393] radeon 0000:01:00.0: CP_STAT = 0x80038600
[ 1605.136394] radeon 0000:01:00.0: CP_STALLED_STAT1 = 0x00000c00
[ 1605.136395] radeon 0000:01:00.0: CP_STALLED_STAT2 = 0x00018000
[ 1605.136397] radeon 0000:01:00.0: CP_STALLED_STAT3 = 0x00000000
[ 1605.136398] radeon 0000:01:00.0: CP_CPF_BUSY_STAT = 0x00000002
[ 1605.136399] radeon 0000:01:00.0: CP_CPF_STALLED_STAT1 = 0x00000000
[ 1605.136400] radeon 0000:01:00.0: CP_CPF_STATUS = 0x80000063
[ 1605.136401] radeon 0000:01:00.0: CP_CPC_BUSY_STAT = 0x00000000
[ 1605.136402] radeon 0000:01:00.0: CP_CPC_STALLED_STAT1 = 0x00000000
[ 1605.136403] radeon 0000:01:00.0: CP_CPC_STATUS = 0x00000000
[ 1605.136404] radeon 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000
[ 1605.136406] radeon 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x00000000
[ 1605.136539] radeon 0000:01:00.0: GRBM_SOFT_RESET=0x00010001
[ 1605.136591] radeon 0000:01:00.0: SRBM_SOFT_RESET=0x00000100
[ 1605.137745] radeon 0000:01:00.0: GRBM_STATUS=0x00003028
[ 1605.137746] radeon 0000:01:00.0: GRBM_STATUS2=0x00000008
[ 1605.137747] radeon 0000:01:00.0: GRBM_STATUS_SE0=0x00000006
[ 1605.137748] radeon 0000:01:00.0: GRBM_STATUS_SE1=0x00000006
[ 1605.137749] radeon 0000:01:00.0: GRBM_STATUS_SE2=0x00000006
[ 1605.137751] radeon 0000:01:00.0: GRBM_STATUS_SE3=0x00000006
[ 1605.137752] radeon 0000:01:00.0: SRBM_STATUS=0x20000040
[ 1605.137752] radeon 0000:01:00.0: SRBM_STATUS2=0x00000000
[ 1605.137754] radeon 0000:01:00.0: SDMA0_STATUS_REG = 0x46CEE557
[ 1605.137755] radeon 0000:01:00.0: SDMA1_STATUS_REG = 0x46CEE557
[ 1605.137756] radeon 0000:01:00.0: CP_STAT = 0x00000000
[ 1605.137757] radeon 0000:01:00.0: CP_STALLED_STAT1 = 0x00000000
[ 1605.137758] radeon 0000:01:00.0: CP_STALLED_STAT2 = 0x00000000
[ 1605.137759] radeon 0000:01:00.0: CP_STALLED_STAT3 = 0x00000000
[ 1605.137760] radeon 0000:01:00.0: CP_CPF_BUSY_STAT = 0x00000000
[ 1605.137761] radeon 0000:01:00.0: CP_CPF_STALLED_STAT1 = 0x00000000
[ 1605.137762] radeon 0000:01:00.0: CP_CPF_STATUS = 0x00000000
[ 1605.137763] radeon 0000:01:00.0: CP_CPC_BUSY_STAT = 0x00000000
[ 1605.137764] radeon 0000:01:00.0: CP_CPC_STALLED_STAT1 = 0x00000000
[ 1605.137766] radeon 0000:01:00.0: CP_CPC_STATUS = 0x00000000
[ 1605.137779] radeon 0000:01:00.0: GPU reset succeeded, trying to resume
[ 1605.316214] [drm:ci_dpm_enable [radeon]] *ERROR* ci_start_dpm failed
[ 1605.316228] [drm:radeon_pm_resume [radeon]] *ERROR* radeon: dpm resume failed
[ 1605.316232] [drm] probing gen 2 caps for device 8086:c01 = 261ad03/e
[ 1605.316234] [drm] PCIE gen 3 link speeds already enabled
[ 1605.322812] [drm] PCIE GART of 2048M enabled (table at 0x000000000030E000).
[ 1605.322948] radeon 0000:01:00.0: WB enabled
[ 1605.322963] radeon 0000:01:00.0: fence driver on ring 0 use gpu addr 0x0000000400000c00 and cpu addr 0x0000000069866a2d
[ 1605.322964] radeon 0000:01:00.0: fence driver on ring 1 use gpu addr 0x0000000400000c04 and cpu addr 0x000000006efe9aa0
[ 1605.322965] radeon 0000:01:00.0: fence driver on ring 2 use gpu addr 0x0000000400000c08 and cpu addr 0x00000000a652c3ad
[ 1605.322966] radeon 0000:01:00.0: fence driver on ring 3 use gpu addr 0x0000000400000c0c and cpu addr 0x00000000fc5d211b
[ 1605.322967] radeon 0000:01:00.0: fence driver on ring 4 use gpu addr 0x0000000400000c10 and cpu addr 0x00000000cd5ca2f4
[ 1605.323322] radeon 0000:01:00.0: fence driver on ring 5 use gpu addr 0x0000000000078d30 and cpu addr 0x00000000ae9e3dfe
[ 1605.323463] radeon 0000:01:00.0: fence driver on ring 6 use gpu addr 0x0000000400000c18 and cpu addr 0x000000007065469b
[ 1605.323464] radeon 0000:01:00.0: fence driver on ring 7 use gpu addr 0x0000000400000c1c and cpu addr 0x00000000b246b6b7
[ 1605.325595] [drm] ring test on 0 succeeded in 4 usecs
[ 1605.325659] [drm] ring test on 1 succeeded in 3 usecs
[ 1605.325667] [drm] ring test on 2 succeeded in 2 usecs
[ 1605.325846] [drm] ring test on 3 succeeded in 5 usecs
[ 1605.325852] [drm] ring test on 4 succeeded in 4 usecs
[ 1605.371875] [drm] ring test on 5 succeeded in 1 usecs
[ 1605.391888] [drm] UVD initialized successfully.
[ 1605.493990] [drm] ring test on 6 succeeded in 1223 usecs
[ 1605.494000] [drm] ring test on 7 succeeded in 4 usecs
[ 1605.494000] [drm] VCE initialized successfully.
[ 1605.494036] [drm:radeon_pm_resume [radeon]] *ERROR* radeon: dpm resume failed
[ 1605.494372] [drm] ib test on ring 0 succeeded in 0 usecs
[ 1605.494505] [drm] ib test on ring 1 succeeded in 0 usecs
[ 1605.494638] [drm] ib test on ring 2 succeeded in 0 usecs
[ 1605.494771] [drm] ib test on ring 3 succeeded in 0 usecs
[ 1605.494903] [drm] ib test on ring 4 succeeded in 0 usecs
[ 1606.021037] [drm] ib test on ring 5 succeeded
[ 1606.042045] [drm] ib test on ring 6 succeeded
[ 1606.042863] [drm] ib test on ring 7 succeeded

--
You are receiving this mail because:
You are the assignee for the bug.