From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751490Ab2AUTDk (ORCPT ); Sat, 21 Jan 2012 14:03:40 -0500 Received: from mail-tul01m020-f174.google.com ([209.85.214.174]:35500 "EHLO mail-tul01m020-f174.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750829Ab2AUTDi (ORCPT ); Sat, 21 Jan 2012 14:03:38 -0500 MIME-Version: 1.0 Date: Sat, 21 Jan 2012 20:03:37 +0100 Message-ID: Subject: [3.3-rc1]radeon 0000:07:00.0: GPU lockup CP stall for more than 10000msec From: Torsten Kaiser To: linux-kernel@vger.kernel.org Cc: Dave Airlie , Alex Deucher , dri-devel@lists.freedesktop.org Content-Type: text/plain; charset=ISO-8859-1 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org After updating to kernel 3.3-rc1 I have experienced a lockup of my GPU. I left my KDE desktop running until the screensaver turned off the monitors. But on key presses it would not turn back on. Ctrl+Alt+F1 to switch to another virtual console also did not work. Alt+SysRq magic still worked, so I was able to force the syslog to disk and restart the system. >>From the log: Jan 21 19:30:01 thoregon cron[3960]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons) Jan 21 19:39:41 thoregon kernel: [ 6364.620131] radeon 0000:07:00.0: GPU lockup CP stall for more than 10000msec Jan 21 19:39:41 thoregon kernel: [ 6364.620139] GPU lockup (waiting for 0x0003F1F2 last fence id 0x0003F1F1) Jan 21 19:39:41 thoregon kernel: [ 6364.636341] radeon 0000:07:00.0: GPU softreset Jan 21 19:39:41 thoregon kernel: [ 6364.636348] radeon 0000:07:00.0: R_008010_GRBM_STATUS=0xA0003028 Jan 21 19:39:41 thoregon kernel: [ 6364.636354] radeon 0000:07:00.0: R_008014_GRBM_STATUS2=0x00000002 Jan 21 19:39:41 thoregon kernel: [ 6364.620131] radeon 0000:07:00.0: GPU lockup CP stall for more than 10000msec Jan 21 19:39:41 thoregon kernel: [ 6364.620139] GPU lockup (waiting for 0x0003F1F2 last fence id 0x0003F1F1) Jan 21 19:39:41 thoregon kernel: [ 6364.636341] radeon 0000:07:00.0: GPU softreset Jan 21 19:39:41 thoregon kernel: [ 6364.636348] radeon 0000:07:00.0: R_008010_GRBM_STATUS=0xA0003028 Jan 21 19:39:41 thoregon kernel: [ 6364.636354] radeon 0000:07:00.0: R_008014_GRBM_STATUS2=0x00000002 Jan 21 19:39:41 thoregon kernel: [ 6364.636359] radeon 0000:07:00.0: R_000E50_SRBM_STATUS=0x200000C0 Jan 21 19:39:41 thoregon kernel: [ 6364.636370] radeon 0000:07:00.0: R_008020_GRBM_SOFT_RESET=0x00007FEE Jan 21 19:39:41 thoregon kernel: [ 6364.651219] radeon 0000:07:00.0: R_008020_GRBM_SOFT_RESET=0x00000001 Jan 21 19:39:41 thoregon kernel: [ 6364.667212] radeon 0000:07:00.0: R_008010_GRBM_STATUS=0x00003028 Jan 21 19:39:41 thoregon kernel: [ 6364.667217] radeon 0000:07:00.0: R_008014_GRBM_STATUS2=0x00000002 Jan 21 19:39:41 thoregon kernel: [ 6364.667223] radeon 0000:07:00.0: R_000E50_SRBM_STATUS=0x200000C0 Jan 21 19:39:41 thoregon kernel: [ 6364.668226] radeon 0000:07:00.0: GPU reset succeed Jan 21 19:39:41 thoregon kernel: [ 6364.673142] [drm] PCIE GART of 512M enabled (table at 0x0000000000040000). Jan 21 19:39:41 thoregon kernel: [ 6364.673177] radeon 0000:07:00.0: WB enabled Jan 21 19:39:41 thoregon kernel: [ 6364.673184] [drm] fence driver on ring 0 use gpu addr 0x20000c00 and cpu addr 0xffff880328636c00 Jan 21 19:39:41 thoregon kernel: [ 6364.719445] [drm] ring test on 0 succeeded in 1 usecs Jan 21 19:40:01 thoregon cron[3975]: (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons) Jan 21 19:43:37 thoregon kernel: [ 6600.390150] INFO: task X:3098 blocked for more than 120 seconds. Jan 21 19:43:37 thoregon kernel: [ 6600.390157] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. Jan 21 19:43:37 thoregon kernel: [ 6600.390163] X D ffff880337d50a00 0 3098 3077 0x00400000 Jan 21 19:43:37 thoregon kernel: [ 6600.390174] ffff88031df15080 0000000000000086 ffff8802f5087300 0000000000010a00 Jan 21 19:43:37 thoregon kernel: [ 6600.390185] ffff88031bf79fd8 0000000000010a00 ffff88031bf78000 ffff88031bf79fd8 Jan 21 19:43:37 thoregon kernel: [ 6600.390194] 0000000000010a00 ffff88031df15080 0000000000010a00 0000000000010a00 Jan 21 19:43:37 thoregon kernel: [ 6600.390203] Call Trace: Jan 21 19:43:37 thoregon kernel: [ 6600.390219] [] ? __mutex_lock_slowpath+0xc8/0x140 Jan 21 19:43:37 thoregon kernel: [ 6600.390230] [] ? mutex_lock+0x1a/0x40 Jan 21 19:43:37 thoregon kernel: [ 6600.390239] [] ? radeon_ib_get+0x52/0x230 Jan 21 19:43:37 thoregon kernel: [ 6600.390249] [] ? r600_ib_test+0x5a/0x300 Jan 21 19:43:37 thoregon kernel: [ 6600.390258] [] ? rv770_startup+0xf7e/0x1590 Jan 21 19:43:37 thoregon kernel: [ 6600.390267] [] ? rv770_resume+0x2c/0x90 Jan 21 19:43:37 thoregon kernel: [ 6600.390275] [] ? radeon_gpu_reset+0x11e/0x160 Jan 21 19:43:37 thoregon kernel: [ 6600.390284] [] ? radeon_fence_wait+0x363/0x3b0 Jan 21 19:43:37 thoregon kernel: [ 6600.390293] [] ? wake_up_bit+0x40/0x40 Jan 21 19:43:37 thoregon kernel: [ 6600.390301] [] ? radeon_ib_get+0x1e7/0x230 Jan 21 19:43:37 thoregon kernel: [ 6600.390310] [] ? radeon_cs_ioctl+0x27a/0x4d0 Jan 21 19:43:37 thoregon kernel: [ 6600.390319] [] ? drm_ioctl+0x3e4/0x490 Jan 21 19:43:37 thoregon kernel: [ 6600.390327] [] ? radeon_cs_finish_pages+0xa0/0xa0 Jan 21 19:43:37 thoregon kernel: [ 6600.390336] [] ? do_page_fault+0x199/0x420 Jan 21 19:43:37 thoregon kernel: [ 6600.390344] [] ? mmap_region+0x1dc/0x570 Jan 21 19:43:37 thoregon kernel: [ 6600.390352] [] ? do_vfs_ioctl+0x96/0x4e0 Jan 21 19:43:37 thoregon kernel: [ 6600.390359] [] ? __schedule+0x28c/0x630 Jan 21 19:43:37 thoregon kernel: [ 6600.390366] [] ? sys_ioctl+0x49/0x90 Jan 21 19:43:37 thoregon kernel: [ 6600.390375] [] ? system_call_fastpath+0x16/0x1b Jan 21 19:45:08 thoregon kernel: [ 6691.864440] SysRq : Emergency Sync Jan 21 19:45:08 thoregon kernel: [ 6691.864838] Emergency Sync complete Jan 21 19:45:14 thoregon kernel: [ 6697.476112] SysRq : Emergency Remount R/O Jan 21 19:46:33 thoregon kernel: [ 0.000000] Linux version 3.3.0-rc1 (root@thoregon) (gcc version 4.5.3 (Gentoo 4.5.3-r2 p1.0, pie-0.4.6) ) #1 SMP Fri Jan 20 09:54:26 CET 2012 I did not have any trouble with 3.2 or earlier kernel, so it looks like an regression in 3.3-rc1. Info from my card: thoregon ~ # lspci -vvs 07:00.0 07:00.0 VGA compatible controller: Advanced Micro Devices [AMD] nee ATI RV730 PRO [Radeon HD 4650] (prog-if 00 [VGA controller]) Subsystem: Hightech Information System Ltd. Device 2269 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- SERR- Kernel driver in use: radeon Please ask, if you need any other information, I will try to provide it. Torsten