* [libva] GPU hung
@ 2012-06-12 19:13 Angela
  2012-06-12 19:29 ` Daniel Vetter
  0 siblings, 1 reply; 10+ messages in thread
From: Angela @ 2012-06-12 19:13 UTC (permalink / raw)
  To: intel-gfx

[-- Attachment #1: Type: text/plain, Size: 1039 bytes --]

Hello
When I play recordings with MythTV 0.25 using VAAPI, I get 0.05 sec of sound,
nothing is displayed, and I get the following reproducible error:

dmesg.txt:[ 1104.077320] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer
elapsed... GPU hung

Killing mythfrontend does not work. Running intel_reg_dumper, xvinfo, etc.
afterwards hangs as well. The system needs rebooting.

This is on SandyBridge 2600K, 1080i HDMI on today's head kernel (>3.5-rc2)
and updated xorg-edgers on Ubuntu 12.04. No xorg.conf.

Please find debugging info in attached problem_gpu_hang.zip (50KB), with
drm.debug=0x06. 
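
For reference, drm.debug is a bitmask of debug categories. The following is a minimal sketch of decoding such a value, assuming the category bits from the kernel's DRM headers of this era (0x01 core, 0x02 driver, 0x04 KMS; later kernels define further bits that are not listed here). The helper name decode_drm_debug is hypothetical.

```python
# Debug category bits of the drm.debug module parameter.
# Assumption: values as defined for 3.x-era kernels; newer kernels add more.
DRM_DEBUG_BITS = {
    0x01: "core",
    0x02: "driver",
    0x04: "kms",
}


def decode_drm_debug(mask):
    """Return the names of the known debug categories enabled by `mask`."""
    return [name for bit, name in sorted(DRM_DEBUG_BITS.items()) if mask & bit]
```

With the value used above, decode_drm_debug(0x06) selects the driver and KMS categories, which is output mainly aimed at modeset debugging.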

I reported a GPU hang issue some months ago but didn't receive an answer. I
hope to get a reply this time (I don't expect an immediate fix, but I would like
to hear that someone will look into it in the next days/weeks).
I hope to get this fixed because many new 1080i channels are available that are
not viewable at the moment, and I hope to have better luck with VAAPI. I have a
spare Nvidia GPU with a cooler, but I would like to stick with the Intel onboard
GPU as it is already cooled.

Angela


[-- Attachment #2: problem_gpu_hang.zip --]
[-- Type: application/octet-stream, Size: 50773 bytes --]

[-- Attachment #3: Type: text/plain, Size: 159 bytes --]

_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx


* Re: [libva] GPU hung
  2012-06-12 19:13 [libva] GPU hung Angela
@ 2012-06-12 19:29 ` Daniel Vetter
  2012-06-12 20:26   ` Angela
  0 siblings, 1 reply; 10+ messages in thread
From: Daniel Vetter @ 2012-06-12 19:29 UTC (permalink / raw)
  To: Angela; +Cc: intel-gfx

For gpu hangs the important thing is the i915_error_state file from
sysfs (the files you've attached are mainly interesting for modeset
issues). I guess the best thing would be to file a bug on
bugs.freedesktop.org with that.
-Daniel
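
A minimal sketch of saving the error state right after a hang and before rebooting, assuming debugfs is mounted at /sys/kernel/debug and the GPU is DRM minor 0. The helper names and the destination path are hypothetical, and reading debugfs normally requires root.

```python
from pathlib import Path

# Text the driver reports when nothing has been captured.
NO_STATE = "no error state collected"


def has_error_state(text):
    """True if `text` looks like a captured error state rather than the placeholder."""
    stripped = text.strip()
    return bool(stripped) and stripped != NO_STATE


def read_error_state(path="/sys/kernel/debug/dri/0/i915_error_state"):
    """Return the captured error state as text, or None if there is none."""
    try:
        text = Path(path).read_text(errors="replace")
    except OSError:
        # File missing or unreadable (e.g. debugfs not mounted, not root).
        return None
    return text if has_error_state(text) else None


def save_error_state(dest, src="/sys/kernel/debug/dri/0/i915_error_state"):
    """Copy the error state to `dest`; return False if nothing was captured."""
    state = read_error_state(src)
    if state is None:
        return False
    Path(dest).write_text(state)
    return True
```

Calling save_error_state("/root/i915_error_state.txt") while the machine is still up after the hang would keep a copy that can be attached to a bug report.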



-- 
Daniel Vetter
daniel.vetter@ffwll.ch - +41 (0) 79 364 57 48 - http://blog.ffwll.ch


* Re: [libva] GPU hung
  2012-06-12 19:29 ` Daniel Vetter
@ 2012-06-12 20:26   ` Angela
  2012-06-12 21:29     ` Daniel Vetter
  0 siblings, 1 reply; 10+ messages in thread
From: Angela @ 2012-06-12 20:26 UTC (permalink / raw)
  To: 'Daniel Vetter'; +Cc: intel-gfx

> For gpu hangs the important thing is the i915_error_state file from sysfs
> (the files you've attached are mainly interesting for modeset issues). I
> guess the best thing would be to file a bug on bugs.freedesktop.org with
> that.

Hello Daniel

I get the following:

# cat /sys/kernel/debug/dri/0/i915_error_state
no error state collected
# cat /sys/kernel/debug/dri/64/i915_error_state
no error state collected

mount:
sysfs on /sys type sysfs (rw,noexec,nosuid,nodev)

What is wrong ?

Angela


* Re: [libva] GPU hung
  2012-06-12 20:26   ` Angela
@ 2012-06-12 21:29     ` Daniel Vetter
  2012-06-12 22:09       ` Angela
  0 siblings, 1 reply; 10+ messages in thread
From: Daniel Vetter @ 2012-06-12 21:29 UTC (permalink / raw)
  To: Angela; +Cc: intel-gfx

On Tue, Jun 12, 2012 at 10:26:19PM +0200, Angela wrote:
> > For gpu hangs the important thing is the i915_error_state file from sysfs
> > (the files you've attached are mainly interesting for modeset issues). I
> > guess the best thing would be to file a bug on bugs.freedesktop.org with
> > that.
> 
> Hello Daniel
> 
> I get the following:
> 
> # cat /sys/kernel/debug/dri/0/i915_error_state
> no error state collected
> # cat /sys/kernel/debug/dri/64/i915_error_state
> no error state collected
> 
> mount:
> sysfs on /sys type sysfs (rw,noexec,nosuid,nodev)
> 
> What is wrong ?

I guess you need to rehang your machine; the error_state doesn't survive a
reboot.
-Daniel
-- 
Daniel Vetter
Mail: daniel@ffwll.ch
Mobile: +41 (0)79 365 57 48


* Re: [libva] GPU hung
  2012-06-12 21:29     ` Daniel Vetter
@ 2012-06-12 22:09       ` Angela
  2012-06-12 22:30         ` Paul Menzel
  0 siblings, 1 reply; 10+ messages in thread
From: Angela @ 2012-06-12 22:09 UTC (permalink / raw)
  To: 'Daniel Vetter', intel-gfx

>> > For gpu hangs the important thing is the i915_error_state file from
>> > sysfs (the files you've attached are mainly interesting for modeset issues).
>> > I guess the best thing would be to file a bug on bugs.freedesktop.org
>> > with that.
>>
>> # cat /sys/kernel/debug/dri/0/i915_error_state
>> no error state collected
>> # cat /sys/kernel/debug/dri/64/i915_error_state
>> no error state collected
>>
>> mount:
>> sysfs on /sys type sysfs (rw,noexec,nosuid,nodev)
> I guess you need to rehang your machine, the error_state doesn't survive a
> reboot.

Hung the system and checked i915_error_state before reboot.

Any other hints ?

Angela


* Re: [libva] GPU hung
  2012-06-12 22:09       ` Angela
@ 2012-06-12 22:30         ` Paul Menzel
  2012-06-13 16:27           ` Angela
  0 siblings, 1 reply; 10+ messages in thread
From: Paul Menzel @ 2012-06-12 22:30 UTC (permalink / raw)
  To: Angela; +Cc: intel-gfx


[-- Attachment #1.1: Type: text/plain, Size: 1312 bytes --]

Am Mittwoch, den 13.06.2012, 00:09 +0200 schrieb Angela:
> >> > For gpu hangs the important thing is the i915_error_state file from
> >> > sysfs (the files you've attached are mainly interesting for modeset issues).
> >> > I guess the best thing would be to file a bug on bugs.freedesktop.org
> >> > with that.
> >>
> >> # cat /sys/kernel/debug/dri/0/i915_error_state
> >> no error state collected
> >> # cat /sys/kernel/debug/dri/64/i915_error_state
> >> no error state collected
> >>
> >> mount:
> >> sysfs on /sys type sysfs (rw,noexec,nosuid,nodev)
> > I guess you need to rehang your machine, the error_state doesn't survive a
> > reboot.
> 
> Hung the system and checked i915_error_state before reboot.
> 
> Any other hints ?

I think you have to mount the debug filesystem [1].


Thanks,

Paul


PS: Maybe that and the i915_error_state stuff could be added to the
Intel Web site too [2].

PPS: Angela, you can also join the IRC channel #intel-gfx for live
debugging.

PPPS: Angela, could you use a mail program that quotes correctly? Outlook
should be able to do it, but I do not know how to configure it.
Thunderbird works under Microsoft Windows.


[1] https://en.wikipedia.org/wiki/Debugfs
[2] http://intellinuxgraphics.org/how_to_report_bug.html
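
A minimal sketch of checking whether debugfs is mounted, assuming the usual /proc/mounts layout (device, mount point, filesystem type, options, ...). The function names are hypothetical. If nothing is found, running `mount -t debugfs none /sys/kernel/debug` as root is the usual way to mount it.

```python
def debugfs_mountpoints(mounts_text):
    """Return all mount points whose filesystem type is debugfs."""
    points = []
    for line in mounts_text.splitlines():
        fields = line.split()
        # Fields: device, mount point, fs type, options, dump, pass.
        if len(fields) >= 3 and fields[2] == "debugfs":
            points.append(fields[1])
    return points


def read_mounts(path="/proc/mounts"):
    """Read the kernel's mount table as text."""
    with open(path) as f:
        return f.read()
```

On a live system, debugfs_mountpoints(read_mounts()) returns an empty list when debugfs is not mounted. A sysfs line alone, such as the one from the mount output quoted above, does not count.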

[-- Attachment #1.2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 198 bytes --]


* Re: [libva] GPU hung
  2012-06-12 22:30         ` Paul Menzel
@ 2012-06-13 16:27           ` Angela
  2012-06-14  1:32             ` Xiang, Haihao
  0 siblings, 1 reply; 10+ messages in thread
From: Angela @ 2012-06-13 16:27 UTC (permalink / raw)
  To: 'Paul Menzel', 'Daniel Vetter', intel-gfx

> Am Mittwoch, den 13.06.2012, 00:09 +0200 schrieb Angela:
> > >> >For gpu hangs the important thing is the i915_error_state file
> > >> >from sysfs
> > >> (the files you've attached are mainly interesting for modeset issues).
> > >> I guess the best thing would be to file a bug on
> > >> bugs.freedesktop.org with that.
> > >>
> > >> # cat /sys/kernel/debug/dri/0/i915_error_state
> > >> no error state collected
> > >> # cat /sys/kernel/debug/dri/64/i915_error_state
> > >> no error state collected

I copied the wrong line from mount; debugfs is always mounted on Ubuntu:
none on /sys/kernel/debug type debugfs (rw)

Still, I don't get any output right after the crash, see above. I tried 1080i recordings several times, always with the result
no error state collected


The error below is from a 720p recording, which behaves and looks different.

Here, cat .../debug/dri/0/i915_error_state blocks:

Jun 13 17:35:39 minerva11 kernel: [68682.433743] [drm:i915_driver_open],
Jun 13 17:35:39 minerva11 kernel: [68682.691256] [drm:i915_driver_open],
Jun 13 17:35:39 minerva11 kernel: [68682.860834] [drm:i915_driver_open],
Jun 13 17:35:40 minerva11 kernel: [68683.185483] [drm:drm_mode_addfb], [FB:27]
Jun 13 17:35:46 minerva11 kernel: [68689.196485] [drm:i915_hangcheck_hung] *ERROR* Hangcheck timer elapsed... GPU hung
Jun 13 17:35:46 minerva11 kernel: [68689.196946] [drm] capturing error event; look for more information in /debug/dri/0/i915_error_state
Jun 13 17:35:46 minerva11 kernel: [68689.199780] [drm:i915_error_work_func], resetting chip
Jun 13 17:35:46 minerva11 kernel: [68689.199843] [drm:drm_crtc_helper_set_config],
Jun 13 17:35:46 minerva11 kernel: [68689.199845] [drm:drm_crtc_helper_set_config], [CRTC:3] [NOFB]
Jun 13 17:35:46 minerva11 kernel: [68689.199867] [drm:ironlake_crtc_dpms], crtc 0/0 dpms off
Jun 13 17:35:46 minerva11 kernel: [68689.199869] [drm:i915_get_vblank_timestamp], crtc 0 is disabled
Jun 13 17:35:46 minerva11 kernel: [68689.205723] [drm:intel_prepare_page_flip], preparing flip with no unpin work?
Jun 13 17:35:46 minerva11 udevd[19509]: failed to execute '/usr/share/apport/apport-gpu-error-intel.py' '/usr/share/apport/apport-gpu-error-intel.py': No such file or directory
Jun 13 17:36:46  dhclient: last message repeated 4 times
Jun 13 17:36:46 minerva11 kernel: [68689.252474] [drm:intel_disable_pch_pll], disable PCH PLL c6014 (active 1, on? 1) for crtc 3
Jun 13 17:36:46 minerva11 kernel: [68689.252476] [drm:intel_disable_pch_pll], disabling PCH PLL c6014
Jun 13 17:36:46 minerva11 kernel: [68689.252882] [drm:intel_update_fbc],
Jun 13 17:36:46 minerva11 kernel: [68689.252906] ------------[ cut here ]------------
Jun 13 17:36:46 minerva11 kernel: [68689.253355] kernel BUG at drivers/gpu/drm/i915/i915_gem.c:3084!
Jun 13 17:36:46 minerva11 kernel: [68689.253795] invalid opcode: 0000 [#1] SMP
Jun 13 17:36:46 minerva11 kernel: [68689.254227] CPU 3
Jun 13 17:36:46 minerva11 kernel: [68689.254672] Modules linked in: des_generic md4 nls_utf8 cifs xts gf128mul autofs4 dm_crypt binfmt_misc ipt_MASQUERADE xt_conntrack snd_hda_codec_hdmi snd_hda_codec_realtek iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 iptable_filter ip_tables x_tables tda18271c2dd(O) snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_seq_midi snd_rawmidi snd_seq_midi_event drxk(O) snd_seq arc4 ath9k mac80211 ath9k_common ath9k_hw eeepc_wmi ddbridge(O) snd_timer snd_seq_device dvb_core(O) asus_wmi cxd2099(O) coretemp hid_generic ath btusb snd mei bluetooth psmouse serio_raw lpc_ich sparse_keymap soundcore cfg80211 snd_page_alloc microcode lp parport usbhid hid usb_storage uas 8139too mxm_wmi 8139cp ghash_clmulni_intel aesni_intel cryptd firewire_ohci aes_x86_64 firewire_core crc_itu_t i915 drm_kms_helper drm ahci e1000e libahci i2c_algo_bit xhci_hcd video wmi [last unloaded: kvm]
Jun 13 17:36:46 minerva11 kernel: [68689.256832]
Jun 13 17:36:46 mi
Jun 13 17:44:59 minerva11 kernel: imklog 5.8.6, log source = /proc/kmsg started.

Again with 1080i. I also noticed that the stack trace is not always the same, but it always ends at the same point (intel_unpin_fb_obj):

Jun 13 18:13:36 minerva11 kernel: [ 1737.267614] ------------[ cut here ]------------
Jun 13 18:13:36 minerva11 kernel: [ 1737.267630] kernel BUG at drivers/gpu/drm/i915/i915_gem.c:3084!
Jun 13 18:13:36 minerva11 kernel: [ 1737.267646] invalid opcode: 0000 [#1] SMP
Jun 13 18:13:36 minerva11 kernel: [ 1737.267657] CPU 2
Jun 13 18:13:36 minerva11 kernel: [ 1737.267683] Modules linked in: xts gf128mul autofs4 binfmt_misc dm_crypt btusb ipt_MASQUERADE xt_conntrack iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 iptable_filter ip_tables x_tables snd_hda_codec_hdmi snd_hda_codec_realtek tda18271c2dd(O) arc4 ath9k mac80211 snd_hda_intel ath9k_common snd_hda_codec snd_hwdep snd_pcm snd_seq_midi ath9k_hw snd_rawmidi snd_seq_midi_event drxk(O) snd_seq psmouse eeepc_wmi asus_wmi ddbridge(O) ath sparse_keymap serio_raw dvb_core(O) cxd2099(O) coretemp hid_generic snd_timer snd_seq_device ath3k mei lpc_ich snd bluetooth soundcore snd_page_alloc microcode cfg80211 lp parport usbhid hid usb_storage uas firewire_ohci firewire_core crc_itu_t 8139too 8139cp ghash_clmulni_intel xhci_hcd aesni_intel cryptd aes_x86_64 i915 mxm_wmi drm_kms_helper drm ahci libahci i2c_algo_bit e1000e video wmi [last unloaded: kvm]
Jun 13 18:13:36 minerva11 kernel: [ 1737.267849]
Jun 13 18:13:36 minerva11 kernel: [ 1737.267856] Pid: 4612, comm: Xorg Tainted: G        W  O 3.5.0-rc2-mine5+ #1 System manufacturer System Product Name/P8Z68-V PRO
Jun 13 18:13:36 minerva11 kernel: [ 1737.267891] RIP: 0010:[<ffffffffa00f4aff>]  [<ffffffffa00f4aff>] i915_gem_object_unpin+0x4f/0x60 [i915]
Jun 13 18:13:36 minerva11 kernel: [ 1737.267913] RSP: 0018:ffff8801d2b9fb48  EFLAGS: 00010246
Jun 13 18:13:36 minerva11 kernel: [ 1737.267926] RAX: ffff88020edb4000 RBX: ffff880213d3b000 RCX: 0000000000000000
Jun 13 18:13:36 minerva11 kernel: [ 1737.267942] RDX: 0000000003020402 RSI: 0000000000070008 RDI: ffff88021147f000
Jun 13 18:13:36 minerva11 kernel: [ 1737.267959] RBP: ffff8801d2b9fb48 R08: 000000000000000a R09: 0000000000000000
Jun 13 18:13:36 minerva11 kernel: [ 1737.267976] R10: 0000000000000000 R11: 0000000000000000 R12: ffff880214139820
Jun 13 18:13:36 minerva11 kernel: [ 1737.267992] R13: ffff88020edb4000 R14: ffff880214139800 R15: ffff8801d2b9fc98
Jun 13 18:13:36 minerva11 kernel: [ 1737.268009] FS:  00007fd0bc238880(0000) GS:ffff88021f280000(0000) knlGS:0000000000000000
Jun 13 18:13:36 minerva11 kernel: [ 1737.268028] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Jun 13 18:13:36 minerva11 kernel: [ 1737.268052] CR2: 00007f2ce2f453d0 CR3: 00000002128a6000 CR4: 00000000000407e0
Jun 13 18:13:36 minerva11 kernel: [ 1737.268069] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Jun 13 18:13:36 minerva11 kernel: [ 1737.268096] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Jun 13 18:13:36 minerva11 kernel: [ 1737.268113] Process Xorg (pid: 4612, threadinfo ffff8801d2b9e000, task ffff8801d2addbc0)
Jun 13 18:13:36 minerva11 kernel: [ 1737.268131] Stack:
Jun 13 18:13:36 minerva11 kernel: [ 1737.268138]  ffff8801d2b9fb58 ffffffffa010567c ffff8801d2b9fb88 ffffffffa0108abc
Jun 13 18:13:36 minerva11 kernel: [ 1737.268156]  ffff880214139c38 ffff880213d3b000 ffffffffa01474c0 ffff880214139c68
Jun 13 18:13:36 minerva11 kernel: [ 1737.268173]  ffff8801d2b9fbb8 ffffffffa00d56a5 ffff8801d2b9ffd8 ffff880213d3b000
Jun 13 18:13:36 minerva11 kernel: [ 1737.268190] Call Trace:
Jun 13 18:13:36 minerva11 kernel: [ 1737.268204]  [<ffffffffa010567c>] intel_unpin_fb_obj+0x3c/0x40 [i915]
Jun 13 18:13:36 minerva11 kernel: [ 1737.268224]  [<ffffffffa0108abc>] intel_crtc_disable+0x8c/0xb0 [i915]
Jun 13 18:13:36 minerva11 kernel: [ 1737.268243]  [<ffffffffa00d56a5>] drm_helper_disable_unused_functions+0x115/0x190 [drm_kms_helper]
Jun 13 18:13:36 minerva11 kernel: [ 1737.268266]  [<ffffffffa00d70ce>] drm_crtc_helper_set_config+0xa5e/0xaf0 [drm_kms_helper]
Jun 13 18:13:36 minerva11 kernel: [ 1737.268288]  [<ffffffff8111d095>] ? __generic_file_aio_write+0x235/0x440
Jun 13 18:13:36 minerva11 kernel: [ 1737.268308]  [<ffffffffa010567c>] ? intel_unpin_fb_obj+0x3c/0x40 [i915]
Jun 13 18:13:36 minerva11 kernel: [ 1737.268331]  [<ffffffffa006d35e>] drm_framebuffer_cleanup+0xfe/0x180 [drm]
Jun 13 18:13:36 minerva11 kernel: [ 1737.268352]  [<ffffffffa01024b1>] intel_user_framebuffer_destroy+0x21/0x80 [i915]
Jun 13 18:13:36 minerva11 kernel: [ 1737.268374]  [<ffffffffa0070d4b>] drm_mode_rmfb+0xfb/0x100 [drm]
Jun 13 18:13:36 minerva11 kernel: [ 1737.268393]  [<ffffffffa00604cc>] drm_ioctl+0x47c/0x540 [drm]
Jun 13 18:13:36 minerva11 kernel: [ 1737.268408]  [<ffffffff81063f37>] ? __set_task_blocked+0x37/0x80
Jun 13 18:13:36 minerva11 kernel: [ 1737.268427]  [<ffffffffa0070c50>] ? drm_mode_addfb2+0x690/0x690 [drm]
Jun 13 18:13:36 minerva11 kernel: [ 1737.268444]  [<ffffffff81179912>] ? do_sync_write+0xd2/0x110
Jun 13 18:13:36 minerva11 kernel: [ 1737.268458]  [<ffffffff811b8202>] ? fsnotify+0x1d2/0x2b0
Jun 13 18:13:36 minerva11 kernel: [ 1737.268472]  [<ffffffff8118b988>] do_vfs_ioctl+0x98/0x550
Jun 13 18:13:36 minerva11 kernel: [ 1737.268486]  [<ffffffff8118bed1>] sys_ioctl+0x91/0xa0
Jun 13 18:13:36 minerva11 kernel: [ 1737.268500]  [<ffffffff8162e2e9>] system_call_fastpath+0x16/0x1b
Jun 13 18:13:36 minerva11 kernel: [ 1737.268527] Code: e2 ff 1f fe ff c1 e8 0d 83 c0 0f 83 e0 0f 89 c1 83 e1 0f c1 e1 0d 09 ca 84 c0 89 97 f0 00 00 00 75 07 80 a7 f2 00 00 00 f7 5d c3 <0f> 0b 0f 0b 66 66 66 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5
Jun 13 18:13:36 minerva11 kernel: [ 1737.268573] RIP  [<ffffffffa00f4aff>] i915_gem_object_unpin+0x4f/0x60 [i915]
Jun 13 18:13:36 minerva11 kernel: [ 1737.268590]  RSP <ffff8801d2b9fb48>
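
To compare oopses like the two above, a minimal sketch that pulls the BUG location and the call-trace frames out of a syslog excerpt may help. It assumes the line layout shown in this log. Frames printed with a leading "?" are addresses found on the stack that the unwinder could not confirm as part of the call chain, so they are flagged as unreliable. The function name parse_oops is hypothetical.

```python
import re

# "[<address>] function+0xoff/0xsize", optionally preceded by "? ".
FRAME_RE = re.compile(
    r"\[<[0-9a-f]+>\]\s+(\?\s+)?([A-Za-z_][\w.]*)\+0x[0-9a-f]+/0x[0-9a-f]+"
)
BUG_RE = re.compile(r"kernel BUG at (\S+):(\d+)!")


def parse_oops(log_text):
    """Return ((file, line) or None, [(function, reliable), ...]) for one oops."""
    bug = None
    frames = []
    in_trace = False
    for line in log_text.splitlines():
        m = BUG_RE.search(line)
        if m:
            bug = (m.group(1), int(m.group(2)))
        if "Call Trace:" in line:
            in_trace = True
            continue
        if in_trace:
            f = FRAME_RE.search(line)
            if f is None:
                # First non-frame line (e.g. "Code: ...") ends the trace.
                in_trace = False
                continue
            frames.append((f.group(2), f.group(1) is None))
    return bug, frames
```

Filtering the result down to the reliable frames of two different oopses makes it easy to see whether they share the same path into i915_gem_object_unpin.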

Is /usr/share/apport/apport-gpu-error-intel.py of any help, and where can I find it?
Is the kernel stack trace of any use? See also my previous post.
Are there other ways of getting a trace?

Angela

> 
> PS: Maybe that and the i915_error_state stuff could be added to the Intel
> Web site too [2].
> 
> PPS: Angela, you can also join the IRC channel #intel-gfx for live debugging.

I just logged in, but I don't want to interrupt a heated discussion.

> PPPS: Angela, can you use a different mail program which can quote
> correctly. Outlook should be able to do it, but I do not know how to configure
> it. Thunderbird works under Microsoft Windows.

I have quoting enabled in Outlook. 


* Re: [libva] GPU hung
  2012-06-13 16:27           ` Angela
@ 2012-06-14  1:32             ` Xiang, Haihao
  2012-07-02 18:49               ` Christophe Oosterlynck
  0 siblings, 1 reply; 10+ messages in thread
From: Xiang, Haihao @ 2012-06-14  1:32 UTC (permalink / raw)
  To: Angela; +Cc: intel-gfx, 'Paul Menzel'


Hi, Angela

Could you file a bug and provide more details on how to reproduce this
issue?

Thanks
Haihao


* Re: [libva] GPU hung
  2012-06-14  1:32             ` Xiang, Haihao
@ 2012-07-02 18:49               ` Christophe Oosterlynck
  2012-07-04  6:51                 ` Xiang, Haihao
  0 siblings, 1 reply; 10+ messages in thread
From: Christophe Oosterlynck @ 2012-07-02 18:49 UTC (permalink / raw)
  To: intel-gfx

Hi,

Is there any update on this issue or has a bug been reported?

I seem to have a similar issue ("[drm:i915_hangcheck_hung] *ERROR* Hangcheck 
timer") when using vaapi with gstreamer.

Best regards,

Christophe


* Re: [libva] GPU hung
  2012-07-02 18:49               ` Christophe Oosterlynck
@ 2012-07-04  6:51                 ` Xiang, Haihao
  0 siblings, 0 replies; 10+ messages in thread
From: Xiang, Haihao @ 2012-07-04  6:51 UTC (permalink / raw)
  To: Christophe Oosterlynck; +Cc: intel-gfx

On Mon, 2012-07-02 at 18:49 +0000, Christophe Oosterlynck wrote: 
> Hi,
> 
> Is there any update on this issue or has a bug been reported?
> 
> I seem to have a similar issue ("[drm:i915_hangcheck_hung] *ERROR* Hangcheck 
> timer") when using vaapi with gstreamer.
> 

https://bugs.freedesktop.org/show_bug.cgi?id=51061

Angela and I can't reproduce this issue with MPlayer using VAAPI. Could you
give it a try as well?

Thanks
Haihao

> Best regards,
> 
> Christophe
> 

