From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@freedesktop.org Subject: [Bug 105684] Loading amdgpu hits general protection fault: 0000 [#1] SMP NOPTI Date: Tue, 27 Mar 2018 06:39:45 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1558916868==" Return-path: Received: from culpepper.freedesktop.org (culpepper.freedesktop.org [131.252.210.165]) by gabe.freedesktop.org (Postfix) with ESMTP id 096586E564 for ; Tue, 27 Mar 2018 06:39:45 +0000 (UTC) In-Reply-To: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dri-devel-bounces@lists.freedesktop.org Sender: "dri-devel" To: dri-devel@lists.freedesktop.org List-Id: dri-devel@lists.freedesktop.org --===============1558916868== Content-Type: multipart/alternative; boundary="15221327840.E066F0c5.2953" Content-Transfer-Encoding: 7bit --15221327840.E066F0c5.2953 Date: Tue, 27 Mar 2018 06:39:44 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated https://bugs.freedesktop.org/show_bug.cgi?id=3D105684 --- Comment #14 from jian-hong@endlessm.com --- Created attachment 138370 --> https://bugs.freedesktop.org/attachment.cgi?id=3D138370&action=3Dedit dmesg of loading amdgpu module - tested in 4.16-rc7 I also tested with linux kernel v4.16-rc7 and got the same problem again wh= en I loaded amdgpu module manually. The attechment is the whole dmesg for this test. <6>[ 96.502996] [drm] amdgpu kernel modesetting enabled. <6>[ 96.511435] AMD IOMMUv2 driver by Joerg Roedel <6>[ 96.518456] Parsing CRAT table with 1 nodes <6>[ 96.519588] Creating topology SYSFS entries <6>[ 96.520729] Topology: Add APU node [0x0:0x0] <6>[ 96.521851] Finished initializing topology <6>[ 96.522991] kfd kfd: Initialized module <7>[ 96.524244] checking generic (e0000000 7f0000) vs hw (e0000000 100000= 00) <6>[ 96.524245] fb: switching to amdgpudrmfb from EFI VGA <6>[ 96.525412] Console: switching to colour dummy device 80x25 <6>[ 96.525551] amdgpu 0000:09:00.0: enabling device (0006 -> 0007) <6>[ 96.525713] [drm] initializing kernel modesetting (RAVEN 0x1002:0x15DD 0x1025:0x1257 0xC6). <6>[ 96.525744] [drm] register mmio base: 0xFE700000 <6>[ 96.525746] [drm] register mmio size: 524288 <6>[ 96.528186] [drm] probing gen 2 caps for device 1022:15db =3D 700d03/e <6>[ 96.528193] [drm] probing mlw for device 1022:15db =3D 700d03 <6>[ 96.528392] [drm] VCN decode is enabled in VM mode <6>[ 96.528395] [drm] VCN encode is enabled in VM mode <6>[ 96.528411] ATOM BIOS: 113-RAVEN-T08 <6>[ 96.528443] [drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit <6>[ 96.528455] amdgpu 0000:09:00.0: GTT: 1024M 0x000000F500000000 - 0x000000F53FFFFFFF <6>[ 96.528467] [drm] Detected VRAM RAM=3D1024M, BAR=3D256M <6>[ 96.528468] [drm] RAM width 64bits UNKNOWN <6>[ 96.528530] [TTM] Zone kernel: Available graphics memory: 15950952 k= iB <6>[ 96.528532] [TTM] Zone dma32: Available graphics memory: 2097152 kiB <6>[ 96.528534] [TTM] Initializing pool allocator <6>[ 96.528538] [TTM] Initializing DMA pool allocator <6>[ 96.528567] [drm] amdgpu: 1024M of VRAM memory ready <6>[ 96.528569] [drm] amdgpu: 3072M of GTT memory ready. <6>[ 96.528576] [drm] GART: num cpu pages 262144, num gpu pages 262144 <6>[ 96.529123] [drm] PCIE GART of 1024M enabled (table at 0x000000F400800000). <6>[ 96.550367] [drm] use_doorbell being set to: [true] <6>[ 96.565847] [drm] Found VCN firmware Version: 1.45 Family ID: 18 <6>[ 96.720523] [drm] Display Core initialized with v3.1.27! <4>[ 96.720559] general protection fault: 0000 [#1] SMP NOPTI <4>[ 96.720562] Modules linked in: amdkfd amd_iommu_v2 amdgpu(+) chash gpu_sched ttm drm_kms_helper drm i2c_algo_bit fb_sys_fops syscopyarea sysfillrect sysimgblt efi_pstore arc4 cmac bnep edac_mce_amd kvm_amd ccp kvm irqbypass snd_hda_codec_realtek ath10k_pci ath10k_core ath btusb crct10dif_pclmul crc32_pclmul ghash_clmulni_intel btrtl input_leds btbcm btintel mac80211 snd_hda_codec_generic snd_hda_codec_hdmi snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_timer snd soundcore blueto= oth ecdh_generic pcbc aesni_intel cfg80211 r8169 i2c_piix4 wmi_bmof mii sparse_keymap shpchp aes_x86_64 crypto_simd glue_helper cryptd psmouse mac_= hid tpm_crb video wmi zram ip_tables x_tables serio_raw uas usb_storage ahci libahci hid_generic usbhid hid <4>[ 96.720624] CPU: 0 PID: 933 Comm: modprobe Not tainted 4.16.0-rc7+ #8 <4>[ 96.720626] Hardware name: Acer Aspire TC-380/Aspire TC-380, BIOS D05 02/01/2018 <4>[ 96.720635] RIP: 0010:prefetch_freepointer+0x15/0x30 <4>[ 96.720637] RSP: 0018:ffffba2808367840 EFLAGS: 00010202 <4>[ 96.720640] RAX: 0000000000000000 RBX: ffff9fcb9d9c5800 RCX: 0000000000000ae8 <4>[ 96.720643] RDX: 0000000000000ae7 RSI: 597f068ab1e00726 RDI: ffff9fcbbf006e80 <4>[ 96.720645] RBP: ffffba2808367840 R08: ffff9fcbbf627160 R09: ffffffffc096ef82 <4>[ 96.720647] R10: 0000000000000024 R11: ffff9fcb99f3ac97 R12: 00000000014080c0 <4>[ 96.720649] R13: ffff9fcbbf006e80 R14: ffff9fcb9d9c5800 R15: ffff9fcbbf006e80 <4>[ 96.720653] FS: 00007f8bc0eab700(0000) GS:ffff9fcbbf600000(0000) knlGS:0000000000000000 <4>[ 96.720655] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 <4>[ 96.720658] CR2: 00007ffc4eec6ff8 CR3: 00000007dd9a6000 CR4: 00000000003406f0 <4>[ 96.720660] Call Trace: <4>[ 96.720666] kmem_cache_alloc_trace+0xa5/0x1c0 <4>[ 96.720741] ? dm_hw_init+0x462/0xed0 [amdgpu] <4>[ 96.720792] dm_hw_init+0x462/0xed0 [amdgpu] <4>[ 96.720832] amdgpu_device_init+0xc1b/0x1340 [amdgpu] <4>[ 96.720872] amdgpu_driver_load_kms+0x8b/0x2c0 [amdgpu] <4>[ 96.720888] drm_dev_register+0x149/0x1e0 [drm] <4>[ 96.720927] amdgpu_pci_probe+0x10a/0x180 [amdgpu] <4>[ 96.720931] local_pci_probe+0x4a/0xa0 <4>[ 96.720934] pci_device_probe+0x109/0x1b0 <4>[ 96.720938] driver_probe_device+0x2bb/0x4a0 <4>[ 96.720941] __driver_attach+0xe2/0xf0 <4>[ 96.720944] ? driver_probe_device+0x4a0/0x4a0 <4>[ 96.720947] bus_for_each_dev+0x6a/0xc0 <4>[ 96.720949] ? kmem_cache_alloc_trace+0x1a6/0x1c0 <4>[ 96.720952] driver_attach+0x1e/0x20 <4>[ 96.720955] bus_add_driver+0x170/0x260 <4>[ 96.720958] driver_register+0x60/0xe0 <4>[ 96.720961] ? 0xffffffffc0af3000 <4>[ 96.720964] __pci_register_driver+0x5a/0x60 <4>[ 96.721003] amdgpu_init+0x83/0x92 [amdgpu] <4>[ 96.721006] do_one_initcall+0x55/0x19d <4>[ 96.721009] ? __vunmap+0x81/0xb0 <4>[ 96.721013] ? _cond_resched+0x1a/0x50 <4>[ 96.721015] ? kmem_cache_alloc_trace+0xa5/0x1c0 <4>[ 96.721019] ? do_init_module+0x27/0x219 <4>[ 96.721021] do_init_module+0x5f/0x219 <4>[ 96.721024] load_module+0x260e/0x2e10 <4>[ 96.721028] ? ima_post_read_file+0x83/0xa0 <4>[ 96.721032] SYSC_finit_module+0xe5/0x120 <4>[ 96.721034] ? SYSC_finit_module+0xe5/0x120 <4>[ 96.721037] SyS_finit_module+0xe/0x10 <4>[ 96.721040] do_syscall_64+0x73/0x130 <4>[ 96.721043] entry_SYSCALL_64_after_hwframe+0x3d/0xa2 <4>[ 96.721045] RIP: 0033:0x7f8bc09f0229 <4>[ 96.721047] RSP: 002b:00007ffc4eeca168 EFLAGS: 00000246 ORIG_RAX: 0000000000000139 <4>[ 96.721050] RAX: ffffffffffffffda RBX: 00005568f4a07230 RCX: 00007f8bc09f0229 <4>[ 96.721053] RDX: 0000000000000000 RSI: 00005568f3310638 RDI: 000000000000000d <4>[ 96.721055] RBP: 00005568f3310638 R08: 0000000000000000 R09: 0000000000000000 <4>[ 96.721057] R10: 000000000000000d R11: 0000000000000246 R12: 0000000000000000 <4>[ 96.721059] R13: 00005568f4a07360 R14: 0000000000040000 R15: 0000000000000000 <4>[ 96.721062] Code: 49 8b 74 24 60 48 c7 c7 18 0c cf b4 e8 15 85 ea ff = eb 90 0f 1f 00 0f 1f 44 00 00 55 48 85 f6 48 89 e5 74 14 48 63 47 20 48 01 c6 = <48> 33 36 48 33 b7 40 01 00 00 0f 18 0e 5d c3 66 90 66 2e 0f 1f=20 <1>[ 96.721091] RIP: prefetch_freepointer+0x15/0x30 RSP: ffffba2808367840 <4>[ 96.721094] ---[ end trace d865bcaaf3cc5d66 ]--- --=20 You are receiving this mail because: You are the assignee for the bug.= --15221327840.E066F0c5.2953 Date: Tue, 27 Mar 2018 06:39:44 +0000 MIME-Version: 1.0 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://bugs.freedesktop.org/ Auto-Submitted: auto-generated

Comme= nt # 14 on bug 10568= 4 from jian-hong&= #64;endlessm.com
Created attachment 138370 [details]
dmesg of loading amdgpu module - tested in 4.16-rc7

I also tested with linux kernel v4.16-rc7 and got the same problem again wh=
en I
loaded amdgpu module manually.
The attechment is the whole dmesg for this test.

<6>[   96.502996] [drm] amdgpu kernel modesetting enabled.
<6>[   96.511435] AMD IOMMUv2 driver by Joerg Roedel <jroedel@suse.de>
<6>[   96.518456] Parsing CRAT table with 1 nodes
<6>[   96.519588] Creating topology SYSFS entries
<6>[   96.520729] Topology: Add APU node [0x0:0x0]
<6>[   96.521851] Finished initializing topology
<6>[   96.522991] kfd kfd: Initialized module
<7>[   96.524244] checking generic (e0000000 7f0000) vs hw (e0000000 =
10000000)
<6>[   96.524245] fb: switching to amdgpudrmfb from EFI VGA
<6>[   96.525412] Console: switching to colour dummy device 80x25
<6>[   96.525551] amdgpu 0000:09:00.0: enabling device (0006 -> 00=
07)
<6>[   96.525713] [drm] initializing kernel modesetting (RAVEN 0x1002=
:0x15DD
0x1025:0x1257 0xC6).
<6>[   96.525744] [drm] register mmio base: 0xFE700000
<6>[   96.525746] [drm] register mmio size: 524288
<6>[   96.528186] [drm] probing gen 2 caps for device 1022:15db =3D 7=
00d03/e
<6>[   96.528193] [drm] probing mlw for device 1022:15db =3D 700d03
<6>[   96.528392] [drm] VCN decode is enabled in VM mode
<6>[   96.528395] [drm] VCN encode is enabled in VM mode
<6>[   96.528411] ATOM BIOS: 113-RAVEN-T08
<6>[   96.528443] [drm] vm size is 262144 GB, 4 levels, block size is=
 9-bit,
fragment size is 9-bit
<6>[   96.528455] amdgpu 0000:09:00.0: GTT: 1024M 0x000000F500000000 -
0x000000F53FFFFFFF
<6>[   96.528467] [drm] Detected VRAM RAM=3D1024M, BAR=3D256M
<6>[   96.528468] [drm] RAM width 64bits UNKNOWN
<6>[   96.528530] [TTM] Zone  kernel: Available graphics memory: 1595=
0952 kiB
<6>[   96.528532] [TTM] Zone   dma32: Available graphics memory: 2097=
152 kiB
<6>[   96.528534] [TTM] Initializing pool allocator
<6>[   96.528538] [TTM] Initializing DMA pool allocator
<6>[   96.528567] [drm] amdgpu: 1024M of VRAM memory ready
<6>[   96.528569] [drm] amdgpu: 3072M of GTT memory ready.
<6>[   96.528576] [drm] GART: num cpu pages 262144, num gpu pages 262=
144
<6>[   96.529123] [drm] PCIE GART of 1024M enabled (table at
0x000000F400800000).
<6>[   96.550367] [drm] use_doorbell being set to: [true]
<6>[   96.565847] [drm] Found VCN firmware Version: 1.45 Family ID: 18
<6>[   96.720523] [drm] Display Core initialized with v3.1.27!
<4>[   96.720559] general protection fault: 0000 [#1] SMP NOPTI
<4>[   96.720562] Modules linked in: amdkfd amd_iommu_v2 amdgpu(+) ch=
ash
gpu_sched ttm drm_kms_helper drm i2c_algo_bit fb_sys_fops syscopyarea
sysfillrect sysimgblt efi_pstore arc4 cmac bnep edac_mce_amd kvm_amd ccp kvm
irqbypass snd_hda_codec_realtek ath10k_pci ath10k_core ath btusb
crct10dif_pclmul crc32_pclmul ghash_clmulni_intel btrtl input_leds btbcm
btintel mac80211 snd_hda_codec_generic snd_hda_codec_hdmi snd_hda_intel
snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_timer snd soundcore blueto=
oth
ecdh_generic pcbc aesni_intel cfg80211 r8169 i2c_piix4 wmi_bmof mii
sparse_keymap shpchp aes_x86_64 crypto_simd glue_helper cryptd psmouse mac_=
hid
tpm_crb video wmi zram ip_tables x_tables serio_raw uas usb_storage ahci
libahci hid_generic usbhid hid
<4>[   96.720624] CPU: 0 PID: 933 Comm: modprobe Not tainted 4.16.0-r=
c7+ #8
<4>[   96.720626] Hardware name: Acer Aspire TC-380/Aspire TC-380, BI=
OS D05
02/01/2018
<4>[   96.720635] RIP: 0010:prefetch_freepointer+0x15/0x30
<4>[   96.720637] RSP: 0018:ffffba2808367840 EFLAGS: 00010202
<4>[   96.720640] RAX: 0000000000000000 RBX: ffff9fcb9d9c5800 RCX:
0000000000000ae8
<4>[   96.720643] RDX: 0000000000000ae7 RSI: 597f068ab1e00726 RDI:
ffff9fcbbf006e80
<4>[   96.720645] RBP: ffffba2808367840 R08: ffff9fcbbf627160 R09:
ffffffffc096ef82
<4>[   96.720647] R10: 0000000000000024 R11: ffff9fcb99f3ac97 R12:
00000000014080c0
<4>[   96.720649] R13: ffff9fcbbf006e80 R14: ffff9fcb9d9c5800 R15:
ffff9fcbbf006e80
<4>[   96.720653] FS:  00007f8bc0eab700(0000) GS:ffff9fcbbf600000(000=
0)
knlGS:0000000000000000
<4>[   96.720655] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
<4>[   96.720658] CR2: 00007ffc4eec6ff8 CR3: 00000007dd9a6000 CR4:
00000000003406f0
<4>[   96.720660] Call Trace:
<4>[   96.720666]  kmem_cache_alloc_trace+0xa5/0x1c0
<4>[   96.720741]  ? dm_hw_init+0x462/0xed0 [amdgpu]
<4>[   96.720792]  dm_hw_init+0x462/0xed0 [amdgpu]
<4>[   96.720832]  amdgpu_device_init+0xc1b/0x1340 [amdgpu]
<4>[   96.720872]  amdgpu_driver_load_kms+0x8b/0x2c0 [amdgpu]
<4>[   96.720888]  drm_dev_register+0x149/0x1e0 [drm]
<4>[   96.720927]  amdgpu_pci_probe+0x10a/0x180 [amdgpu]
<4>[   96.720931]  local_pci_probe+0x4a/0xa0
<4>[   96.720934]  pci_device_probe+0x109/0x1b0
<4>[   96.720938]  driver_probe_device+0x2bb/0x4a0
<4>[   96.720941]  __driver_attach+0xe2/0xf0
<4>[   96.720944]  ? driver_probe_device+0x4a0/0x4a0
<4>[   96.720947]  bus_for_each_dev+0x6a/0xc0
<4>[   96.720949]  ? kmem_cache_alloc_trace+0x1a6/0x1c0
<4>[   96.720952]  driver_attach+0x1e/0x20
<4>[   96.720955]  bus_add_driver+0x170/0x260
<4>[   96.720958]  driver_register+0x60/0xe0
<4>[   96.720961]  ? 0xffffffffc0af3000
<4>[   96.720964]  __pci_register_driver+0x5a/0x60
<4>[   96.721003]  amdgpu_init+0x83/0x92 [amdgpu]
<4>[   96.721006]  do_one_initcall+0x55/0x19d
<4>[   96.721009]  ? __vunmap+0x81/0xb0
<4>[   96.721013]  ? _cond_resched+0x1a/0x50
<4>[   96.721015]  ? kmem_cache_alloc_trace+0xa5/0x1c0
<4>[   96.721019]  ? do_init_module+0x27/0x219
<4>[   96.721021]  do_init_module+0x5f/0x219
<4>[   96.721024]  load_module+0x260e/0x2e10
<4>[   96.721028]  ? ima_post_read_file+0x83/0xa0
<4>[   96.721032]  SYSC_finit_module+0xe5/0x120
<4>[   96.721034]  ? SYSC_finit_module+0xe5/0x120
<4>[   96.721037]  SyS_finit_module+0xe/0x10
<4>[   96.721040]  do_syscall_64+0x73/0x130
<4>[   96.721043]  entry_SYSCALL_64_after_hwframe+0x3d/0xa2
<4>[   96.721045] RIP: 0033:0x7f8bc09f0229
<4>[   96.721047] RSP: 002b:00007ffc4eeca168 EFLAGS: 00000246 ORIG_RA=
X:
0000000000000139
<4>[   96.721050] RAX: ffffffffffffffda RBX: 00005568f4a07230 RCX:
00007f8bc09f0229
<4>[   96.721053] RDX: 0000000000000000 RSI: 00005568f3310638 RDI:
000000000000000d
<4>[   96.721055] RBP: 00005568f3310638 R08: 0000000000000000 R09:
0000000000000000
<4>[   96.721057] R10: 000000000000000d R11: 0000000000000246 R12:
0000000000000000
<4>[   96.721059] R13: 00005568f4a07360 R14: 0000000000040000 R15:
0000000000000000
<4>[   96.721062] Code: 49 8b 74 24 60 48 c7 c7 18 0c cf b4 e8 15 85 =
ea ff eb
90 0f 1f 00 0f 1f 44 00 00 55 48 85 f6 48 89 e5 74 14 48 63 47 20 48 01 c6 =
<48>
33 36 48 33 b7 40 01 00 00 0f 18 0e 5d c3 66 90 66 2e 0f 1f=20
<1>[   96.721091] RIP: prefetch_freepointer+0x15/0x30 RSP: ffffba2808=
367840
<4>[   96.721094] ---[ end trace d865bcaaf3cc5d66 ]---


You are receiving this mail because:
  • You are the assignee for the bug.
= --15221327840.E066F0c5.2953-- --===============1558916868== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: base64 Content-Disposition: inline X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18KZHJpLWRldmVs IG1haWxpbmcgbGlzdApkcmktZGV2ZWxAbGlzdHMuZnJlZWRlc2t0b3Aub3JnCmh0dHBzOi8vbGlz dHMuZnJlZWRlc2t0b3Aub3JnL21haWxtYW4vbGlzdGluZm8vZHJpLWRldmVsCg== --===============1558916868==--