All of lore.kernel.org
 help / color / mirror / Atom feed
* [error] Drm -> amdgpu Unrecoverable Machine Check
@ 2019-12-02 11:41 Yusuf Altıparmak
  2019-12-02 12:28 ` Christian König
  0 siblings, 1 reply; 12+ messages in thread
From: Yusuf Altıparmak @ 2019-12-02 11:41 UTC (permalink / raw)
  To: amd-gfx


[-- Attachment #1.1: Type: text/plain, Size: 1424 bytes --]

My embedded board is freezing when I put E9171 on PCIe. What is the meaning
of Unrecoverable Machine Check error about GPU?

Could PCIe settings in .dts file cause this problem? If it is, is there any
sample PCIe configuration for E9171? I attached my dts file.

System is working fine when GPU is not plugged in.

*This is the last console log before freeze:*
[drm] amdgpu kernel modesetting enabled.

[drm] initializing kernel modesetting (POLARIS12 0x1002:0x6987
0x1787:0x2389 0x80).
[drm] register mmio base: 0x20200000

fsl-fman-port ffe488000.port fm1-gb0: renamed from eth0

[drm] register mmio size: 262144

[drm] add ip block number 0 <vi_common>

[drm] add ip block number 1 <gmc_v8_0>

[drm] add ip block number 2 <tonga_ih>

[drm] add ip block number 3 <powerplay>

[drm] add ip block number 4 <dm>

[drm] add ip block number 5 <gfx_v8_0>

[drm] add ip block number 6 <sdma_v3_0>

[drm] add ip block number 7 <uvd_v6_0>

[drm] add ip block number 8 <vce_v3_0>

[drm] UVD is enabled in VM mode

[drm] UVD ENC is enabled in VM mode

[drm] VCE enabled in VM mode

ATOM BIOS: 113-ER16BFC-001

[drm] GPU posting now...

Disabling lock debugging due to kernel taint

Machine check in kernel mode.

Caused by (from MCSR=a000): Load Error Report

Guarded Load Error Report

Kernel panic - not syncing: Unrecoverable Machine check

CPU: 1 PID: 2023 Comm: udevd Tainted: G   M              4.19.26+gc0c2141
#1
Call Trace:

[-- Attachment #1.2: Type: text/html, Size: 3431 bytes --]

[-- Attachment #2: t104xd4rdb.dtsi --]
[-- Type: application/octet-stream, Size: 6555 bytes --]

/*
 * T1040D4RDB/T1042D4RDB Device Tree Source
 *
 * Copyright 2015 Freescale Semiconductor Inc.
 *
 * Redistribution and use in source and binary forms, with or without
 * modification, are permitted provided that the following conditions are met:
 *     * Redistributions of source code must retain the above copyright
 *	 notice, this list of conditions and the following disclaimer.
 *     * Redistributions in binary form must reproduce the above copyright
 *	 notice, this list of conditions and the following disclaimer in the
 *	 documentation and/or other materials provided with the distribution.
 *     * Neither the name of Freescale Semiconductor nor the
 *	 names of its contributors may be used to endorse or promote products
 *	 derived from this software without specific prior written permission.
 *
 *
 * ALTERNATIVELY, this software may be distributed under the terms of the
 * GNU General Public License ("GPL") as published by the Free Software
 * Foundation, either version 2 of that License or (at your option) any
 * later version.
 *
 * THIS SOFTWARE IS PROVIDED BY Freescale Semiconductor "AS IS" AND ANY
 * EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED
 * WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
 * DISCLAIMED. IN NO EVENT SHALL Freescale Semiconductor BE LIABLE FOR ANY
 * DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES
 * (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES;
 * LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND
 * ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
 * (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
 * SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
 */

/ {
	reserved-memory {
		#address-cells = <2>;
		#size-cells = <2>;
		ranges;

		bman_fbpr: bman-fbpr {
			size = <0 0x1000000>;
			alignment = <0 0x1000000>;
		};
		qman_fqd: qman-fqd {
			size = <0 0x400000>;
			alignment = <0 0x400000>;
		};
		qman_pfdr: qman-pfdr {
			size = <0 0x2000000>;
			alignment = <0 0x2000000>;
		};
	};

	ifc: localbus@ffe124000 {
		reg = <0xf 0xfe124000 0 0x2000>;
		ranges = <0 0 0xf 0xe8000000 0x08000000
			  2 0 0xf 0xff800000 0x00010000
			  3 0 0xf 0xffdf0000 0x00008000>;

		nor@0,0 {
			#address-cells = <1>;
			#size-cells = <1>;
			compatible = "cfi-flash";
			reg = <0x0 0x0 0x8000000>;
			bank-width = <2>;
			device-width = <1>;
		};

		nand@2,0 {
			#address-cells = <1>;
			#size-cells = <1>;
			compatible = "fsl,ifc-nand";
			reg = <0x2 0x0 0x10000>;
		};

		cpld@3,0 {
			compatible = "fsl,t1040d4rdb-cpld";
			reg = <3 0 0x300>;
		};
	};

	memory {
		device_type = "memory";
	};

	dcsr: dcsr@f00000000 {
		ranges = <0x00000000 0xf 0x00000000 0x01072000>;
	};

	bportals: bman-portals@ff4000000 {
		ranges = <0x0 0xf 0xf4000000 0x2000000>;
	};

	qportals: qman-portals@ff6000000 {
		ranges = <0x0 0xf 0xf6000000 0x2000000>;
	};

	soc: soc@ffe000000 {
		ranges = <0x00000000 0xf 0xfe000000 0x1000000>;
		reg = <0xf 0xfe000000 0 0x00001000>;

		spi@110000 {
			flash@0 {
				#address-cells = <1>;
				#size-cells = <1>;
				compatible = "micron,n25q512ax3", "jedec,spi-nor";
				reg = <0>;
				/* input clock */
				spi-max-frequency = <10000000>;
			};
			slic@1 {
				compatible = "maxim,ds26522";
				reg = <1>;
				spi-max-frequency = <2000000>; /* input clock */
			};
			slic@2 {
				compatible = "maxim,ds26522";
				reg = <2>;
				spi-max-frequency = <2000000>; /* input clock */
			};
		};
		i2c@118000 {
			hwmon@4c {
				compatible = "adi,adt7461";
				reg = <0x4c>;
			};

			rtc@68 {
				compatible = "dallas,ds1337";
				reg = <0x68>;
				interrupts = <0x2 0x1 0 0>;
			};
		};

		i2c@118100 {
			mux@77 {
				/*
				 * Child nodes of mux depend on which i2c
				 * devices are connected via the mini PCI
				 * connector slot1, the mini PCI connector
				 * slot2, the HDMI connector, and the PEX
				 * slot. Systems with such devices attached
				 * should provide a wrapper .dts file that
				 * includes this one, and adds those nodes
				 */
				compatible = "nxp,pca9546";
				reg = <0x77>;
				#address-cells = <1>;
				#size-cells = <0>;
			};
		};

	};

	pci0: pcie@ffe240000 {
		reg = <0xf 0xfe240000 0 0x10000>;
		ranges = <0x02000000 0 0xe0000000 0xc 0x0 0x0 0x10000000
			  0x01000000 0 0x0 0xf 0xf8000000 0x0 0x00010000>;
		pcie@0 {
			ranges = <0x02000000 0 0xe0000000
				  0x02000000 0 0xe0000000
				  0 0x10000000

				  0x01000000 0 0x00000000
				  0x01000000 0 0x00000000
				  0 0x00010000>;
		};
	};

	pci1: pcie@ffe250000 {
		reg = <0xf 0xfe250000 0 0x10000>;
		ranges = <0x02000000 0 0xe0000000 0xc 0x10000000 0x1 0x00000000
			  0x01000000 0 0 0xf 0xf8010000 0 0x00100000>;
		pcie@0 {
			ranges = <0x02000000 0 0xe0000000
				  0x02000000 0 0xe0000000
				  0 0x10000000

				  0x01000000 0 0x00000000
				  0x01000000 0 0x00000000
				  0 0x00010000>;
		};
	};

	pci2: pcie@ffe260000 {
		reg = <0xf 0xfe260000 0 0x10000>;
		ranges = <0x02000000 0 0xe0000000 0xd 0x10000000 0 0x10000000
			  0x01000000 0 0x00000000 0xf 0xf8110000 0 0x00010000>;
		pcie@0 {
			ranges = <0x02000000 0 0xe0000000
				  0x02000000 0 0xe0000000
				  0 0x10000000

				  0x01000000 0 0x00000000
				  0x01000000 0 0x00000000
				  0 0x00010000>;
		};
	};

	pci3: pcie@ffe270000 {
		reg = <0xf 0xfe270000 0 0x10000>;
		ranges = <0x02000000 0 0xe0000000 0xd 0x20000000 0 0x10000000
			  0x01000000 0 0x00000000 0xf 0xf8120000 0 0x00010000>;
		pcie@0 {
			ranges = <0x02000000 0 0xe0000000
				  0x02000000 0 0xe0000000
				  0 0x10000000

				  0x01000000 0 0x00000000
				  0x01000000 0 0x00000000
				  0 0x00010000>;
		};
	};

	qe: qe@ffe140000 {
		ranges = <0x0 0xf 0xfe140000 0x40000>;
		reg = <0xf 0xfe140000 0 0x480>;
		brg-frequency = <0>;
		bus-frequency = <0>;

		si1: si@700 {
			compatible = "fsl,t1040-qe-si";
			reg = <0x700 0x80>;
		};

		siram1: siram@1000 {
			compatible = "fsl,t1040-qe-siram";
			reg = <0x1000 0x800>;
		};

		ucc_hdlc: ucc@2000 {
			compatible = "fsl,ucc-hdlc";
			rx-clock-name = "clk8";
			tx-clock-name = "clk9";
			fsl,rx-sync-clock = "rsync_pin";
			fsl,tx-sync-clock = "tsync_pin";
			fsl,tx-timeslot-mask = <0xfffffffe>;
			fsl,rx-timeslot-mask = <0xfffffffe>;
			fsl,tdm-framer-type = "e1";
			fsl,tdm-id = <0>;
			fsl,siram-entry-id = <0>;
			fsl,tdm-interface;
		};

		ucc_serial: ucc@2200 {
			compatible = "fsl,t1040-ucc-uart";
			port-number = <0>;
			rx-clock-name = "brg2";
			tx-clock-name = "brg2";
		};
	};
};

[-- Attachment #3: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [error] Drm -> amdgpu Unrecoverable Machine Check
  2019-12-02 11:41 [error] Drm -> amdgpu Unrecoverable Machine Check Yusuf Altıparmak
@ 2019-12-02 12:28 ` Christian König
  2019-12-02 13:32   ` Yusuf Altıparmak
  0 siblings, 1 reply; 12+ messages in thread
From: Christian König @ 2019-12-02 12:28 UTC (permalink / raw)
  To: Yusuf Altıparmak, amd-gfx


[-- Attachment #1.1: Type: text/plain, Size: 2164 bytes --]

Hi Yusuf,

Am 02.12.19 um 12:41 schrieb Yusuf Altıparmak:
> My embedded board is freezing when I put E9171 on PCIe. What is the 
> meaning of Unrecoverable Machine Check error about GPU?

Well see the explanation on Wikipedia for example: 
https://en.wikipedia.org/wiki/Machine-check_exception

In general it means you have messed up something in your hardware 
configuration.

> Could PCIe settings in .dts file cause this problem?

Possible, but rather unlikely. My best guess is that it is some problem 
with the power supply.

> If it is, is there any sample PCIe configuration for E9171?

The E9171 is just a PCIe device, so the dtsi is actually rather 
uninteresting. What we really need is a full dmesg and maybe lspci 
output would help as well.

Regards,
Christian.

> I attached my dts file.
>
> System is working fine when GPU is not plugged in.
> *
> *
> *This is the last console log before freeze:*
> [drm] amdgpu kernel modesetting enabled.
> [drm] initializing kernel modesetting (POLARIS12 0x1002:0x6987 
> 0x1787:0x2389 0x80).
> [drm] register mmio base: 0x20200000
> fsl-fman-port ffe488000.port fm1-gb0: renamed from eth0
> [drm] register mmio size: 262144
> [drm] add ip block number 0 <vi_common>
> [drm] add ip block number 1 <gmc_v8_0>
> [drm] add ip block number 2 <tonga_ih>
> [drm] add ip block number 3 <powerplay>
> [drm] add ip block number 4 <dm>
> [drm] add ip block number 5 <gfx_v8_0>
> [drm] add ip block number 6 <sdma_v3_0>
> [drm] add ip block number 7 <uvd_v6_0>
> [drm] add ip block number 8 <vce_v3_0>
> [drm] UVD is enabled in VM mode
> [drm] UVD ENC is enabled in VM mode
> [drm] VCE enabled in VM mode
> ATOM BIOS: 113-ER16BFC-001
> [drm] GPU posting now...
> Disabling lock debugging due to kernel taint
> Machine check in kernel mode.
> Caused by (from MCSR=a000): Load Error Report
> Guarded Load Error Report
> Kernel panic - not syncing: Unrecoverable Machine check
> CPU: 1 PID: 2023 Comm: udevd Tainted: G   M  4.19.26+gc0c2141 #1
> Call Trace:
>
> _______________________________________________
> amd-gfx mailing list
> amd-gfx@lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/amd-gfx


[-- Attachment #1.2: Type: text/html, Size: 6147 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [error] Drm -> amdgpu Unrecoverable Machine Check
  2019-12-02 12:28 ` Christian König
@ 2019-12-02 13:32   ` Yusuf Altıparmak
  2019-12-02 13:41     ` Christian König
  0 siblings, 1 reply; 12+ messages in thread
From: Yusuf Altıparmak @ 2019-12-02 13:32 UTC (permalink / raw)
  To: christian.koenig; +Cc: amd-gfx


[-- Attachment #1.1: Type: text/plain, Size: 17522 bytes --]

>
>
> I attached my dts file.
>
> System is working fine when GPU is not plugged in.
>
> *This is the last console log before freeze:*
> [drm] amdgpu kernel modesetting enabled.
>
> [drm] initializing kernel modesetting (POLARIS12 0x1002:0x6987
> 0x1787:0x2389 0x80).
> [drm] register mmio base: 0x20200000
>
> fsl-fman-port ffe488000.port fm1-gb0: renamed from eth0
>
> [drm] register mmio size: 262144
>
> [drm] add ip block number 0 <vi_common>
>
> [drm] add ip block number 1 <gmc_v8_0>
>
> [drm] add ip block number 2 <tonga_ih>
>
> [drm] add ip block number 3 <powerplay>
>
> [drm] add ip block number 4 <dm>
>
> [drm] add ip block number 5 <gfx_v8_0>
>
> [drm] add ip block number 6 <sdma_v3_0>
>
> [drm] add ip block number 7 <uvd_v6_0>
>
> [drm] add ip block number 8 <vce_v3_0>
>
> [drm] UVD is enabled in VM mode
>
> [drm] UVD ENC is enabled in VM mode
>
> [drm] VCE enabled in VM mode
>
> ATOM BIOS: 113-ER16BFC-001
>
> [drm] GPU posting now...
>
> Disabling lock debugging due to kernel taint
>
> Machine check in kernel mode.
>
> Caused by (from MCSR=a000): Load Error Report
>
> Guarded Load Error Report
>
> Kernel panic - not syncing: Unrecoverable Machine check
>
> CPU: 1 PID: 2023 Comm: udevd Tainted: G   M              4.19.26+gc0c2141
> #1
> Call Trace:
>
>
>
> _______________________________________________
> amd-gfx mailing listamd-gfx@lists.freedesktop.orghttps://lists.freedesktop.org/mailman/listinfo/amd-gfx
>
>
>

Christian König <ckoenig.leichtzumerken@gmail.com>, 2 Ara 2019 Pzt, 15:28
tarihinde şunu yazdı:

> Hi Yusuf,
>
> Am 02.12.19 um 12:41 schrieb Yusuf Altıparmak:
>
> My embedded board is freezing when I put E9171 on PCIe. What is the
> meaning of Unrecoverable Machine Check error about GPU?
>
>
> Well see the explanation on Wikipedia for example:
> https://en.wikipedia.org/wiki/Machine-check_exception
>
> In general it means you have messed up something in your hardware
> configuration.
>
> Could PCIe settings in .dts file cause this problem?
>
>
> Possible, but rather unlikely. My best guess is that it is some problem
> with the power supply.
>
> If it is, is there any sample PCIe configuration for E9171?
>
>
> The E9171 is just a PCIe device, so the dtsi is actually rather
> uninteresting. What we really need is a full dmesg and maybe lspci output
> would help as well.
>
> Regards,
> Christian.
>


Hi Christian,

At first, I am using NXP T1042D4RDB-64B which has 256 MB PCIe buffer
according to its. PCIe memory range was arranged to 256 MB in .dts file and
in U-boot configuration file. Driver was giving error with exit code -12
(OUT_OF_MEMORY). But I was able to reach the linux console.

[    5.512922] [drm] amdgpu kernel modesetting enabled.
[    5.517065] [drm] initializing kernel modesetting (POLARIS12
0x1002:0x6987 0x1787:0x2389 0x80).
[    5.524507] amdgpu 0001:01:00.0: Fatal error during GPU init
[    5.529296] amdgpu: probe of 0001:01:00.0 failed with error -12

Then I canged 256 MB to 4GB in .dtsi and U-boot conf file. I also changed
64KB I/O size to 1MB . When I do this, I wasn't able to reach the linux
console because board was freezing. But driver was successfull at this
time. I already mentioned successfull driver console logs up.

*this is lspci -v when GPU is plugged and Memory size is 256 MB.*

root@t1042d4rdb-64b:~# lspci -v
0000:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 11)
(prog-if 00 [Normal decode])
        Device tree node: /sys/firmware/devicetree/base/pcie@ffe240000
/pcie@0
        Flags: bus master, fast devsel, latency 0, IRQ 20
        Memory at <ignored> (32-bit, non-prefetchable)
        Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
        I/O behind bridge: 00000000-0000ffff [size=64K]
        Memory behind bridge: e0000000-efffffff [size=256M]
        Prefetchable memory behind bridge: None
        Capabilities: [44] Power Management version 3
        Capabilities: [4c] Express Root Port (Slot-), MSI 00
        Capabilities: [100] Advanced Error Reporting
        Kernel driver in use: pcieport

0001:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 11)
(prog-if 00 [Normal decode])
        Device tree node: /sys/firmware/devicetree/base/pcie@ffe250000
/pcie@0
        Flags: bus master, fast devsel, latency 0, IRQ 21
        Memory at <ignored> (32-bit, non-prefetchable)
        Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
        I/O behind bridge: 00000000-0000ffff [size=64K]
        Memory behind bridge: e0000000-efffffff [size=256M]
        Prefetchable memory behind bridge: None
        Capabilities: [44] Power Management version 3
        Capabilities: [4c] Express Root Port (Slot-), MSI 00
        Capabilities: [100] Advanced Error Reporting
        Kernel driver in use: pcieport

0001:01:00.0 VGA compatible controller: Advanced Micro Devices, Inc.
[AMD/ATI] Lexa [Radeon E9171 MCM] (rev 80) (prog-if 00 [VGA controller])
        Subsystem: Hightech Information System Ltd. Device 2389
        Flags: fast devsel, IRQ 41
        Memory at c10000000 (64-bit, prefetchable) [size=256M]
        Memory at <ignored> (64-bit, prefetchable)
        I/O ports at 1100 [size=256]
        Memory at <ignored> (32-bit, non-prefetchable)
        Expansion ROM at <ignored> [disabled]
        Capabilities: [48] Vendor Specific Information: Len=08 <?>
        Capabilities: [50] Power Management version 3
        Capabilities: [58] Express Legacy Endpoint, MSI 00
        Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
        Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1
Len=010 <?>
        Capabilities: [150] Advanced Error Reporting
        Capabilities: [200] Resizable BAR <?>
        Capabilities: [270] Secondary PCI Express <?>
        Capabilities: [2b0] Address Translation Service (ATS)
        Capabilities: [2c0] Page Request Interface (PRI)
        Capabilities: [2d0] Process Address Space ID (PASID)
        Capabilities: [320] Latency Tolerance Reporting
        Capabilities: [328] Alternative Routing-ID Interpretation (ARI)
        Capabilities: [370] L1 PM Substates
        Kernel modules: amdgpu

0001:01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device
aae0
        Subsystem: Hightech Information System Ltd. Device aae0
        Flags: bus master, fast devsel, latency 0, IRQ 17
        Memory at <ignored> (64-bit, non-prefetchable)
        Capabilities: [48] Vendor Specific Information: Len=08 <?>
        Capabilities: [50] Power Management version 3
        Capabilities: [58] Express Legacy Endpoint, MSI 00
        Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
        Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1
Len=010 <?>
        Capabilities: [150] Advanced Error Reporting
        Capabilities: [328] Alternative Routing-ID Interpretation (ARI)

0002:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 11)
(prog-if 00 [Normal decode])
        Device tree node: /sys/firmware/devicetree/base/pcie@ffe260000
/pcie@0
        Flags: bus master, fast devsel, latency 0, IRQ 22
        Memory at <ignored> (32-bit, non-prefetchable)
        Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
        I/O behind bridge: 00000000-0000ffff [size=64K]
        Memory behind bridge: e0000000-efffffff [size=256M]
        Prefetchable memory behind bridge: None
        Capabilities: [44] Power Management version 3
        Capabilities: [4c] Express Root Port (Slot-), MSI 00
        Capabilities: [100] Advanced Error Reporting
        Kernel driver in use: pcieport

0003:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 11)
(prog-if 00 [Normal decode])
        Device tree node: /sys/firmware/devicetree/base/pcie@ffe270000
/pcie@0
        Flags: bus master, fast devsel, latency 0, IRQ 23
        Memory at <ignored> (32-bit, non-prefetchable)
        Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
        I/O behind bridge: 00000000-0000ffff [size=64K]
        Memory behind bridge: e0000000-efffffff [size=256M]
        Prefetchable memory behind bridge: None
        Capabilities: [44] Power Management version 3
        Capabilities: [4c] Express Root Port (Slot-), MSI 00
        Capabilities: [100] Advanced Error Reporting
        Kernel driver in use: pcieport

*AND This is PCIe dmesg message when memory range is 256MB. It's also
giving same message when memory range is arranged as 4GB;*

PCI host bridge /pcie@ffe240000  ranges:
 MEM 0x0000000c00000000..0x0000000c0fffffff -> 0x00000000e0000000
  IO 0x0000000ff8000000..0x0000000ff800ffff -> 0x0000000000000000
/pcie@ffe240000: PCICSRBAR @ 0xff000000
setup_pci_atmu: end of DRAM 200000000
/pcie@ffe240000: Setup 64-bit PCI DMA window
/pcie@ffe240000: WARNING: Outbound window cfg leaves gaps in memory map.
Adjusting the memory map could reduce unnecessary bounce buffering.
/pcie@ffe240000: DMA window size is 0xe0000000
Found FSL PCI host bridge at 0x0000000ffe250000. Firmware bus number: 0->1
PCI host bridge /pcie@ffe250000  ranges:
 MEM 0x0000000c10000000..0x0000000c1fffffff -> 0x00000000e0000000
  IO 0x0000000ff8010000..0x0000000ff801ffff -> 0x0000000000000000
/pcie@ffe250000: PCICSRBAR @ 0xff000000
setup_pci_atmu: end of DRAM 200000000
/pcie@ffe250000: Setup 64-bit PCI DMA window
/pcie@ffe250000: WARNING: Outbound window cfg leaves gaps in memory map.
Adjusting the memory map could reduce unnecessary bounce buffering.
/pcie@ffe250000: DMA window size is 0xe0000000
Found FSL PCI host bridge at 0x0000000ffe260000. Firmware bus number: 0->0
PCI host bridge /pcie@ffe260000  ranges:
 MEM 0x0000000c20000000..0x0000000c2fffffff -> 0x00000000e0000000
  IO 0x0000000ff8020000..0x0000000ff802ffff -> 0x0000000000000000
/pcie@ffe260000: PCICSRBAR @ 0xff000000
setup_pci_atmu: end of DRAM 200000000
/pcie@ffe260000: Setup 64-bit PCI DMA window
/pcie@ffe260000: WARNING: Outbound window cfg leaves gaps in memory map.
Adjusting the memory map could reduce unnecessary bounce buffering.
/pcie@ffe260000: DMA window size is 0xe0000000
Found FSL PCI host bridge at 0x0000000ffe270000. Firmware bus number: 0->0
PCI host bridge /pcie@ffe270000  ranges:
 MEM 0x0000000c30000000..0x0000000c3fffffff -> 0x00000000e0000000
  IO 0x0000000ff8030000..0x0000000ff803ffff -> 0x0000000000000000
/pcie@ffe270000: PCICSRBAR @ 0xff000000
setup_pci_atmu: end of DRAM 200000000
/pcie@ffe270000: Setup 64-bit PCI DMA window
/pcie@ffe270000: WARNING: Outbound window cfg leaves gaps in memory map.
Adjusting the memory map could reduce unnecessary bounce buffering.
/pcie@ffe270000: DMA window size is 0xe0000000
iommu: Adding device ff6000000.qman-portal to group 0
iommu: Adding device ff6004000.qman-portal to group 1
iommu: Adding device ff6008000.qman-portal to group 2
iommu: Adding device ff600c000.qman-portal to group 3
iommu: Adding device ff6010000.qman-portal to group 4
iommu: Adding device ff6014000.qman-portal to group 5
iommu: Adding device ff6018000.qman-portal to group 6
iommu: Adding device ff601c000.qman-portal to group 7
iommu: Adding device ff6020000.qman-portal to group 8
iommu: Adding device ff6024000.qman-portal to group 9
iommu: Adding device ffe100300.dma to group 10
iommu: Adding device ffe101300.dma to group 11
iommu: Adding device ffe114000.sdhc to group 12
iommu: Adding device ffe210000.usb to group 13
iommu: Adding device ffe211000.usb to group 14
iommu: Adding device ffe220000.sata to group 15
iommu: Adding device ffe221000.sata to group 16
iommu: Adding device ffe318000.qman to group 17
iommu: Adding device ffe31a000.bman to group 18
iommu: Adding device ffe240000.pcie to group 19
iommu: Adding device ffe250000.pcie to group 20
iommu: Adding device ffe260000.pcie to group 21
iommu: Adding device ffe270000.pcie to group 22
iommu: Adding device ffe140000.qe to group 23
software IO TLB: mapped [mem 0xfbfff000-0xfffff000] (64MB)
PCI: Probing PCI hardware
fsl-pci ffe240000.pcie: PCI host bridge to bus 0000:00
pci_bus 0000:00: root bus resource [io
 0x8000080000010000-0x800008000001ffff] (bus address [0x0000-0xffff])
pci_bus 0000:00: root bus resource [mem 0xc00000000-0xc0fffffff] (bus
address [0xe0000000-0xefffffff])
pci_bus 0000:00: root bus resource [bus 00]
iommu: Removing device ffe240000.pcie from group 19
iommu: Adding device 0000:00:00.0 to group 24
pci 0000:00:00.0: bridge configuration invalid ([bus 00-00]), reconfiguring
pci 0000:00:00.0: PCI bridge to [bus 01-ff]
fsl-pci ffe250000.pcie: PCI host bridge to bus 0001:00
pci_bus 0001:00: root bus resource [io
 0x8000080000021000-0x8000080000030fff] (bus address [0x0000-0xffff])
pci_bus 0001:00: root bus resource [mem 0xc10000000-0xc1fffffff] (bus
address [0xe0000000-0xefffffff])
pci_bus 0001:00: root bus resource [bus 00-01]
iommu: Removing device ffe250000.pcie from group 20
iommu: Adding device 0001:00:00.0 to group 19
pci 0001:01:00.0: enabling Extended Tags
pci 0001:01:00.0: 4.000 Gb/s available PCIe bandwidth, limited by 5 GT/s x1
link at 0001:00:00.0 (capable of 63.008 Gb/s with 8 GT/s x8 link)
iommu: Adding device 0001:01:00.0 to group 19
pci 0001:01:00.1: enabling Extended Tags
iommu: Adding device 0001:01:00.1 to group 19
pci 0001:00:00.0: PCI bridge to [bus 01-ff]
fsl-pci ffe260000.pcie: PCI host bridge to bus 0002:00
pci_bus 0002:00: root bus resource [io
 0x8000080000032000-0x8000080000041fff] (bus address [0x0000-0xffff])
pci_bus 0002:00: root bus resource [mem 0xc20000000-0xc2fffffff] (bus
address [0xe0000000-0xefffffff])
pci_bus 0002:00: root bus resource [bus 00]
iommu: Removing device ffe260000.pcie from group 21
iommu: Adding device 0002:00:00.0 to group 20
pci 0002:00:00.0: bridge configuration invalid ([bus 00-00]), reconfiguring
pci 0002:00:00.0: PCI bridge to [bus 01-ff]
fsl-pci ffe270000.pcie: PCI host bridge to bus 0003:00
pci_bus 0003:00: root bus resource [io
 0x8000080000043000-0x8000080000052fff] (bus address [0x0000-0xffff])
pci_bus 0003:00: root bus resource [mem 0xc30000000-0xc3fffffff] (bus
address [0xe0000000-0xefffffff])
pci_bus 0003:00: root bus resource [bus 00]
iommu: Removing device ffe270000.pcie from group 22
iommu: Adding device 0003:00:00.0 to group 21
pci 0003:00:00.0: bridge configuration invalid ([bus 00-00]), reconfiguring
pci 0003:00:00.0: PCI bridge to [bus 01-ff]
PCI: Cannot allocate resource region 0 of device 0000:00:00.0, will remap
PCI: Cannot allocate resource region 0 of device 0001:00:00.0, will remap
PCI: Cannot allocate resource region 2 of device 0001:01:00.0, will remap
PCI: Cannot allocate resource region 5 of device 0001:01:00.0, will remap
PCI: Cannot allocate resource region 6 of device 0001:01:00.0, will remap
PCI: Cannot allocate resource region 0 of device 0001:01:00.1, will remap
PCI: Cannot allocate resource region 0 of device 0002:00:00.0, will remap
PCI: Cannot allocate resource region 0 of device 0003:00:00.0, will remap
pci 0000:00:00.0: BAR 0: no space for [mem size 0x01000000]
pci 0000:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
pci 0000:00:00.0: PCI bridge to [bus 01]
pci 0000:00:00.0:   bridge window [io
 0x8000080000010000-0x800008000001ffff]
pci 0000:00:00.0:   bridge window [mem 0xc00000000-0xc0fffffff]
pci_bus 0000:00: Some PCI device resources are unassigned, try booting with
pci=realloc
pci 0001:00:00.0: BAR 0: no space for [mem size 0x01000000]
pci 0001:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
pci 0001:00:00.0: BAR 9: no space for [mem size 0x00200000 64bit pref]
pci 0001:00:00.0: BAR 9: failed to assign [mem size 0x00200000 64bit pref]
pci 0001:01:00.0: BAR 2: no space for [mem size 0x00200000 64bit pref]
pci 0001:01:00.0: BAR 2: failed to assign [mem size 0x00200000 64bit pref]
pci 0001:01:00.0: BAR 5: no space for [mem size 0x00040000]
pci 0001:01:00.0: BAR 5: failed to assign [mem size 0x00040000]
pci 0001:01:00.0: BAR 6: no space for [mem size 0x00020000 pref]
pci 0001:01:00.0: BAR 6: failed to assign [mem size 0x00020000 pref]
pci 0001:01:00.1: BAR 0: no space for [mem size 0x00004000 64bit]
pci 0001:01:00.1: BAR 0: failed to assign [mem size 0x00004000 64bit]
pci 0001:00:00.0: PCI bridge to [bus 01]
pci 0001:00:00.0:   bridge window [io
 0x8000080000021000-0x8000080000030fff]
pci 0001:00:00.0:   bridge window [mem 0xc10000000-0xc1fffffff]
pci_bus 0001:00: Some PCI device resources are unassigned, try booting with
pci=realloc
pci 0002:00:00.0: BAR 0: no space for [mem size 0x01000000]
pci 0002:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
pci 0002:00:00.0: PCI bridge to [bus 01]
pci 0002:00:00.0:   bridge window [io
 0x8000080000032000-0x8000080000041fff]
pci 0002:00:00.0:   bridge window [mem 0xc20000000-0xc2fffffff]
pci_bus 0002:00: Some PCI device resources are unassigned, try booting with
pci=realloc
pci 0003:00:00.0: BAR 0: no space for [mem size 0x01000000]
pci 0003:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
pci 0003:00:00.0: PCI bridge to [bus 01]
pci 0003:00:00.0:   bridge window [io
 0x8000080000043000-0x8000080000052fff]
pci 0003:00:00.0:   bridge window [mem 0xc30000000-0xc3fffffff]
pci_bus 0003:00: Some PCI device resources are unassigned, try booting with
pci=realloc

[-- Attachment #1.2: Type: text/html, Size: 22630 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [error] Drm -> amdgpu Unrecoverable Machine Check
  2019-12-02 13:32   ` Yusuf Altıparmak
@ 2019-12-02 13:41     ` Christian König
  2019-12-02 14:20       ` Yusuf Altıparmak
  0 siblings, 1 reply; 12+ messages in thread
From: Christian König @ 2019-12-02 13:41 UTC (permalink / raw)
  To: Yusuf Altıparmak; +Cc: amd-gfx


[-- Attachment #1.1: Type: text/plain, Size: 19800 bytes --]

Hi Yusuf,

> At first, I am using NXP T1042D4RDB-64B which has 256 MB PCIe buffer 
> according to its. PCIe memory range was arranged to 256 MB in .dts 
> file and in U-boot configuration file. Driver was giving error with 
> exit code -12 (OUT_OF_MEMORY). But I was able to reach the linux console.

That is an expected result. 256MB is not enough for the VRAM BAR and the 
doorbell BAR to fit into. But you can still use VGA emulation that way 
if I'm not completely mistaken.

> Then I canged 256 MB to 4GB in .dtsi and U-boot conf file.
How did you do this? Is your memory layout consistent?

See when you just changed one end address you might need to adjust other 
addresses as well.

Regards,
Christian.

Am 02.12.19 um 14:32 schrieb Yusuf Altıparmak:
>
>
>>     I attached my dts file.
>>
>>     System is working fine when GPU is not plugged in.
>>     *
>>     *
>>     *This is the last console log before freeze:*
>>     [drm] amdgpu kernel modesetting enabled.
>>     [drm] initializing kernel modesetting (POLARIS12 0x1002:0x6987
>>     0x1787:0x2389 0x80).
>>     [drm] register mmio base: 0x20200000
>>     fsl-fman-port ffe488000.port fm1-gb0: renamed from eth0
>>     [drm] register mmio size: 262144
>>     [drm] add ip block number 0 <vi_common>
>>     [drm] add ip block number 1 <gmc_v8_0>
>>     [drm] add ip block number 2 <tonga_ih>
>>     [drm] add ip block number 3 <powerplay>
>>     [drm] add ip block number 4 <dm>
>>     [drm] add ip block number 5 <gfx_v8_0>
>>     [drm] add ip block number 6 <sdma_v3_0>
>>     [drm] add ip block number 7 <uvd_v6_0>
>>     [drm] add ip block number 8 <vce_v3_0>
>>     [drm] UVD is enabled in VM mode
>>     [drm] UVD ENC is enabled in VM mode
>>     [drm] VCE enabled in VM mode
>>     ATOM BIOS: 113-ER16BFC-001
>>     [drm] GPU posting now...
>>     Disabling lock debugging due to kernel taint
>>     Machine check in kernel mode.
>>     Caused by (from MCSR=a000): Load Error Report
>>     Guarded Load Error Report
>>     Kernel panic - not syncing: Unrecoverable Machine check
>>     CPU: 1 PID: 2023 Comm: udevd Tainted: G   M    4.19.26+gc0c2141 #1
>>     Call Trace:
>
>>
>>
>>     _______________________________________________
>>     amd-gfx mailing list
>>     amd-gfx@lists.freedesktop.org  <mailto:amd-gfx@lists.freedesktop.org>
>>     https://lists.freedesktop.org/mailman/listinfo/amd-gfx  <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cchristian.koenig%40amd.com%7C9be244504efb4ff9cc0a08d7772c1630%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637108903547899069&sdata=OL4UKczXinKq5FM3Cid5kYcESkHAl0esXfOB83AUPU8%3D&reserved=0>
>
>
>
> Christian König <ckoenig.leichtzumerken@gmail.com 
> <mailto:ckoenig.leichtzumerken@gmail.com>>, 2 Ara 2019 Pzt, 15:28 
> tarihinde şunu yazdı:
>
>     Hi Yusuf,
>
>     Am 02.12.19 um 12:41 schrieb Yusuf Altıparmak:
>>     My embedded board is freezing when I put E9171 on PCIe. What is
>>     the meaning of Unrecoverable Machine Check error about GPU?
>
>     Well see the explanation on Wikipedia for example:
>     https://en.wikipedia.org/wiki/Machine-check_exception
>     <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fen.wikipedia.org%2Fwiki%2FMachine-check_exception&data=02%7C01%7Cchristian.koenig%40amd.com%7C9be244504efb4ff9cc0a08d7772c1630%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637108903547909070&sdata=mBxI9LnRBO4iu5oyHc9OrrFj0K10SiOf5%2FztzmNedaI%3D&reserved=0>
>
>     In general it means you have messed up something in your hardware
>     configuration.
>
>>     Could PCIe settings in .dts file cause this problem?
>
>     Possible, but rather unlikely. My best guess is that it is some
>     problem with the power supply.
>
>>     If it is, is there any sample PCIe configuration for E9171?
>
>     The E9171 is just a PCIe device, so the dtsi is actually rather
>     uninteresting. What we really need is a full dmesg and maybe lspci
>     output would help as well.
>
>     Regards,
>     Christian.
>
>
>
> Hi Christian,
>
> At first, I am using NXP T1042D4RDB-64B which has 256 MB PCIe buffer 
> according to its. PCIe memory range was arranged to 256 MB in .dts 
> file and in U-boot configuration file. Driver was giving error with 
> exit code -12 (OUT_OF_MEMORY). But I was able to reach the linux console.
>
> [    5.512922] [drm] amdgpu kernel modesetting enabled.
> [    5.517065] [drm] initializing kernel modesetting (POLARIS12 
> 0x1002:0x6987 0x1787:0x2389 0x80).
> [    5.524507] amdgpu 0001:01:00.0: Fatal error during GPU init
> [    5.529296] amdgpu: probe of 0001:01:00.0 failed with error -12
>
> Then I canged 256 MB to 4GB in .dtsi and U-boot conf file. I also 
> changed 64KB I/O size to 1MB . When I do this, I wasn't able to reach 
> the linux console because board was freezing. But driver was 
> successfull at this time. I already mentioned successfull driver 
> console logs up.
>
> *this is lspci -v when GPU is plugged and Memory size is 256 MB.*
>
> root@t1042d4rdb-64b:~# lspci -v
> 0000:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 
> 11) (prog-if 00 [Normal decode])
>         Device tree node: 
> /sys/firmware/devicetree/base/pcie@ffe240000/pcie@0
>         Flags: bus master, fast devsel, latency 0, IRQ 20
>         Memory at <ignored> (32-bit, non-prefetchable)
>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>         I/O behind bridge: 00000000-0000ffff [size=64K]
>         Memory behind bridge: e0000000-efffffff [size=256M]
>         Prefetchable memory behind bridge: None
>         Capabilities: [44] Power Management version 3
>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>         Capabilities: [100] Advanced Error Reporting
>         Kernel driver in use: pcieport
>
> 0001:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 
> 11) (prog-if 00 [Normal decode])
>         Device tree node: 
> /sys/firmware/devicetree/base/pcie@ffe250000/pcie@0
>         Flags: bus master, fast devsel, latency 0, IRQ 21
>         Memory at <ignored> (32-bit, non-prefetchable)
>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>         I/O behind bridge: 00000000-0000ffff [size=64K]
>         Memory behind bridge: e0000000-efffffff [size=256M]
>         Prefetchable memory behind bridge: None
>         Capabilities: [44] Power Management version 3
>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>         Capabilities: [100] Advanced Error Reporting
>         Kernel driver in use: pcieport
>
> 0001:01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. 
> [AMD/ATI] Lexa [Radeon E9171 MCM] (rev 80) (prog-if 00 [VGA controller])
>         Subsystem: Hightech Information System Ltd. Device 2389
>         Flags: fast devsel, IRQ 41
>         Memory at c10000000 (64-bit, prefetchable) [size=256M]
>         Memory at <ignored> (64-bit, prefetchable)
>         I/O ports at 1100 [size=256]
>         Memory at <ignored> (32-bit, non-prefetchable)
>         Expansion ROM at <ignored> [disabled]
>         Capabilities: [48] Vendor Specific Information: Len=08 <?>
>         Capabilities: [50] Power Management version 3
>         Capabilities: [58] Express Legacy Endpoint, MSI 00
>         Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
>         Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 
> Len=010 <?>
>         Capabilities: [150] Advanced Error Reporting
>         Capabilities: [200] Resizable BAR <?>
>         Capabilities: [270] Secondary PCI Express <?>
>         Capabilities: [2b0] Address Translation Service (ATS)
>         Capabilities: [2c0] Page Request Interface (PRI)
>         Capabilities: [2d0] Process Address Space ID (PASID)
>         Capabilities: [320] Latency Tolerance Reporting
>         Capabilities: [328] Alternative Routing-ID Interpretation (ARI)
>         Capabilities: [370] L1 PM Substates
>         Kernel modules: amdgpu
>
> 0001:01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] 
> Device aae0
>         Subsystem: Hightech Information System Ltd. Device aae0
>         Flags: bus master, fast devsel, latency 0, IRQ 17
>         Memory at <ignored> (64-bit, non-prefetchable)
>         Capabilities: [48] Vendor Specific Information: Len=08 <?>
>         Capabilities: [50] Power Management version 3
>         Capabilities: [58] Express Legacy Endpoint, MSI 00
>         Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
>         Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 
> Len=010 <?>
>         Capabilities: [150] Advanced Error Reporting
>         Capabilities: [328] Alternative Routing-ID Interpretation (ARI)
>
> 0002:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 
> 11) (prog-if 00 [Normal decode])
>         Device tree node: 
> /sys/firmware/devicetree/base/pcie@ffe260000/pcie@0
>         Flags: bus master, fast devsel, latency 0, IRQ 22
>         Memory at <ignored> (32-bit, non-prefetchable)
>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>         I/O behind bridge: 00000000-0000ffff [size=64K]
>         Memory behind bridge: e0000000-efffffff [size=256M]
>         Prefetchable memory behind bridge: None
>         Capabilities: [44] Power Management version 3
>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>         Capabilities: [100] Advanced Error Reporting
>         Kernel driver in use: pcieport
>
> 0003:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 
> 11) (prog-if 00 [Normal decode])
>         Device tree node: 
> /sys/firmware/devicetree/base/pcie@ffe270000/pcie@0
>         Flags: bus master, fast devsel, latency 0, IRQ 23
>         Memory at <ignored> (32-bit, non-prefetchable)
>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>         I/O behind bridge: 00000000-0000ffff [size=64K]
>         Memory behind bridge: e0000000-efffffff [size=256M]
>         Prefetchable memory behind bridge: None
>         Capabilities: [44] Power Management version 3
>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>         Capabilities: [100] Advanced Error Reporting
>         Kernel driver in use: pcieport
>
> *AND This is PCIe dmesg message when memory range is 256MB. It's also 
> giving same message when memory range is arranged as 4GB;*
>
> PCI host bridge /pcie@ffe240000  ranges:
>  MEM 0x0000000c00000000..0x0000000c0fffffff -> 0x00000000e0000000
>   IO 0x0000000ff8000000..0x0000000ff800ffff -> 0x0000000000000000
> /pcie@ffe240000: PCICSRBAR @ 0xff000000
> setup_pci_atmu: end of DRAM 200000000
> /pcie@ffe240000: Setup 64-bit PCI DMA window
> /pcie@ffe240000: WARNING: Outbound window cfg leaves gaps in memory 
> map. Adjusting the memory map could reduce unnecessary bounce buffering.
> /pcie@ffe240000: DMA window size is 0xe0000000
> Found FSL PCI host bridge at 0x0000000ffe250000. Firmware bus number: 0->1
> PCI host bridge /pcie@ffe250000  ranges:
>  MEM 0x0000000c10000000..0x0000000c1fffffff -> 0x00000000e0000000
>   IO 0x0000000ff8010000..0x0000000ff801ffff -> 0x0000000000000000
> /pcie@ffe250000: PCICSRBAR @ 0xff000000
> setup_pci_atmu: end of DRAM 200000000
> /pcie@ffe250000: Setup 64-bit PCI DMA window
> /pcie@ffe250000: WARNING: Outbound window cfg leaves gaps in memory 
> map. Adjusting the memory map could reduce unnecessary bounce buffering.
> /pcie@ffe250000: DMA window size is 0xe0000000
> Found FSL PCI host bridge at 0x0000000ffe260000. Firmware bus number: 0->0
> PCI host bridge /pcie@ffe260000  ranges:
>  MEM 0x0000000c20000000..0x0000000c2fffffff -> 0x00000000e0000000
>   IO 0x0000000ff8020000..0x0000000ff802ffff -> 0x0000000000000000
> /pcie@ffe260000: PCICSRBAR @ 0xff000000
> setup_pci_atmu: end of DRAM 200000000
> /pcie@ffe260000: Setup 64-bit PCI DMA window
> /pcie@ffe260000: WARNING: Outbound window cfg leaves gaps in memory 
> map. Adjusting the memory map could reduce unnecessary bounce buffering.
> /pcie@ffe260000: DMA window size is 0xe0000000
> Found FSL PCI host bridge at 0x0000000ffe270000. Firmware bus number: 0->0
> PCI host bridge /pcie@ffe270000  ranges:
>  MEM 0x0000000c30000000..0x0000000c3fffffff -> 0x00000000e0000000
>   IO 0x0000000ff8030000..0x0000000ff803ffff -> 0x0000000000000000
> /pcie@ffe270000: PCICSRBAR @ 0xff000000
> setup_pci_atmu: end of DRAM 200000000
> /pcie@ffe270000: Setup 64-bit PCI DMA window
> /pcie@ffe270000: WARNING: Outbound window cfg leaves gaps in memory 
> map. Adjusting the memory map could reduce unnecessary bounce buffering.
> /pcie@ffe270000: DMA window size is 0xe0000000
> iommu: Adding device ff6000000.qman-portal to group 0
> iommu: Adding device ff6004000.qman-portal to group 1
> iommu: Adding device ff6008000.qman-portal to group 2
> iommu: Adding device ff600c000.qman-portal to group 3
> iommu: Adding device ff6010000.qman-portal to group 4
> iommu: Adding device ff6014000.qman-portal to group 5
> iommu: Adding device ff6018000.qman-portal to group 6
> iommu: Adding device ff601c000.qman-portal to group 7
> iommu: Adding device ff6020000.qman-portal to group 8
> iommu: Adding device ff6024000.qman-portal to group 9
> iommu: Adding device ffe100300.dma to group 10
> iommu: Adding device ffe101300.dma to group 11
> iommu: Adding device ffe114000.sdhc to group 12
> iommu: Adding device ffe210000.usb to group 13
> iommu: Adding device ffe211000.usb to group 14
> iommu: Adding device ffe220000.sata to group 15
> iommu: Adding device ffe221000.sata to group 16
> iommu: Adding device ffe318000.qman to group 17
> iommu: Adding device ffe31a000.bman to group 18
> iommu: Adding device ffe240000.pcie to group 19
> iommu: Adding device ffe250000.pcie to group 20
> iommu: Adding device ffe260000.pcie to group 21
> iommu: Adding device ffe270000.pcie to group 22
> iommu: Adding device ffe140000.qe to group 23
> software IO TLB: mapped [mem 0xfbfff000-0xfffff000] (64MB)
> PCI: Probing PCI hardware
> fsl-pci ffe240000.pcie: PCI host bridge to bus 0000:00
> pci_bus 0000:00: root bus resource [io 
>  0x8000080000010000-0x800008000001ffff] (bus address [0x0000-0xffff])
> pci_bus 0000:00: root bus resource [mem 0xc00000000-0xc0fffffff] (bus 
> address [0xe0000000-0xefffffff])
> pci_bus 0000:00: root bus resource [bus 00]
> iommu: Removing device ffe240000.pcie from group 19
> iommu: Adding device 0000:00:00.0 to group 24
> pci 0000:00:00.0: bridge configuration invalid ([bus 00-00]), 
> reconfiguring
> pci 0000:00:00.0: PCI bridge to [bus 01-ff]
> fsl-pci ffe250000.pcie: PCI host bridge to bus 0001:00
> pci_bus 0001:00: root bus resource [io 
>  0x8000080000021000-0x8000080000030fff] (bus address [0x0000-0xffff])
> pci_bus 0001:00: root bus resource [mem 0xc10000000-0xc1fffffff] (bus 
> address [0xe0000000-0xefffffff])
> pci_bus 0001:00: root bus resource [bus 00-01]
> iommu: Removing device ffe250000.pcie from group 20
> iommu: Adding device 0001:00:00.0 to group 19
> pci 0001:01:00.0: enabling Extended Tags
> pci 0001:01:00.0: 4.000 Gb/s available PCIe bandwidth, limited by 5 
> GT/s x1 link at 0001:00:00.0 (capable of 63.008 Gb/s with 8 GT/s x8 link)
> iommu: Adding device 0001:01:00.0 to group 19
> pci 0001:01:00.1: enabling Extended Tags
> iommu: Adding device 0001:01:00.1 to group 19
> pci 0001:00:00.0: PCI bridge to [bus 01-ff]
> fsl-pci ffe260000.pcie: PCI host bridge to bus 0002:00
> pci_bus 0002:00: root bus resource [io 
>  0x8000080000032000-0x8000080000041fff] (bus address [0x0000-0xffff])
> pci_bus 0002:00: root bus resource [mem 0xc20000000-0xc2fffffff] (bus 
> address [0xe0000000-0xefffffff])
> pci_bus 0002:00: root bus resource [bus 00]
> iommu: Removing device ffe260000.pcie from group 21
> iommu: Adding device 0002:00:00.0 to group 20
> pci 0002:00:00.0: bridge configuration invalid ([bus 00-00]), 
> reconfiguring
> pci 0002:00:00.0: PCI bridge to [bus 01-ff]
> fsl-pci ffe270000.pcie: PCI host bridge to bus 0003:00
> pci_bus 0003:00: root bus resource [io 
>  0x8000080000043000-0x8000080000052fff] (bus address [0x0000-0xffff])
> pci_bus 0003:00: root bus resource [mem 0xc30000000-0xc3fffffff] (bus 
> address [0xe0000000-0xefffffff])
> pci_bus 0003:00: root bus resource [bus 00]
> iommu: Removing device ffe270000.pcie from group 22
> iommu: Adding device 0003:00:00.0 to group 21
> pci 0003:00:00.0: bridge configuration invalid ([bus 00-00]), 
> reconfiguring
> pci 0003:00:00.0: PCI bridge to [bus 01-ff]
> PCI: Cannot allocate resource region 0 of device 0000:00:00.0, will remap
> PCI: Cannot allocate resource region 0 of device 0001:00:00.0, will remap
> PCI: Cannot allocate resource region 2 of device 0001:01:00.0, will remap
> PCI: Cannot allocate resource region 5 of device 0001:01:00.0, will remap
> PCI: Cannot allocate resource region 6 of device 0001:01:00.0, will remap
> PCI: Cannot allocate resource region 0 of device 0001:01:00.1, will remap
> PCI: Cannot allocate resource region 0 of device 0002:00:00.0, will remap
> PCI: Cannot allocate resource region 0 of device 0003:00:00.0, will remap
> pci 0000:00:00.0: BAR 0: no space for [mem size 0x01000000]
> pci 0000:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
> pci 0000:00:00.0: PCI bridge to [bus 01]
> pci 0000:00:00.0:   bridge window [io 
>  0x8000080000010000-0x800008000001ffff]
> pci 0000:00:00.0:   bridge window [mem 0xc00000000-0xc0fffffff]
> pci_bus 0000:00: Some PCI device resources are unassigned, try booting 
> with pci=realloc
> pci 0001:00:00.0: BAR 0: no space for [mem size 0x01000000]
> pci 0001:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
> pci 0001:00:00.0: BAR 9: no space for [mem size 0x00200000 64bit pref]
> pci 0001:00:00.0: BAR 9: failed to assign [mem size 0x00200000 64bit pref]
> pci 0001:01:00.0: BAR 2: no space for [mem size 0x00200000 64bit pref]
> pci 0001:01:00.0: BAR 2: failed to assign [mem size 0x00200000 64bit pref]
> pci 0001:01:00.0: BAR 5: no space for [mem size 0x00040000]
> pci 0001:01:00.0: BAR 5: failed to assign [mem size 0x00040000]
> pci 0001:01:00.0: BAR 6: no space for [mem size 0x00020000 pref]
> pci 0001:01:00.0: BAR 6: failed to assign [mem size 0x00020000 pref]
> pci 0001:01:00.1: BAR 0: no space for [mem size 0x00004000 64bit]
> pci 0001:01:00.1: BAR 0: failed to assign [mem size 0x00004000 64bit]
> pci 0001:00:00.0: PCI bridge to [bus 01]
> pci 0001:00:00.0:   bridge window [io 
>  0x8000080000021000-0x8000080000030fff]
> pci 0001:00:00.0:   bridge window [mem 0xc10000000-0xc1fffffff]
> pci_bus 0001:00: Some PCI device resources are unassigned, try booting 
> with pci=realloc
> pci 0002:00:00.0: BAR 0: no space for [mem size 0x01000000]
> pci 0002:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
> pci 0002:00:00.0: PCI bridge to [bus 01]
> pci 0002:00:00.0:   bridge window [io 
>  0x8000080000032000-0x8000080000041fff]
> pci 0002:00:00.0:   bridge window [mem 0xc20000000-0xc2fffffff]
> pci_bus 0002:00: Some PCI device resources are unassigned, try booting 
> with pci=realloc
> pci 0003:00:00.0: BAR 0: no space for [mem size 0x01000000]
> pci 0003:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
> pci 0003:00:00.0: PCI bridge to [bus 01]
> pci 0003:00:00.0:   bridge window [io 
>  0x8000080000043000-0x8000080000052fff]
> pci 0003:00:00.0:   bridge window [mem 0xc30000000-0xc3fffffff]
> pci_bus 0003:00: Some PCI device resources are unassigned, try booting 
> with pci=realloc
>
>


[-- Attachment #1.2: Type: text/html, Size: 35787 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [error] Drm -> amdgpu Unrecoverable Machine Check
  2019-12-02 13:41     ` Christian König
@ 2019-12-02 14:20       ` Yusuf Altıparmak
  2019-12-02 14:58         ` Christian König
  0 siblings, 1 reply; 12+ messages in thread
From: Yusuf Altıparmak @ 2019-12-02 14:20 UTC (permalink / raw)
  To: Christian König; +Cc: amd-gfx


[-- Attachment #1.1: Type: text/plain, Size: 22380 bytes --]

> That is an expected result. 256MB is not enough for the VRAM BAR and the
> doorbell BAR to fit into. But you can still use VGA emulation that way if
> I'm not completely mistaken.
>

Hmm, then what procedure should I follow to take a VGA output. It seems
Graphic Card does not have a VGA output. And isn't there any way to use
this GPU with a 256MB buffered PCIe?



> Then I canged 256 MB to 4GB in .dtsi and U-boot conf file.
>
> How did you do this? Is your memory layout consistent?
>
> See when you just changed one end address you might need to adjust other
> addresses as well.
>
> Regards,
> Christian.
>

It's not consistent I seems. At first I changed .dtsi like it's consistent.
I only changed PCIe1 Device Memory and I/O range. It gave same error. Then
I also changed PCIe2 and PCIe3 devices starting adress according to PCIe1
device end adress in .dtsi. I am not sure I made this correctly or not but
it gave same result again. This is why I asked "If it is, is there any
sample PCIe configuration for E9171?".


*This is a code piece from my T104xRDB.h file which is contain PCIe
configuration variables for U-boot. I made changes like it's not
consistent. RAM size is 8 GB.*

#ifdef CONFIG_PCI
/* controller 1, direct to uli, tgtid 3, Base address 20000 */
#ifdef CONFIG_PCIE1
#define CONFIG_SYS_PCIE1_MEM_VIRT *0x80000000*
#define CONFIG_SYS_PCIE1_MEM_BUS 0xe0000000
#define CONFIG_SYS_PCIE1_MEM_PHYS 0xc00000000ull
#define CONFIG_SYS_PCIE1_MEM_SIZE *0x10000000 /* 256M */*
#define CONFIG_SYS_PCIE1_IO_VIRT 0xf8000000
#define CONFIG_SYS_PCIE1_IO_BUS 0x00000000
#define CONFIG_SYS_PCIE1_IO_PHYS *0xff8000000ull*
#define CONFIG_SYS_PCIE1_IO_SIZE *0x00010000 /* 64k */*
#endif

/* controller 2, Slot 2, tgtid 2, Base address 201000 */
#ifdef CONFIG_PCIE2
#define CONFIG_SYS_PCIE2_MEM_VIRT *0x90000000*
#define CONFIG_SYS_PCIE2_MEM_BUS 0xe0000000
#define CONFIG_SYS_PCIE2_MEM_PHYS 0xc10000000ull
#define CONFIG_SYS_PCIE2_MEM_SIZE* 0x100000000 /* 4GB */*
#define CONFIG_SYS_PCIE2_IO_VIRT 0xf8010000
#define CONFIG_SYS_PCIE2_IO_BUS 0x00000000
#define CONFIG_SYS_PCIE2_IO_PHYS 0xff8010000ull
#define CONFIG_SYS_PCIE2_IO_SIZE *0x00100000 /* 1M */*
#endif

/* controller 3, Slot 1, tgtid 1, Base address 202000 */
#ifdef CONFIG_PCIE3
#define CONFIG_SYS_PCIE3_MEM_VIRT *0x190000000 /* I changed this to
0x190000000 instead of 0xa0000000 because PCIE2 end adress is changed. End
adress is to 4 GB (0x100000000 hex so I added 0x90000000 with 100000000) */*
#define CONFIG_SYS_PCIE3_MEM_BUS 0xe0000000
#define CONFIG_SYS_PCIE3_MEM_PHYS 0xd10000000ull
#define CONFIG_SYS_PCIE3_MEM_SIZE 0x10000000 /* 256M */
#define CONFIG_SYS_PCIE3_IO_VIRT 0xf8110000
#define CONFIG_SYS_PCIE3_IO_BUS 0x00000000
#define CONFIG_SYS_PCIE3_IO_PHYS *0xff8110000ull /* Did same things for IO *
#define CONFIG_SYS_PCIE3_IO_SIZE 0x00010000 /* 64k */
#endif

/* controller 4, Base address 203000 */
#ifdef CONFIG_PCIE4
#define CONFIG_SYS_PCIE4_MEM_VIRT *0x200000000 /* SAME STEP WITH PCIE3.
This time I added with 256 MB (0x10000000) */*
#define CONFIG_SYS_PCIE4_MEM_BUS 0xe0000000
#define CONFIG_SYS_PCIE4_MEM_PHYS 0xd20000000ull
#define CONFIG_SYS_PCIE4_MEM_SIZE 0x10000000 /* 256M */
#define CONFIG_SYS_PCIE4_IO_VIRT 0xf8120000
#define CONFIG_SYS_PCIE4_IO_BUS 0x00000000
#define CONFIG_SYS_PCIE4_IO_PHYS *0xff8120000ull /* Did same things for IO *
#define CONFIG_SYS_PCIE4_IO_SIZE 0x00010000 /* 64k */
#endif




Am 02.12.19 um 14:32 schrieb Yusuf Altıparmak:
>
>
>> I attached my dts file.
>>
>> System is working fine when GPU is not plugged in.
>>
>> *This is the last console log before freeze:*
>> [drm] amdgpu kernel modesetting enabled.
>>
>> [drm] initializing kernel modesetting (POLARIS12 0x1002:0x6987
>> 0x1787:0x2389 0x80).
>> [drm] register mmio base: 0x20200000
>>
>> fsl-fman-port ffe488000.port fm1-gb0: renamed from eth0
>>
>> [drm] register mmio size: 262144
>>
>> [drm] add ip block number 0 <vi_common>
>>
>> [drm] add ip block number 1 <gmc_v8_0>
>>
>> [drm] add ip block number 2 <tonga_ih>
>>
>> [drm] add ip block number 3 <powerplay>
>>
>> [drm] add ip block number 4 <dm>
>>
>> [drm] add ip block number 5 <gfx_v8_0>
>>
>> [drm] add ip block number 6 <sdma_v3_0>
>>
>> [drm] add ip block number 7 <uvd_v6_0>
>>
>> [drm] add ip block number 8 <vce_v3_0>
>>
>> [drm] UVD is enabled in VM mode
>>
>> [drm] UVD ENC is enabled in VM mode
>>
>> [drm] VCE enabled in VM mode
>>
>> ATOM BIOS: 113-ER16BFC-001
>>
>> [drm] GPU posting now...
>>
>> Disabling lock debugging due to kernel taint
>>
>> Machine check in kernel mode.
>>
>> Caused by (from MCSR=a000): Load Error Report
>>
>> Guarded Load Error Report
>>
>> Kernel panic - not syncing: Unrecoverable Machine check
>>
>> CPU: 1 PID: 2023 Comm: udevd Tainted: G   M              4.19.26+gc0c2141
>> #1
>> Call Trace:
>>
>>
>>
>> _______________________________________________
>> amd-gfx mailing listamd-gfx@lists.freedesktop.orghttps://lists.freedesktop.org/mailman/listinfo/amd-gfx <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cchristian.koenig%40amd.com%7C9be244504efb4ff9cc0a08d7772c1630%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637108903547899069&sdata=OL4UKczXinKq5FM3Cid5kYcESkHAl0esXfOB83AUPU8%3D&reserved=0>
>>
>>
>>
>
> Christian König <ckoenig.leichtzumerken@gmail.com>, 2 Ara 2019 Pzt, 15:28
> tarihinde şunu yazdı:
>
>> Hi Yusuf,
>>
>> Am 02.12.19 um 12:41 schrieb Yusuf Altıparmak:
>>
>> My embedded board is freezing when I put E9171 on PCIe. What is the
>> meaning of Unrecoverable Machine Check error about GPU?
>>
>>
>> Well see the explanation on Wikipedia for example:
>> https://en.wikipedia.org/wiki/Machine-check_exception
>> <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fen.wikipedia.org%2Fwiki%2FMachine-check_exception&data=02%7C01%7Cchristian.koenig%40amd.com%7C9be244504efb4ff9cc0a08d7772c1630%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637108903547909070&sdata=mBxI9LnRBO4iu5oyHc9OrrFj0K10SiOf5%2FztzmNedaI%3D&reserved=0>
>>
>> In general it means you have messed up something in your hardware
>> configuration.
>>
>> Could PCIe settings in .dts file cause this problem?
>>
>>
>> Possible, but rather unlikely. My best guess is that it is some problem
>> with the power supply.
>>
>> If it is, is there any sample PCIe configuration for E9171?
>>
>>
>> The E9171 is just a PCIe device, so the dtsi is actually rather
>> uninteresting. What we really need is a full dmesg and maybe lspci output
>> would help as well.
>>
>> Regards,
>> Christian.
>>
>
>
> Hi Christian,
>
> At first, I am using NXP T1042D4RDB-64B which has 256 MB PCIe buffer
> according to its. PCIe memory range was arranged to 256 MB in .dts file and
> in U-boot configuration file. Driver was giving error with exit code -12
> (OUT_OF_MEMORY). But I was able to reach the linux console.
>
> [    5.512922] [drm] amdgpu kernel modesetting enabled.
> [    5.517065] [drm] initializing kernel modesetting (POLARIS12
> 0x1002:0x6987 0x1787:0x2389 0x80).
> [    5.524507] amdgpu 0001:01:00.0: Fatal error during GPU init
> [    5.529296] amdgpu: probe of 0001:01:00.0 failed with error -12
>
> Then I canged 256 MB to 4GB in .dtsi and U-boot conf file. I also changed
> 64KB I/O size to 1MB . When I do this, I wasn't able to reach the linux
> console because board was freezing. But driver was successfull at this
> time. I already mentioned successfull driver console logs up.
>
> *this is lspci -v when GPU is plugged and Memory size is 256 MB.*
>
> root@t1042d4rdb-64b:~# lspci -v
> 0000:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 11)
> (prog-if 00 [Normal decode])
>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe240000
> /pcie@0
>         Flags: bus master, fast devsel, latency 0, IRQ 20
>         Memory at <ignored> (32-bit, non-prefetchable)
>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>         I/O behind bridge: 00000000-0000ffff [size=64K]
>         Memory behind bridge: e0000000-efffffff [size=256M]
>         Prefetchable memory behind bridge: None
>         Capabilities: [44] Power Management version 3
>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>         Capabilities: [100] Advanced Error Reporting
>         Kernel driver in use: pcieport
>
> 0001:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 11)
> (prog-if 00 [Normal decode])
>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe250000
> /pcie@0
>         Flags: bus master, fast devsel, latency 0, IRQ 21
>         Memory at <ignored> (32-bit, non-prefetchable)
>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>         I/O behind bridge: 00000000-0000ffff [size=64K]
>         Memory behind bridge: e0000000-efffffff [size=256M]
>         Prefetchable memory behind bridge: None
>         Capabilities: [44] Power Management version 3
>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>         Capabilities: [100] Advanced Error Reporting
>         Kernel driver in use: pcieport
>
> 0001:01:00.0 VGA compatible controller: Advanced Micro Devices, Inc.
> [AMD/ATI] Lexa [Radeon E9171 MCM] (rev 80) (prog-if 00 [VGA controller])
>         Subsystem: Hightech Information System Ltd. Device 2389
>         Flags: fast devsel, IRQ 41
>         Memory at c10000000 (64-bit, prefetchable) [size=256M]
>         Memory at <ignored> (64-bit, prefetchable)
>         I/O ports at 1100 [size=256]
>         Memory at <ignored> (32-bit, non-prefetchable)
>         Expansion ROM at <ignored> [disabled]
>         Capabilities: [48] Vendor Specific Information: Len=08 <?>
>         Capabilities: [50] Power Management version 3
>         Capabilities: [58] Express Legacy Endpoint, MSI 00
>         Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
>         Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1
> Len=010 <?>
>         Capabilities: [150] Advanced Error Reporting
>         Capabilities: [200] Resizable BAR <?>
>         Capabilities: [270] Secondary PCI Express <?>
>         Capabilities: [2b0] Address Translation Service (ATS)
>         Capabilities: [2c0] Page Request Interface (PRI)
>         Capabilities: [2d0] Process Address Space ID (PASID)
>         Capabilities: [320] Latency Tolerance Reporting
>         Capabilities: [328] Alternative Routing-ID Interpretation (ARI)
>         Capabilities: [370] L1 PM Substates
>         Kernel modules: amdgpu
>
> 0001:01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device
> aae0
>         Subsystem: Hightech Information System Ltd. Device aae0
>         Flags: bus master, fast devsel, latency 0, IRQ 17
>         Memory at <ignored> (64-bit, non-prefetchable)
>         Capabilities: [48] Vendor Specific Information: Len=08 <?>
>         Capabilities: [50] Power Management version 3
>         Capabilities: [58] Express Legacy Endpoint, MSI 00
>         Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
>         Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1
> Len=010 <?>
>         Capabilities: [150] Advanced Error Reporting
>         Capabilities: [328] Alternative Routing-ID Interpretation (ARI)
>
> 0002:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 11)
> (prog-if 00 [Normal decode])
>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe260000
> /pcie@0
>         Flags: bus master, fast devsel, latency 0, IRQ 22
>         Memory at <ignored> (32-bit, non-prefetchable)
>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>         I/O behind bridge: 00000000-0000ffff [size=64K]
>         Memory behind bridge: e0000000-efffffff [size=256M]
>         Prefetchable memory behind bridge: None
>         Capabilities: [44] Power Management version 3
>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>         Capabilities: [100] Advanced Error Reporting
>         Kernel driver in use: pcieport
>
> 0003:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 11)
> (prog-if 00 [Normal decode])
>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe270000
> /pcie@0
>         Flags: bus master, fast devsel, latency 0, IRQ 23
>         Memory at <ignored> (32-bit, non-prefetchable)
>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>         I/O behind bridge: 00000000-0000ffff [size=64K]
>         Memory behind bridge: e0000000-efffffff [size=256M]
>         Prefetchable memory behind bridge: None
>         Capabilities: [44] Power Management version 3
>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>         Capabilities: [100] Advanced Error Reporting
>         Kernel driver in use: pcieport
>
> *AND This is PCIe dmesg message when memory range is 256MB. It's also
> giving same message when memory range is arranged as 4GB;*
>
> PCI host bridge /pcie@ffe240000  ranges:
>  MEM 0x0000000c00000000..0x0000000c0fffffff -> 0x00000000e0000000
>   IO 0x0000000ff8000000..0x0000000ff800ffff -> 0x0000000000000000
> /pcie@ffe240000: PCICSRBAR @ 0xff000000
> setup_pci_atmu: end of DRAM 200000000
> /pcie@ffe240000: Setup 64-bit PCI DMA window
> /pcie@ffe240000: WARNING: Outbound window cfg leaves gaps in memory map.
> Adjusting the memory map could reduce unnecessary bounce buffering.
> /pcie@ffe240000: DMA window size is 0xe0000000
> Found FSL PCI host bridge at 0x0000000ffe250000. Firmware bus number: 0->1
> PCI host bridge /pcie@ffe250000  ranges:
>  MEM 0x0000000c10000000..0x0000000c1fffffff -> 0x00000000e0000000
>   IO 0x0000000ff8010000..0x0000000ff801ffff -> 0x0000000000000000
> /pcie@ffe250000: PCICSRBAR @ 0xff000000
> setup_pci_atmu: end of DRAM 200000000
> /pcie@ffe250000: Setup 64-bit PCI DMA window
> /pcie@ffe250000: WARNING: Outbound window cfg leaves gaps in memory map.
> Adjusting the memory map could reduce unnecessary bounce buffering.
> /pcie@ffe250000: DMA window size is 0xe0000000
> Found FSL PCI host bridge at 0x0000000ffe260000. Firmware bus number: 0->0
> PCI host bridge /pcie@ffe260000  ranges:
>  MEM 0x0000000c20000000..0x0000000c2fffffff -> 0x00000000e0000000
>   IO 0x0000000ff8020000..0x0000000ff802ffff -> 0x0000000000000000
> /pcie@ffe260000: PCICSRBAR @ 0xff000000
> setup_pci_atmu: end of DRAM 200000000
> /pcie@ffe260000: Setup 64-bit PCI DMA window
> /pcie@ffe260000: WARNING: Outbound window cfg leaves gaps in memory map.
> Adjusting the memory map could reduce unnecessary bounce buffering.
> /pcie@ffe260000: DMA window size is 0xe0000000
> Found FSL PCI host bridge at 0x0000000ffe270000. Firmware bus number: 0->0
> PCI host bridge /pcie@ffe270000  ranges:
>  MEM 0x0000000c30000000..0x0000000c3fffffff -> 0x00000000e0000000
>   IO 0x0000000ff8030000..0x0000000ff803ffff -> 0x0000000000000000
> /pcie@ffe270000: PCICSRBAR @ 0xff000000
> setup_pci_atmu: end of DRAM 200000000
> /pcie@ffe270000: Setup 64-bit PCI DMA window
> /pcie@ffe270000: WARNING: Outbound window cfg leaves gaps in memory map.
> Adjusting the memory map could reduce unnecessary bounce buffering.
> /pcie@ffe270000: DMA window size is 0xe0000000
> iommu: Adding device ff6000000.qman-portal to group 0
> iommu: Adding device ff6004000.qman-portal to group 1
> iommu: Adding device ff6008000.qman-portal to group 2
> iommu: Adding device ff600c000.qman-portal to group 3
> iommu: Adding device ff6010000.qman-portal to group 4
> iommu: Adding device ff6014000.qman-portal to group 5
> iommu: Adding device ff6018000.qman-portal to group 6
> iommu: Adding device ff601c000.qman-portal to group 7
> iommu: Adding device ff6020000.qman-portal to group 8
> iommu: Adding device ff6024000.qman-portal to group 9
> iommu: Adding device ffe100300.dma to group 10
> iommu: Adding device ffe101300.dma to group 11
> iommu: Adding device ffe114000.sdhc to group 12
> iommu: Adding device ffe210000.usb to group 13
> iommu: Adding device ffe211000.usb to group 14
> iommu: Adding device ffe220000.sata to group 15
> iommu: Adding device ffe221000.sata to group 16
> iommu: Adding device ffe318000.qman to group 17
> iommu: Adding device ffe31a000.bman to group 18
> iommu: Adding device ffe240000.pcie to group 19
> iommu: Adding device ffe250000.pcie to group 20
> iommu: Adding device ffe260000.pcie to group 21
> iommu: Adding device ffe270000.pcie to group 22
> iommu: Adding device ffe140000.qe to group 23
> software IO TLB: mapped [mem 0xfbfff000-0xfffff000] (64MB)
> PCI: Probing PCI hardware
> fsl-pci ffe240000.pcie: PCI host bridge to bus 0000:00
> pci_bus 0000:00: root bus resource [io
>  0x8000080000010000-0x800008000001ffff] (bus address [0x0000-0xffff])
> pci_bus 0000:00: root bus resource [mem 0xc00000000-0xc0fffffff] (bus
> address [0xe0000000-0xefffffff])
> pci_bus 0000:00: root bus resource [bus 00]
> iommu: Removing device ffe240000.pcie from group 19
> iommu: Adding device 0000:00:00.0 to group 24
> pci 0000:00:00.0: bridge configuration invalid ([bus 00-00]), reconfiguring
> pci 0000:00:00.0: PCI bridge to [bus 01-ff]
> fsl-pci ffe250000.pcie: PCI host bridge to bus 0001:00
> pci_bus 0001:00: root bus resource [io
>  0x8000080000021000-0x8000080000030fff] (bus address [0x0000-0xffff])
> pci_bus 0001:00: root bus resource [mem 0xc10000000-0xc1fffffff] (bus
> address [0xe0000000-0xefffffff])
> pci_bus 0001:00: root bus resource [bus 00-01]
> iommu: Removing device ffe250000.pcie from group 20
> iommu: Adding device 0001:00:00.0 to group 19
> pci 0001:01:00.0: enabling Extended Tags
> pci 0001:01:00.0: 4.000 Gb/s available PCIe bandwidth, limited by 5 GT/s
> x1 link at 0001:00:00.0 (capable of 63.008 Gb/s with 8 GT/s x8 link)
> iommu: Adding device 0001:01:00.0 to group 19
> pci 0001:01:00.1: enabling Extended Tags
> iommu: Adding device 0001:01:00.1 to group 19
> pci 0001:00:00.0: PCI bridge to [bus 01-ff]
> fsl-pci ffe260000.pcie: PCI host bridge to bus 0002:00
> pci_bus 0002:00: root bus resource [io
>  0x8000080000032000-0x8000080000041fff] (bus address [0x0000-0xffff])
> pci_bus 0002:00: root bus resource [mem 0xc20000000-0xc2fffffff] (bus
> address [0xe0000000-0xefffffff])
> pci_bus 0002:00: root bus resource [bus 00]
> iommu: Removing device ffe260000.pcie from group 21
> iommu: Adding device 0002:00:00.0 to group 20
> pci 0002:00:00.0: bridge configuration invalid ([bus 00-00]), reconfiguring
> pci 0002:00:00.0: PCI bridge to [bus 01-ff]
> fsl-pci ffe270000.pcie: PCI host bridge to bus 0003:00
> pci_bus 0003:00: root bus resource [io
>  0x8000080000043000-0x8000080000052fff] (bus address [0x0000-0xffff])
> pci_bus 0003:00: root bus resource [mem 0xc30000000-0xc3fffffff] (bus
> address [0xe0000000-0xefffffff])
> pci_bus 0003:00: root bus resource [bus 00]
> iommu: Removing device ffe270000.pcie from group 22
> iommu: Adding device 0003:00:00.0 to group 21
> pci 0003:00:00.0: bridge configuration invalid ([bus 00-00]), reconfiguring
> pci 0003:00:00.0: PCI bridge to [bus 01-ff]
> PCI: Cannot allocate resource region 0 of device 0000:00:00.0, will remap
> PCI: Cannot allocate resource region 0 of device 0001:00:00.0, will remap
> PCI: Cannot allocate resource region 2 of device 0001:01:00.0, will remap
> PCI: Cannot allocate resource region 5 of device 0001:01:00.0, will remap
> PCI: Cannot allocate resource region 6 of device 0001:01:00.0, will remap
> PCI: Cannot allocate resource region 0 of device 0001:01:00.1, will remap
> PCI: Cannot allocate resource region 0 of device 0002:00:00.0, will remap
> PCI: Cannot allocate resource region 0 of device 0003:00:00.0, will remap
> pci 0000:00:00.0: BAR 0: no space for [mem size 0x01000000]
> pci 0000:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
> pci 0000:00:00.0: PCI bridge to [bus 01]
> pci 0000:00:00.0:   bridge window [io
>  0x8000080000010000-0x800008000001ffff]
> pci 0000:00:00.0:   bridge window [mem 0xc00000000-0xc0fffffff]
> pci_bus 0000:00: Some PCI device resources are unassigned, try booting
> with pci=realloc
> pci 0001:00:00.0: BAR 0: no space for [mem size 0x01000000]
> pci 0001:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
> pci 0001:00:00.0: BAR 9: no space for [mem size 0x00200000 64bit pref]
> pci 0001:00:00.0: BAR 9: failed to assign [mem size 0x00200000 64bit pref]
> pci 0001:01:00.0: BAR 2: no space for [mem size 0x00200000 64bit pref]
> pci 0001:01:00.0: BAR 2: failed to assign [mem size 0x00200000 64bit pref]
> pci 0001:01:00.0: BAR 5: no space for [mem size 0x00040000]
> pci 0001:01:00.0: BAR 5: failed to assign [mem size 0x00040000]
> pci 0001:01:00.0: BAR 6: no space for [mem size 0x00020000 pref]
> pci 0001:01:00.0: BAR 6: failed to assign [mem size 0x00020000 pref]
> pci 0001:01:00.1: BAR 0: no space for [mem size 0x00004000 64bit]
> pci 0001:01:00.1: BAR 0: failed to assign [mem size 0x00004000 64bit]
> pci 0001:00:00.0: PCI bridge to [bus 01]
> pci 0001:00:00.0:   bridge window [io
>  0x8000080000021000-0x8000080000030fff]
> pci 0001:00:00.0:   bridge window [mem 0xc10000000-0xc1fffffff]
> pci_bus 0001:00: Some PCI device resources are unassigned, try booting
> with pci=realloc
> pci 0002:00:00.0: BAR 0: no space for [mem size 0x01000000]
> pci 0002:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
> pci 0002:00:00.0: PCI bridge to [bus 01]
> pci 0002:00:00.0:   bridge window [io
>  0x8000080000032000-0x8000080000041fff]
> pci 0002:00:00.0:   bridge window [mem 0xc20000000-0xc2fffffff]
> pci_bus 0002:00: Some PCI device resources are unassigned, try booting
> with pci=realloc
> pci 0003:00:00.0: BAR 0: no space for [mem size 0x01000000]
> pci 0003:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
> pci 0003:00:00.0: PCI bridge to [bus 01]
> pci 0003:00:00.0:   bridge window [io
>  0x8000080000043000-0x8000080000052fff]
> pci 0003:00:00.0:   bridge window [mem 0xc30000000-0xc3fffffff]
> pci_bus 0003:00: Some PCI device resources are unassigned, try booting
> with pci=realloc
>
>
>
>

[-- Attachment #1.2: Type: text/html, Size: 35399 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [error] Drm -> amdgpu Unrecoverable Machine Check
  2019-12-02 14:20       ` Yusuf Altıparmak
@ 2019-12-02 14:58         ` Christian König
  2019-12-02 16:31           ` Yusuf Altıparmak
  0 siblings, 1 reply; 12+ messages in thread
From: Christian König @ 2019-12-02 14:58 UTC (permalink / raw)
  To: Yusuf Altıparmak, Christian König; +Cc: amd-gfx


[-- Attachment #1.1: Type: text/plain, Size: 25269 bytes --]

Hi Yusuf,

Am 02.12.19 um 15:20 schrieb Yusuf Altıparmak:
>
>     That is an expected result. 256MB is not enough for the VRAM BAR
>     and the doorbell BAR to fit into. But you can still use VGA
>     emulation that way if I'm not completely mistaken.
>
>
> Hmm, then what procedure should I follow to take a VGA output. It 
> seems Graphic Card does not have a VGA output. And isn't there any way 
> to use this GPU with a 256MB buffered PCIe?

Most likely not. There is support for resizing the VRAM BAR, but usually 
you can only make it larger and not smaller.

Please give me the output of "sudo setpci -s 0001:01:00.0 ECAP15+4.l 
ECAP15+8.l" if you want to double check that.

>>     Then I canged 256 MB to 4GB in .dtsi and U-boot conf file.
>     How did you do this? Is your memory layout consistent?
>
>     See when you just changed one end address you might need to adjust
>     other addresses as well.
>
>     Regards,
>     Christian.
>
> It's not consistent I seems. At first I changed .dtsi like it's 
> consistent. I only changed PCIe1 Device Memory and I/O range. It gave 
> same error. Then I also changed PCIe2 and PCIe3 devices starting 
> adress according to PCIe1 device end adress in .dtsi. I am not sure I 
> made this correctly or not but it gave same result again. This is why 
> I asked "If it is, is there any sample PCIe configuration for E9171?".

Well you rather need to ask if anybody has sample PCIe configuration for 
GPUs in general. That problem is not really E9171 related. You might 
want to ask NXP for that maybe.

>
> *This is a code piece from my T104xRDB.h file which is contain PCIe 
> configuration variables for U-boot. I made changes like it's not 
> consistent. RAM size is 8 GB.
> *

Sorry, no idea if that is correct or not. You need to ask NXP for help 
with that.

Regards,
Christian.

>
> #ifdef CONFIG_PCI
> /* controller 1, direct to uli, tgtid 3, Base address 20000 */
> #ifdef CONFIG_PCIE1
> #define CONFIG_SYS_PCIE1_MEM_VIRT *0x80000000*
> #define CONFIG_SYS_PCIE1_MEM_BUS 0xe0000000
> #define CONFIG_SYS_PCIE1_MEM_PHYS 0xc00000000ull
> #define CONFIG_SYS_PCIE1_MEM_SIZE *0x10000000 /* 256M */*
> #define CONFIG_SYS_PCIE1_IO_VIRT 0xf8000000
> #define CONFIG_SYS_PCIE1_IO_BUS 0x00000000
> #define CONFIG_SYS_PCIE1_IO_PHYS *0xff8000000ull*
> #define CONFIG_SYS_PCIE1_IO_SIZE *0x00010000 /* 64k */*
> #endif
>
> /* controller 2, Slot 2, tgtid 2, Base address 201000 */
> #ifdef CONFIG_PCIE2
> #define CONFIG_SYS_PCIE2_MEM_VIRT *0x90000000*
> #define CONFIG_SYS_PCIE2_MEM_BUS 0xe0000000
> #define CONFIG_SYS_PCIE2_MEM_PHYS 0xc10000000ull
> #define CONFIG_SYS_PCIE2_MEM_SIZE*0x100000000 /* 4GB */*
> #define CONFIG_SYS_PCIE2_IO_VIRT 0xf8010000
> #define CONFIG_SYS_PCIE2_IO_BUS 0x00000000
> #define CONFIG_SYS_PCIE2_IO_PHYS 0xff8010000ull
> #define CONFIG_SYS_PCIE2_IO_SIZE *0x00100000 /* 1M */*
> #endif
>
> /* controller 3, Slot 1, tgtid 1, Base address 202000 */
> #ifdef CONFIG_PCIE3
> #define CONFIG_SYS_PCIE3_MEM_VIRT *0x190000000 /* I changed this to 
> 0x190000000 instead of 0xa0000000 because PCIE2 end adress is changed. 
> End adress is to 4 GB (0x100000000 hex so I added 0x90000000 with 
> 100000000) */*
> #define CONFIG_SYS_PCIE3_MEM_BUS 0xe0000000
> #define CONFIG_SYS_PCIE3_MEM_PHYS 0xd10000000ull
> #define CONFIG_SYS_PCIE3_MEM_SIZE 0x10000000 /* 256M */
> #define CONFIG_SYS_PCIE3_IO_VIRT 0xf8110000
> #define CONFIG_SYS_PCIE3_IO_BUS 0x00000000
> #define CONFIG_SYS_PCIE3_IO_PHYS *0xff8110000ull /* Did same things 
> for IO *
> #define CONFIG_SYS_PCIE3_IO_SIZE 0x00010000 /* 64k */
> #endif
>
> /* controller 4, Base address 203000 */
> #ifdef CONFIG_PCIE4
> #define CONFIG_SYS_PCIE4_MEM_VIRT *0x200000000 /* SAME STEP WITH 
> PCIE3. This time I added with 256 MB (0x10000000) */*
> #define CONFIG_SYS_PCIE4_MEM_BUS 0xe0000000
> #define CONFIG_SYS_PCIE4_MEM_PHYS 0xd20000000ull
> #define CONFIG_SYS_PCIE4_MEM_SIZE 0x10000000 /* 256M */
> #define CONFIG_SYS_PCIE4_IO_VIRT 0xf8120000
> #define CONFIG_SYS_PCIE4_IO_BUS 0x00000000
> #define CONFIG_SYS_PCIE4_IO_PHYS *0xff8120000ull /* Did same things 
> for IO *
> #define CONFIG_SYS_PCIE4_IO_SIZE 0x00010000 /* 64k */
> #endif
>
>
>
>
>     Am 02.12.19 um 14:32 schrieb Yusuf Altıparmak:
>>
>>
>>>         I attached my dts file.
>>>
>>>         System is working fine when GPU is not plugged in.
>>>         *
>>>         *
>>>         *This is the last console log before freeze:*
>>>         [drm] amdgpu kernel modesetting enabled.
>>>         [drm] initializing kernel modesetting (POLARIS12
>>>         0x1002:0x6987 0x1787:0x2389 0x80).
>>>         [drm] register mmio base: 0x20200000
>>>         fsl-fman-port ffe488000.port fm1-gb0: renamed from eth0
>>>         [drm] register mmio size: 262144
>>>         [drm] add ip block number 0 <vi_common>
>>>         [drm] add ip block number 1 <gmc_v8_0>
>>>         [drm] add ip block number 2 <tonga_ih>
>>>         [drm] add ip block number 3 <powerplay>
>>>         [drm] add ip block number 4 <dm>
>>>         [drm] add ip block number 5 <gfx_v8_0>
>>>         [drm] add ip block number 6 <sdma_v3_0>
>>>         [drm] add ip block number 7 <uvd_v6_0>
>>>         [drm] add ip block number 8 <vce_v3_0>
>>>         [drm] UVD is enabled in VM mode
>>>         [drm] UVD ENC is enabled in VM mode
>>>         [drm] VCE enabled in VM mode
>>>         ATOM BIOS: 113-ER16BFC-001
>>>         [drm] GPU posting now...
>>>         Disabling lock debugging due to kernel taint
>>>         Machine check in kernel mode.
>>>         Caused by (from MCSR=a000): Load Error Report
>>>         Guarded Load Error Report
>>>         Kernel panic - not syncing: Unrecoverable Machine check
>>>         CPU: 1 PID: 2023 Comm: udevd Tainted: G   M            
>>>          4.19.26+gc0c2141 #1
>>>         Call Trace:
>>
>>>
>>>
>>>         _______________________________________________
>>>         amd-gfx mailing list
>>>         amd-gfx@lists.freedesktop.org  <mailto:amd-gfx@lists.freedesktop.org>
>>>         https://lists.freedesktop.org/mailman/listinfo/amd-gfx  <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cchristian.koenig%40amd.com%7C9be244504efb4ff9cc0a08d7772c1630%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637108903547899069&sdata=OL4UKczXinKq5FM3Cid5kYcESkHAl0esXfOB83AUPU8%3D&reserved=0>
>>
>>
>>
>>     Christian König <ckoenig.leichtzumerken@gmail.com
>>     <mailto:ckoenig.leichtzumerken@gmail.com>>, 2 Ara 2019 Pzt, 15:28
>>     tarihinde şunu yazdı:
>>
>>         Hi Yusuf,
>>
>>         Am 02.12.19 um 12:41 schrieb Yusuf Altıparmak:
>>>         My embedded board is freezing when I put E9171 on PCIe. What
>>>         is the meaning of Unrecoverable Machine Check error about GPU?
>>
>>         Well see the explanation on Wikipedia for example:
>>         https://en.wikipedia.org/wiki/Machine-check_exception
>>         <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fen.wikipedia.org%2Fwiki%2FMachine-check_exception&data=02%7C01%7Cchristian.koenig%40amd.com%7C9be244504efb4ff9cc0a08d7772c1630%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637108903547909070&sdata=mBxI9LnRBO4iu5oyHc9OrrFj0K10SiOf5%2FztzmNedaI%3D&reserved=0>
>>
>>         In general it means you have messed up something in your
>>         hardware configuration.
>>
>>>         Could PCIe settings in .dts file cause this problem?
>>
>>         Possible, but rather unlikely. My best guess is that it is
>>         some problem with the power supply.
>>
>>>         If it is, is there any sample PCIe configuration for E9171?
>>
>>         The E9171 is just a PCIe device, so the dtsi is actually
>>         rather uninteresting. What we really need is a full dmesg and
>>         maybe lspci output would help as well.
>>
>>         Regards,
>>         Christian.
>>
>>
>>
>>     Hi Christian,
>>
>>     At first, I am using NXP T1042D4RDB-64B which has 256 MB PCIe
>>     buffer according to its. PCIe memory range was arranged to 256 MB
>>     in .dts file and in U-boot configuration file. Driver was giving
>>     error with exit code -12 (OUT_OF_MEMORY). But I was able to reach
>>     the linux console.
>>
>>     [    5.512922] [drm] amdgpu kernel modesetting enabled.
>>     [    5.517065] [drm] initializing kernel modesetting (POLARIS12
>>     0x1002:0x6987 0x1787:0x2389 0x80).
>>     [    5.524507] amdgpu 0001:01:00.0: Fatal error during GPU init
>>     [    5.529296] amdgpu: probe of 0001:01:00.0 failed with error -12
>>
>>     Then I canged 256 MB to 4GB in .dtsi and U-boot conf file. I also
>>     changed 64KB I/O size to 1MB . When I do this, I wasn't able to
>>     reach the linux console because board was freezing. But driver
>>     was successfull at this time. I already mentioned successfull
>>     driver console logs up.
>>
>>     *this is lspci -v when GPU is plugged and Memory size is 256 MB.*
>>
>>     root@t1042d4rdb-64b:~# lspci -v
>>     0000:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824
>>     (rev 11) (prog-if 00 [Normal decode])
>>             Device tree node:
>>     /sys/firmware/devicetree/base/pcie@ffe240000/pcie@0
>>             Flags: bus master, fast devsel, latency 0, IRQ 20
>>             Memory at <ignored> (32-bit, non-prefetchable)
>>             Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>             I/O behind bridge: 00000000-0000ffff [size=64K]
>>             Memory behind bridge: e0000000-efffffff [size=256M]
>>             Prefetchable memory behind bridge: None
>>             Capabilities: [44] Power Management version 3
>>             Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>             Capabilities: [100] Advanced Error Reporting
>>             Kernel driver in use: pcieport
>>
>>     0001:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824
>>     (rev 11) (prog-if 00 [Normal decode])
>>             Device tree node:
>>     /sys/firmware/devicetree/base/pcie@ffe250000/pcie@0
>>             Flags: bus master, fast devsel, latency 0, IRQ 21
>>             Memory at <ignored> (32-bit, non-prefetchable)
>>             Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>             I/O behind bridge: 00000000-0000ffff [size=64K]
>>             Memory behind bridge: e0000000-efffffff [size=256M]
>>             Prefetchable memory behind bridge: None
>>             Capabilities: [44] Power Management version 3
>>             Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>             Capabilities: [100] Advanced Error Reporting
>>             Kernel driver in use: pcieport
>>
>>     0001:01:00.0 VGA compatible controller: Advanced Micro Devices,
>>     Inc. [AMD/ATI] Lexa [Radeon E9171 MCM] (rev 80) (prog-if 00 [VGA
>>     controller])
>>             Subsystem: Hightech Information System Ltd. Device 2389
>>             Flags: fast devsel, IRQ 41
>>             Memory at c10000000 (64-bit, prefetchable) [size=256M]
>>             Memory at <ignored> (64-bit, prefetchable)
>>             I/O ports at 1100 [size=256]
>>             Memory at <ignored> (32-bit, non-prefetchable)
>>             Expansion ROM at <ignored> [disabled]
>>             Capabilities: [48] Vendor Specific Information: Len=08 <?>
>>             Capabilities: [50] Power Management version 3
>>             Capabilities: [58] Express Legacy Endpoint, MSI 00
>>             Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
>>             Capabilities: [100] Vendor Specific Information: ID=0001
>>     Rev=1 Len=010 <?>
>>             Capabilities: [150] Advanced Error Reporting
>>             Capabilities: [200] Resizable BAR <?>
>>             Capabilities: [270] Secondary PCI Express <?>
>>             Capabilities: [2b0] Address Translation Service (ATS)
>>             Capabilities: [2c0] Page Request Interface (PRI)
>>             Capabilities: [2d0] Process Address Space ID (PASID)
>>             Capabilities: [320] Latency Tolerance Reporting
>>             Capabilities: [328] Alternative Routing-ID Interpretation
>>     (ARI)
>>             Capabilities: [370] L1 PM Substates
>>             Kernel modules: amdgpu
>>
>>     0001:01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI]
>>     Device aae0
>>             Subsystem: Hightech Information System Ltd. Device aae0
>>             Flags: bus master, fast devsel, latency 0, IRQ 17
>>             Memory at <ignored> (64-bit, non-prefetchable)
>>             Capabilities: [48] Vendor Specific Information: Len=08 <?>
>>             Capabilities: [50] Power Management version 3
>>             Capabilities: [58] Express Legacy Endpoint, MSI 00
>>             Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
>>             Capabilities: [100] Vendor Specific Information: ID=0001
>>     Rev=1 Len=010 <?>
>>             Capabilities: [150] Advanced Error Reporting
>>             Capabilities: [328] Alternative Routing-ID Interpretation
>>     (ARI)
>>
>>     0002:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824
>>     (rev 11) (prog-if 00 [Normal decode])
>>             Device tree node:
>>     /sys/firmware/devicetree/base/pcie@ffe260000/pcie@0
>>             Flags: bus master, fast devsel, latency 0, IRQ 22
>>             Memory at <ignored> (32-bit, non-prefetchable)
>>             Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>             I/O behind bridge: 00000000-0000ffff [size=64K]
>>             Memory behind bridge: e0000000-efffffff [size=256M]
>>             Prefetchable memory behind bridge: None
>>             Capabilities: [44] Power Management version 3
>>             Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>             Capabilities: [100] Advanced Error Reporting
>>             Kernel driver in use: pcieport
>>
>>     0003:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824
>>     (rev 11) (prog-if 00 [Normal decode])
>>             Device tree node:
>>     /sys/firmware/devicetree/base/pcie@ffe270000/pcie@0
>>             Flags: bus master, fast devsel, latency 0, IRQ 23
>>             Memory at <ignored> (32-bit, non-prefetchable)
>>             Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>             I/O behind bridge: 00000000-0000ffff [size=64K]
>>             Memory behind bridge: e0000000-efffffff [size=256M]
>>             Prefetchable memory behind bridge: None
>>             Capabilities: [44] Power Management version 3
>>             Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>             Capabilities: [100] Advanced Error Reporting
>>             Kernel driver in use: pcieport
>>
>>     *AND This is PCIe dmesg message when memory range is 256MB. It's
>>     also giving same message when memory range is arranged as 4GB;*
>>
>>     PCI host bridge /pcie@ffe240000  ranges:
>>      MEM 0x0000000c00000000..0x0000000c0fffffff -> 0x00000000e0000000
>>       IO 0x0000000ff8000000..0x0000000ff800ffff -> 0x0000000000000000
>>     /pcie@ffe240000: PCICSRBAR @ 0xff000000
>>     setup_pci_atmu: end of DRAM 200000000
>>     /pcie@ffe240000: Setup 64-bit PCI DMA window
>>     /pcie@ffe240000: WARNING: Outbound window cfg leaves gaps in
>>     memory map. Adjusting the memory map could reduce unnecessary
>>     bounce buffering.
>>     /pcie@ffe240000: DMA window size is 0xe0000000
>>     Found FSL PCI host bridge at 0x0000000ffe250000. Firmware bus
>>     number: 0->1
>>     PCI host bridge /pcie@ffe250000  ranges:
>>      MEM 0x0000000c10000000..0x0000000c1fffffff -> 0x00000000e0000000
>>       IO 0x0000000ff8010000..0x0000000ff801ffff -> 0x0000000000000000
>>     /pcie@ffe250000: PCICSRBAR @ 0xff000000
>>     setup_pci_atmu: end of DRAM 200000000
>>     /pcie@ffe250000: Setup 64-bit PCI DMA window
>>     /pcie@ffe250000: WARNING: Outbound window cfg leaves gaps in
>>     memory map. Adjusting the memory map could reduce unnecessary
>>     bounce buffering.
>>     /pcie@ffe250000: DMA window size is 0xe0000000
>>     Found FSL PCI host bridge at 0x0000000ffe260000. Firmware bus
>>     number: 0->0
>>     PCI host bridge /pcie@ffe260000  ranges:
>>      MEM 0x0000000c20000000..0x0000000c2fffffff -> 0x00000000e0000000
>>       IO 0x0000000ff8020000..0x0000000ff802ffff -> 0x0000000000000000
>>     /pcie@ffe260000: PCICSRBAR @ 0xff000000
>>     setup_pci_atmu: end of DRAM 200000000
>>     /pcie@ffe260000: Setup 64-bit PCI DMA window
>>     /pcie@ffe260000: WARNING: Outbound window cfg leaves gaps in
>>     memory map. Adjusting the memory map could reduce unnecessary
>>     bounce buffering.
>>     /pcie@ffe260000: DMA window size is 0xe0000000
>>     Found FSL PCI host bridge at 0x0000000ffe270000. Firmware bus
>>     number: 0->0
>>     PCI host bridge /pcie@ffe270000  ranges:
>>      MEM 0x0000000c30000000..0x0000000c3fffffff -> 0x00000000e0000000
>>       IO 0x0000000ff8030000..0x0000000ff803ffff -> 0x0000000000000000
>>     /pcie@ffe270000: PCICSRBAR @ 0xff000000
>>     setup_pci_atmu: end of DRAM 200000000
>>     /pcie@ffe270000: Setup 64-bit PCI DMA window
>>     /pcie@ffe270000: WARNING: Outbound window cfg leaves gaps in
>>     memory map. Adjusting the memory map could reduce unnecessary
>>     bounce buffering.
>>     /pcie@ffe270000: DMA window size is 0xe0000000
>>     iommu: Adding device ff6000000.qman-portal to group 0
>>     iommu: Adding device ff6004000.qman-portal to group 1
>>     iommu: Adding device ff6008000.qman-portal to group 2
>>     iommu: Adding device ff600c000.qman-portal to group 3
>>     iommu: Adding device ff6010000.qman-portal to group 4
>>     iommu: Adding device ff6014000.qman-portal to group 5
>>     iommu: Adding device ff6018000.qman-portal to group 6
>>     iommu: Adding device ff601c000.qman-portal to group 7
>>     iommu: Adding device ff6020000.qman-portal to group 8
>>     iommu: Adding device ff6024000.qman-portal to group 9
>>     iommu: Adding device ffe100300.dma to group 10
>>     iommu: Adding device ffe101300.dma to group 11
>>     iommu: Adding device ffe114000.sdhc to group 12
>>     iommu: Adding device ffe210000.usb to group 13
>>     iommu: Adding device ffe211000.usb to group 14
>>     iommu: Adding device ffe220000.sata to group 15
>>     iommu: Adding device ffe221000.sata to group 16
>>     iommu: Adding device ffe318000.qman to group 17
>>     iommu: Adding device ffe31a000.bman to group 18
>>     iommu: Adding device ffe240000.pcie to group 19
>>     iommu: Adding device ffe250000.pcie to group 20
>>     iommu: Adding device ffe260000.pcie to group 21
>>     iommu: Adding device ffe270000.pcie to group 22
>>     iommu: Adding device ffe140000.qe to group 23
>>     software IO TLB: mapped [mem 0xfbfff000-0xfffff000] (64MB)
>>     PCI: Probing PCI hardware
>>     fsl-pci ffe240000.pcie: PCI host bridge to bus 0000:00
>>     pci_bus 0000:00: root bus resource [io
>>      0x8000080000010000-0x800008000001ffff] (bus address [0x0000-0xffff])
>>     pci_bus 0000:00: root bus resource [mem 0xc00000000-0xc0fffffff]
>>     (bus address [0xe0000000-0xefffffff])
>>     pci_bus 0000:00: root bus resource [bus 00]
>>     iommu: Removing device ffe240000.pcie from group 19
>>     iommu: Adding device 0000:00:00.0 to group 24
>>     pci 0000:00:00.0: bridge configuration invalid ([bus 00-00]),
>>     reconfiguring
>>     pci 0000:00:00.0: PCI bridge to [bus 01-ff]
>>     fsl-pci ffe250000.pcie: PCI host bridge to bus 0001:00
>>     pci_bus 0001:00: root bus resource [io
>>      0x8000080000021000-0x8000080000030fff] (bus address [0x0000-0xffff])
>>     pci_bus 0001:00: root bus resource [mem 0xc10000000-0xc1fffffff]
>>     (bus address [0xe0000000-0xefffffff])
>>     pci_bus 0001:00: root bus resource [bus 00-01]
>>     iommu: Removing device ffe250000.pcie from group 20
>>     iommu: Adding device 0001:00:00.0 to group 19
>>     pci 0001:01:00.0: enabling Extended Tags
>>     pci 0001:01:00.0: 4.000 Gb/s available PCIe bandwidth, limited by
>>     5 GT/s x1 link at 0001:00:00.0 (capable of 63.008 Gb/s with 8
>>     GT/s x8 link)
>>     iommu: Adding device 0001:01:00.0 to group 19
>>     pci 0001:01:00.1: enabling Extended Tags
>>     iommu: Adding device 0001:01:00.1 to group 19
>>     pci 0001:00:00.0: PCI bridge to [bus 01-ff]
>>     fsl-pci ffe260000.pcie: PCI host bridge to bus 0002:00
>>     pci_bus 0002:00: root bus resource [io
>>      0x8000080000032000-0x8000080000041fff] (bus address [0x0000-0xffff])
>>     pci_bus 0002:00: root bus resource [mem 0xc20000000-0xc2fffffff]
>>     (bus address [0xe0000000-0xefffffff])
>>     pci_bus 0002:00: root bus resource [bus 00]
>>     iommu: Removing device ffe260000.pcie from group 21
>>     iommu: Adding device 0002:00:00.0 to group 20
>>     pci 0002:00:00.0: bridge configuration invalid ([bus 00-00]),
>>     reconfiguring
>>     pci 0002:00:00.0: PCI bridge to [bus 01-ff]
>>     fsl-pci ffe270000.pcie: PCI host bridge to bus 0003:00
>>     pci_bus 0003:00: root bus resource [io
>>      0x8000080000043000-0x8000080000052fff] (bus address [0x0000-0xffff])
>>     pci_bus 0003:00: root bus resource [mem 0xc30000000-0xc3fffffff]
>>     (bus address [0xe0000000-0xefffffff])
>>     pci_bus 0003:00: root bus resource [bus 00]
>>     iommu: Removing device ffe270000.pcie from group 22
>>     iommu: Adding device 0003:00:00.0 to group 21
>>     pci 0003:00:00.0: bridge configuration invalid ([bus 00-00]),
>>     reconfiguring
>>     pci 0003:00:00.0: PCI bridge to [bus 01-ff]
>>     PCI: Cannot allocate resource region 0 of device 0000:00:00.0,
>>     will remap
>>     PCI: Cannot allocate resource region 0 of device 0001:00:00.0,
>>     will remap
>>     PCI: Cannot allocate resource region 2 of device 0001:01:00.0,
>>     will remap
>>     PCI: Cannot allocate resource region 5 of device 0001:01:00.0,
>>     will remap
>>     PCI: Cannot allocate resource region 6 of device 0001:01:00.0,
>>     will remap
>>     PCI: Cannot allocate resource region 0 of device 0001:01:00.1,
>>     will remap
>>     PCI: Cannot allocate resource region 0 of device 0002:00:00.0,
>>     will remap
>>     PCI: Cannot allocate resource region 0 of device 0003:00:00.0,
>>     will remap
>>     pci 0000:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>     pci 0000:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>     pci 0000:00:00.0: PCI bridge to [bus 01]
>>     pci 0000:00:00.0:   bridge window [io
>>      0x8000080000010000-0x800008000001ffff]
>>     pci 0000:00:00.0:   bridge window [mem 0xc00000000-0xc0fffffff]
>>     pci_bus 0000:00: Some PCI device resources are unassigned, try
>>     booting with pci=realloc
>>     pci 0001:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>     pci 0001:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>     pci 0001:00:00.0: BAR 9: no space for [mem size 0x00200000 64bit
>>     pref]
>>     pci 0001:00:00.0: BAR 9: failed to assign [mem size 0x00200000
>>     64bit pref]
>>     pci 0001:01:00.0: BAR 2: no space for [mem size 0x00200000 64bit
>>     pref]
>>     pci 0001:01:00.0: BAR 2: failed to assign [mem size 0x00200000
>>     64bit pref]
>>     pci 0001:01:00.0: BAR 5: no space for [mem size 0x00040000]
>>     pci 0001:01:00.0: BAR 5: failed to assign [mem size 0x00040000]
>>     pci 0001:01:00.0: BAR 6: no space for [mem size 0x00020000 pref]
>>     pci 0001:01:00.0: BAR 6: failed to assign [mem size 0x00020000 pref]
>>     pci 0001:01:00.1: BAR 0: no space for [mem size 0x00004000 64bit]
>>     pci 0001:01:00.1: BAR 0: failed to assign [mem size 0x00004000 64bit]
>>     pci 0001:00:00.0: PCI bridge to [bus 01]
>>     pci 0001:00:00.0:   bridge window [io
>>      0x8000080000021000-0x8000080000030fff]
>>     pci 0001:00:00.0:   bridge window [mem 0xc10000000-0xc1fffffff]
>>     pci_bus 0001:00: Some PCI device resources are unassigned, try
>>     booting with pci=realloc
>>     pci 0002:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>     pci 0002:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>     pci 0002:00:00.0: PCI bridge to [bus 01]
>>     pci 0002:00:00.0:   bridge window [io
>>      0x8000080000032000-0x8000080000041fff]
>>     pci 0002:00:00.0:   bridge window [mem 0xc20000000-0xc2fffffff]
>>     pci_bus 0002:00: Some PCI device resources are unassigned, try
>>     booting with pci=realloc
>>     pci 0003:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>     pci 0003:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>     pci 0003:00:00.0: PCI bridge to [bus 01]
>>     pci 0003:00:00.0:   bridge window [io
>>      0x8000080000043000-0x8000080000052fff]
>>     pci 0003:00:00.0:   bridge window [mem 0xc30000000-0xc3fffffff]
>>     pci_bus 0003:00: Some PCI device resources are unassigned, try
>>     booting with pci=realloc
>>
>>
>
>
> _______________________________________________
> amd-gfx mailing list
> amd-gfx@lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/amd-gfx


[-- Attachment #1.2: Type: text/html, Size: 45940 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [error] Drm -> amdgpu Unrecoverable Machine Check
  2019-12-02 14:58         ` Christian König
@ 2019-12-02 16:31           ` Yusuf Altıparmak
  2019-12-03 10:31             ` Yusuf Altıparmak
  0 siblings, 1 reply; 12+ messages in thread
From: Yusuf Altıparmak @ 2019-12-02 16:31 UTC (permalink / raw)
  To: Christian König; +Cc: amd-gfx


[-- Attachment #1.1: Type: text/plain, Size: 20414 bytes --]

>
> Most likely not. There is support for resizing the VRAM BAR, but usually
> you can only make it larger and not smaller.
> Please give me the output of "sudo setpci -s 0001:01:00.0 ECAP15+4.l
> ECAP15+8.l" if you want to double check that.
>

Okay I'll try it tomorrow. What does the " sudo setpci -s 0001:01:00.0
ECAP15+4.l ECAP15+8.l" command exactly do ?



> Well you rather need to ask if anybody has sample PCIe configuration for
> GPUs in general. That problem is not really E9171 related. You might want
> to ask NXP for that maybe.
> Sorry, no idea if that is correct or not. You need to ask NXP for help
> with that.
>
>
Okay no problem. At least I know what is the missing point now. The problem
is probably because of the .dtsi and u-boot config files. Memory ranges are
overwriting like you said. I'll ask nxp to take some sample PCIe
configuration for GPUs.

Thank you for your interest Christian.
Regards .


>
> Am 02.12.19 um 14:32 schrieb Yusuf Altıparmak:
>>
>>
>>> I attached my dts file.
>>>
>>> System is working fine when GPU is not plugged in.
>>>
>>> *This is the last console log before freeze:*
>>> [drm] amdgpu kernel modesetting enabled.
>>>
>>> [drm] initializing kernel modesetting (POLARIS12 0x1002:0x6987
>>> 0x1787:0x2389 0x80).
>>> [drm] register mmio base: 0x20200000
>>>
>>> fsl-fman-port ffe488000.port fm1-gb0: renamed from eth0
>>>
>>> [drm] register mmio size: 262144
>>>
>>> [drm] add ip block number 0 <vi_common>
>>>
>>> [drm] add ip block number 1 <gmc_v8_0>
>>>
>>> [drm] add ip block number 2 <tonga_ih>
>>>
>>> [drm] add ip block number 3 <powerplay>
>>>
>>> [drm] add ip block number 4 <dm>
>>>
>>> [drm] add ip block number 5 <gfx_v8_0>
>>>
>>> [drm] add ip block number 6 <sdma_v3_0>
>>>
>>> [drm] add ip block number 7 <uvd_v6_0>
>>>
>>> [drm] add ip block number 8 <vce_v3_0>
>>>
>>> [drm] UVD is enabled in VM mode
>>>
>>> [drm] UVD ENC is enabled in VM mode
>>>
>>> [drm] VCE enabled in VM mode
>>>
>>> ATOM BIOS: 113-ER16BFC-001
>>>
>>> [drm] GPU posting now...
>>>
>>> Disabling lock debugging due to kernel taint
>>>
>>> Machine check in kernel mode.
>>>
>>> Caused by (from MCSR=a000): Load Error Report
>>>
>>> Guarded Load Error Report
>>>
>>> Kernel panic - not syncing: Unrecoverable Machine check
>>>
>>> CPU: 1 PID: 2023 Comm: udevd Tainted: G   M
>>>  4.19.26+gc0c2141 #1
>>> Call Trace:
>>>
>>>
>>>
>>> _______________________________________________
>>> amd-gfx mailing listamd-gfx@lists.freedesktop.orghttps://lists.freedesktop.org/mailman/listinfo/amd-gfx <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cchristian.koenig%40amd.com%7C9be244504efb4ff9cc0a08d7772c1630%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637108903547899069&sdata=OL4UKczXinKq5FM3Cid5kYcESkHAl0esXfOB83AUPU8%3D&reserved=0>
>>>
>>>
>>>
>>
>> Christian König <ckoenig.leichtzumerken@gmail.com>, 2 Ara 2019 Pzt,
>> 15:28 tarihinde şunu yazdı:
>>
>>> Hi Yusuf,
>>>
>>> Am 02.12.19 um 12:41 schrieb Yusuf Altıparmak:
>>>
>>> My embedded board is freezing when I put E9171 on PCIe. What is the
>>> meaning of Unrecoverable Machine Check error about GPU?
>>>
>>>
>>> Well see the explanation on Wikipedia for example:
>>> https://en.wikipedia.org/wiki/Machine-check_exception
>>> <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fen.wikipedia.org%2Fwiki%2FMachine-check_exception&data=02%7C01%7Cchristian.koenig%40amd.com%7C9be244504efb4ff9cc0a08d7772c1630%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637108903547909070&sdata=mBxI9LnRBO4iu5oyHc9OrrFj0K10SiOf5%2FztzmNedaI%3D&reserved=0>
>>>
>>> In general it means you have messed up something in your hardware
>>> configuration.
>>>
>>> Could PCIe settings in .dts file cause this problem?
>>>
>>>
>>> Possible, but rather unlikely. My best guess is that it is some problem
>>> with the power supply.
>>>
>>> If it is, is there any sample PCIe configuration for E9171?
>>>
>>>
>>> The E9171 is just a PCIe device, so the dtsi is actually rather
>>> uninteresting. What we really need is a full dmesg and maybe lspci output
>>> would help as well.
>>>
>>> Regards,
>>> Christian.
>>>
>>
>>
>> Hi Christian,
>>
>> At first, I am using NXP T1042D4RDB-64B which has 256 MB PCIe buffer
>> according to its. PCIe memory range was arranged to 256 MB in .dts file and
>> in U-boot configuration file. Driver was giving error with exit code -12
>> (OUT_OF_MEMORY). But I was able to reach the linux console.
>>
>> [    5.512922] [drm] amdgpu kernel modesetting enabled.
>> [    5.517065] [drm] initializing kernel modesetting (POLARIS12
>> 0x1002:0x6987 0x1787:0x2389 0x80).
>> [    5.524507] amdgpu 0001:01:00.0: Fatal error during GPU init
>> [    5.529296] amdgpu: probe of 0001:01:00.0 failed with error -12
>>
>> Then I canged 256 MB to 4GB in .dtsi and U-boot conf file. I also changed
>> 64KB I/O size to 1MB . When I do this, I wasn't able to reach the linux
>> console because board was freezing. But driver was successfull at this
>> time. I already mentioned successfull driver console logs up.
>>
>> *this is lspci -v when GPU is plugged and Memory size is 256 MB.*
>>
>> root@t1042d4rdb-64b:~# lspci -v
>> 0000:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 11)
>> (prog-if 00 [Normal decode])
>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe240000
>> /pcie@0
>>         Flags: bus master, fast devsel, latency 0, IRQ 20
>>         Memory at <ignored> (32-bit, non-prefetchable)
>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>         Prefetchable memory behind bridge: None
>>         Capabilities: [44] Power Management version 3
>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>         Capabilities: [100] Advanced Error Reporting
>>         Kernel driver in use: pcieport
>>
>> 0001:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 11)
>> (prog-if 00 [Normal decode])
>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe250000
>> /pcie@0
>>         Flags: bus master, fast devsel, latency 0, IRQ 21
>>         Memory at <ignored> (32-bit, non-prefetchable)
>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>         Prefetchable memory behind bridge: None
>>         Capabilities: [44] Power Management version 3
>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>         Capabilities: [100] Advanced Error Reporting
>>         Kernel driver in use: pcieport
>>
>> 0001:01:00.0 VGA compatible controller: Advanced Micro Devices, Inc.
>> [AMD/ATI] Lexa [Radeon E9171 MCM] (rev 80) (prog-if 00 [VGA controller])
>>         Subsystem: Hightech Information System Ltd. Device 2389
>>         Flags: fast devsel, IRQ 41
>>         Memory at c10000000 (64-bit, prefetchable) [size=256M]
>>         Memory at <ignored> (64-bit, prefetchable)
>>         I/O ports at 1100 [size=256]
>>         Memory at <ignored> (32-bit, non-prefetchable)
>>         Expansion ROM at <ignored> [disabled]
>>         Capabilities: [48] Vendor Specific Information: Len=08 <?>
>>         Capabilities: [50] Power Management version 3
>>         Capabilities: [58] Express Legacy Endpoint, MSI 00
>>         Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
>>         Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1
>> Len=010 <?>
>>         Capabilities: [150] Advanced Error Reporting
>>         Capabilities: [200] Resizable BAR <?>
>>         Capabilities: [270] Secondary PCI Express <?>
>>         Capabilities: [2b0] Address Translation Service (ATS)
>>         Capabilities: [2c0] Page Request Interface (PRI)
>>         Capabilities: [2d0] Process Address Space ID (PASID)
>>         Capabilities: [320] Latency Tolerance Reporting
>>         Capabilities: [328] Alternative Routing-ID Interpretation (ARI)
>>         Capabilities: [370] L1 PM Substates
>>         Kernel modules: amdgpu
>>
>> 0001:01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device
>> aae0
>>         Subsystem: Hightech Information System Ltd. Device aae0
>>         Flags: bus master, fast devsel, latency 0, IRQ 17
>>         Memory at <ignored> (64-bit, non-prefetchable)
>>         Capabilities: [48] Vendor Specific Information: Len=08 <?>
>>         Capabilities: [50] Power Management version 3
>>         Capabilities: [58] Express Legacy Endpoint, MSI 00
>>         Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
>>         Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1
>> Len=010 <?>
>>         Capabilities: [150] Advanced Error Reporting
>>         Capabilities: [328] Alternative Routing-ID Interpretation (ARI)
>>
>> 0002:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 11)
>> (prog-if 00 [Normal decode])
>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe260000
>> /pcie@0
>>         Flags: bus master, fast devsel, latency 0, IRQ 22
>>         Memory at <ignored> (32-bit, non-prefetchable)
>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>         Prefetchable memory behind bridge: None
>>         Capabilities: [44] Power Management version 3
>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>         Capabilities: [100] Advanced Error Reporting
>>         Kernel driver in use: pcieport
>>
>> 0003:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev 11)
>> (prog-if 00 [Normal decode])
>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe270000
>> /pcie@0
>>         Flags: bus master, fast devsel, latency 0, IRQ 23
>>         Memory at <ignored> (32-bit, non-prefetchable)
>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>         Prefetchable memory behind bridge: None
>>         Capabilities: [44] Power Management version 3
>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>         Capabilities: [100] Advanced Error Reporting
>>         Kernel driver in use: pcieport
>>
>> *AND This is PCIe dmesg message when memory range is 256MB. It's also
>> giving same message when memory range is arranged as 4GB;*
>>
>> PCI host bridge /pcie@ffe240000  ranges:
>>  MEM 0x0000000c00000000..0x0000000c0fffffff -> 0x00000000e0000000
>>   IO 0x0000000ff8000000..0x0000000ff800ffff -> 0x0000000000000000
>> /pcie@ffe240000: PCICSRBAR @ 0xff000000
>> setup_pci_atmu: end of DRAM 200000000
>> /pcie@ffe240000: Setup 64-bit PCI DMA window
>> /pcie@ffe240000: WARNING: Outbound window cfg leaves gaps in memory map.
>> Adjusting the memory map could reduce unnecessary bounce buffering.
>> /pcie@ffe240000: DMA window size is 0xe0000000
>> Found FSL PCI host bridge at 0x0000000ffe250000. Firmware bus number: 0->1
>> PCI host bridge /pcie@ffe250000  ranges:
>>  MEM 0x0000000c10000000..0x0000000c1fffffff -> 0x00000000e0000000
>>   IO 0x0000000ff8010000..0x0000000ff801ffff -> 0x0000000000000000
>> /pcie@ffe250000: PCICSRBAR @ 0xff000000
>> setup_pci_atmu: end of DRAM 200000000
>> /pcie@ffe250000: Setup 64-bit PCI DMA window
>> /pcie@ffe250000: WARNING: Outbound window cfg leaves gaps in memory map.
>> Adjusting the memory map could reduce unnecessary bounce buffering.
>> /pcie@ffe250000: DMA window size is 0xe0000000
>> Found FSL PCI host bridge at 0x0000000ffe260000. Firmware bus number: 0->0
>> PCI host bridge /pcie@ffe260000  ranges:
>>  MEM 0x0000000c20000000..0x0000000c2fffffff -> 0x00000000e0000000
>>   IO 0x0000000ff8020000..0x0000000ff802ffff -> 0x0000000000000000
>> /pcie@ffe260000: PCICSRBAR @ 0xff000000
>> setup_pci_atmu: end of DRAM 200000000
>> /pcie@ffe260000: Setup 64-bit PCI DMA window
>> /pcie@ffe260000: WARNING: Outbound window cfg leaves gaps in memory map.
>> Adjusting the memory map could reduce unnecessary bounce buffering.
>> /pcie@ffe260000: DMA window size is 0xe0000000
>> Found FSL PCI host bridge at 0x0000000ffe270000. Firmware bus number: 0->0
>> PCI host bridge /pcie@ffe270000  ranges:
>>  MEM 0x0000000c30000000..0x0000000c3fffffff -> 0x00000000e0000000
>>   IO 0x0000000ff8030000..0x0000000ff803ffff -> 0x0000000000000000
>> /pcie@ffe270000: PCICSRBAR @ 0xff000000
>> setup_pci_atmu: end of DRAM 200000000
>> /pcie@ffe270000: Setup 64-bit PCI DMA window
>> /pcie@ffe270000: WARNING: Outbound window cfg leaves gaps in memory map.
>> Adjusting the memory map could reduce unnecessary bounce buffering.
>> /pcie@ffe270000: DMA window size is 0xe0000000
>> iommu: Adding device ff6000000.qman-portal to group 0
>> iommu: Adding device ff6004000.qman-portal to group 1
>> iommu: Adding device ff6008000.qman-portal to group 2
>> iommu: Adding device ff600c000.qman-portal to group 3
>> iommu: Adding device ff6010000.qman-portal to group 4
>> iommu: Adding device ff6014000.qman-portal to group 5
>> iommu: Adding device ff6018000.qman-portal to group 6
>> iommu: Adding device ff601c000.qman-portal to group 7
>> iommu: Adding device ff6020000.qman-portal to group 8
>> iommu: Adding device ff6024000.qman-portal to group 9
>> iommu: Adding device ffe100300.dma to group 10
>> iommu: Adding device ffe101300.dma to group 11
>> iommu: Adding device ffe114000.sdhc to group 12
>> iommu: Adding device ffe210000.usb to group 13
>> iommu: Adding device ffe211000.usb to group 14
>> iommu: Adding device ffe220000.sata to group 15
>> iommu: Adding device ffe221000.sata to group 16
>> iommu: Adding device ffe318000.qman to group 17
>> iommu: Adding device ffe31a000.bman to group 18
>> iommu: Adding device ffe240000.pcie to group 19
>> iommu: Adding device ffe250000.pcie to group 20
>> iommu: Adding device ffe260000.pcie to group 21
>> iommu: Adding device ffe270000.pcie to group 22
>> iommu: Adding device ffe140000.qe to group 23
>> software IO TLB: mapped [mem 0xfbfff000-0xfffff000] (64MB)
>> PCI: Probing PCI hardware
>> fsl-pci ffe240000.pcie: PCI host bridge to bus 0000:00
>> pci_bus 0000:00: root bus resource [io
>>  0x8000080000010000-0x800008000001ffff] (bus address [0x0000-0xffff])
>> pci_bus 0000:00: root bus resource [mem 0xc00000000-0xc0fffffff] (bus
>> address [0xe0000000-0xefffffff])
>> pci_bus 0000:00: root bus resource [bus 00]
>> iommu: Removing device ffe240000.pcie from group 19
>> iommu: Adding device 0000:00:00.0 to group 24
>> pci 0000:00:00.0: bridge configuration invalid ([bus 00-00]),
>> reconfiguring
>> pci 0000:00:00.0: PCI bridge to [bus 01-ff]
>> fsl-pci ffe250000.pcie: PCI host bridge to bus 0001:00
>> pci_bus 0001:00: root bus resource [io
>>  0x8000080000021000-0x8000080000030fff] (bus address [0x0000-0xffff])
>> pci_bus 0001:00: root bus resource [mem 0xc10000000-0xc1fffffff] (bus
>> address [0xe0000000-0xefffffff])
>> pci_bus 0001:00: root bus resource [bus 00-01]
>> iommu: Removing device ffe250000.pcie from group 20
>> iommu: Adding device 0001:00:00.0 to group 19
>> pci 0001:01:00.0: enabling Extended Tags
>> pci 0001:01:00.0: 4.000 Gb/s available PCIe bandwidth, limited by 5 GT/s
>> x1 link at 0001:00:00.0 (capable of 63.008 Gb/s with 8 GT/s x8 link)
>> iommu: Adding device 0001:01:00.0 to group 19
>> pci 0001:01:00.1: enabling Extended Tags
>> iommu: Adding device 0001:01:00.1 to group 19
>> pci 0001:00:00.0: PCI bridge to [bus 01-ff]
>> fsl-pci ffe260000.pcie: PCI host bridge to bus 0002:00
>> pci_bus 0002:00: root bus resource [io
>>  0x8000080000032000-0x8000080000041fff] (bus address [0x0000-0xffff])
>> pci_bus 0002:00: root bus resource [mem 0xc20000000-0xc2fffffff] (bus
>> address [0xe0000000-0xefffffff])
>> pci_bus 0002:00: root bus resource [bus 00]
>> iommu: Removing device ffe260000.pcie from group 21
>> iommu: Adding device 0002:00:00.0 to group 20
>> pci 0002:00:00.0: bridge configuration invalid ([bus 00-00]),
>> reconfiguring
>> pci 0002:00:00.0: PCI bridge to [bus 01-ff]
>> fsl-pci ffe270000.pcie: PCI host bridge to bus 0003:00
>> pci_bus 0003:00: root bus resource [io
>>  0x8000080000043000-0x8000080000052fff] (bus address [0x0000-0xffff])
>> pci_bus 0003:00: root bus resource [mem 0xc30000000-0xc3fffffff] (bus
>> address [0xe0000000-0xefffffff])
>> pci_bus 0003:00: root bus resource [bus 00]
>> iommu: Removing device ffe270000.pcie from group 22
>> iommu: Adding device 0003:00:00.0 to group 21
>> pci 0003:00:00.0: bridge configuration invalid ([bus 00-00]),
>> reconfiguring
>> pci 0003:00:00.0: PCI bridge to [bus 01-ff]
>> PCI: Cannot allocate resource region 0 of device 0000:00:00.0, will remap
>> PCI: Cannot allocate resource region 0 of device 0001:00:00.0, will remap
>> PCI: Cannot allocate resource region 2 of device 0001:01:00.0, will remap
>> PCI: Cannot allocate resource region 5 of device 0001:01:00.0, will remap
>> PCI: Cannot allocate resource region 6 of device 0001:01:00.0, will remap
>> PCI: Cannot allocate resource region 0 of device 0001:01:00.1, will remap
>> PCI: Cannot allocate resource region 0 of device 0002:00:00.0, will remap
>> PCI: Cannot allocate resource region 0 of device 0003:00:00.0, will remap
>> pci 0000:00:00.0: BAR 0: no space for [mem size 0x01000000]
>> pci 0000:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>> pci 0000:00:00.0: PCI bridge to [bus 01]
>> pci 0000:00:00.0:   bridge window [io
>>  0x8000080000010000-0x800008000001ffff]
>> pci 0000:00:00.0:   bridge window [mem 0xc00000000-0xc0fffffff]
>> pci_bus 0000:00: Some PCI device resources are unassigned, try booting
>> with pci=realloc
>> pci 0001:00:00.0: BAR 0: no space for [mem size 0x01000000]
>> pci 0001:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>> pci 0001:00:00.0: BAR 9: no space for [mem size 0x00200000 64bit pref]
>> pci 0001:00:00.0: BAR 9: failed to assign [mem size 0x00200000 64bit pref]
>> pci 0001:01:00.0: BAR 2: no space for [mem size 0x00200000 64bit pref]
>> pci 0001:01:00.0: BAR 2: failed to assign [mem size 0x00200000 64bit pref]
>> pci 0001:01:00.0: BAR 5: no space for [mem size 0x00040000]
>> pci 0001:01:00.0: BAR 5: failed to assign [mem size 0x00040000]
>> pci 0001:01:00.0: BAR 6: no space for [mem size 0x00020000 pref]
>> pci 0001:01:00.0: BAR 6: failed to assign [mem size 0x00020000 pref]
>> pci 0001:01:00.1: BAR 0: no space for [mem size 0x00004000 64bit]
>> pci 0001:01:00.1: BAR 0: failed to assign [mem size 0x00004000 64bit]
>> pci 0001:00:00.0: PCI bridge to [bus 01]
>> pci 0001:00:00.0:   bridge window [io
>>  0x8000080000021000-0x8000080000030fff]
>> pci 0001:00:00.0:   bridge window [mem 0xc10000000-0xc1fffffff]
>> pci_bus 0001:00: Some PCI device resources are unassigned, try booting
>> with pci=realloc
>> pci 0002:00:00.0: BAR 0: no space for [mem size 0x01000000]
>> pci 0002:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>> pci 0002:00:00.0: PCI bridge to [bus 01]
>> pci 0002:00:00.0:   bridge window [io
>>  0x8000080000032000-0x8000080000041fff]
>> pci 0002:00:00.0:   bridge window [mem 0xc20000000-0xc2fffffff]
>> pci_bus 0002:00: Some PCI device resources are unassigned, try booting
>> with pci=realloc
>> pci 0003:00:00.0: BAR 0: no space for [mem size 0x01000000]
>> pci 0003:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>> pci 0003:00:00.0: PCI bridge to [bus 01]
>> pci 0003:00:00.0:   bridge window [io
>>  0x8000080000043000-0x8000080000052fff]
>> pci 0003:00:00.0:   bridge window [mem 0xc30000000-0xc3fffffff]
>> pci_bus 0003:00: Some PCI device resources are unassigned, try booting
>> with pci=realloc
>>
>>
>>
>>
> _______________________________________________
> amd-gfx mailing listamd-gfx@lists.freedesktop.orghttps://lists.freedesktop.org/mailman/listinfo/amd-gfx
>
>
>

[-- Attachment #1.2: Type: text/html, Size: 40543 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [error] Drm -> amdgpu Unrecoverable Machine Check
  2019-12-02 16:31           ` Yusuf Altıparmak
@ 2019-12-03 10:31             ` Yusuf Altıparmak
  2019-12-03 12:07               ` Christian König
  0 siblings, 1 reply; 12+ messages in thread
From: Yusuf Altıparmak @ 2019-12-03 10:31 UTC (permalink / raw)
  To: Christian König; +Cc: amd-gfx


[-- Attachment #1.1: Type: text/plain, Size: 21088 bytes --]

Hello Christian,
My "setpci -s 0001:01:00.0 ECAP15+4.l ECAP15+8.l" output is;

0001f000

00000820

Regards.



Yusuf Altıparmak <yusufalti1997@gmail.com>, 2 Ara 2019 Pzt, 19:31 tarihinde
şunu yazdı:

> Most likely not. There is support for resizing the VRAM BAR, but usually
>> you can only make it larger and not smaller.
>> Please give me the output of "sudo setpci -s 0001:01:00.0 ECAP15+4.l
>> ECAP15+8.l" if you want to double check that.
>>
>
> Okay I'll try it tomorrow. What does the " sudo setpci -s 0001:01:00.0
> ECAP15+4.l ECAP15+8.l" command exactly do ?
>
>
>
>> Well you rather need to ask if anybody has sample PCIe configuration for
>> GPUs in general. That problem is not really E9171 related. You might want
>> to ask NXP for that maybe.
>> Sorry, no idea if that is correct or not. You need to ask NXP for help
>> with that.
>>
>>
> Okay no problem. At least I know what is the missing point now. The
> problem is probably because of the .dtsi and u-boot config files. Memory
> ranges are overwriting like you said. I'll ask nxp to take some sample PCIe
> configuration for GPUs.
>
> Thank you for your interest Christian.
> Regards .
>
>
>>
>> Am 02.12.19 um 14:32 schrieb Yusuf Altıparmak:
>>>
>>>
>>>> I attached my dts file.
>>>>
>>>> System is working fine when GPU is not plugged in.
>>>>
>>>> *This is the last console log before freeze:*
>>>> [drm] amdgpu kernel modesetting enabled.
>>>>
>>>> [drm] initializing kernel modesetting (POLARIS12 0x1002:0x6987
>>>> 0x1787:0x2389 0x80).
>>>> [drm] register mmio base: 0x20200000
>>>>
>>>> fsl-fman-port ffe488000.port fm1-gb0: renamed from eth0
>>>>
>>>> [drm] register mmio size: 262144
>>>>
>>>> [drm] add ip block number 0 <vi_common>
>>>>
>>>> [drm] add ip block number 1 <gmc_v8_0>
>>>>
>>>> [drm] add ip block number 2 <tonga_ih>
>>>>
>>>> [drm] add ip block number 3 <powerplay>
>>>>
>>>> [drm] add ip block number 4 <dm>
>>>>
>>>> [drm] add ip block number 5 <gfx_v8_0>
>>>>
>>>> [drm] add ip block number 6 <sdma_v3_0>
>>>>
>>>> [drm] add ip block number 7 <uvd_v6_0>
>>>>
>>>> [drm] add ip block number 8 <vce_v3_0>
>>>>
>>>> [drm] UVD is enabled in VM mode
>>>>
>>>> [drm] UVD ENC is enabled in VM mode
>>>>
>>>> [drm] VCE enabled in VM mode
>>>>
>>>> ATOM BIOS: 113-ER16BFC-001
>>>>
>>>> [drm] GPU posting now...
>>>>
>>>> Disabling lock debugging due to kernel taint
>>>>
>>>> Machine check in kernel mode.
>>>>
>>>> Caused by (from MCSR=a000): Load Error Report
>>>>
>>>> Guarded Load Error Report
>>>>
>>>> Kernel panic - not syncing: Unrecoverable Machine check
>>>>
>>>> CPU: 1 PID: 2023 Comm: udevd Tainted: G   M
>>>>  4.19.26+gc0c2141 #1
>>>> Call Trace:
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> amd-gfx mailing listamd-gfx@lists.freedesktop.orghttps://lists.freedesktop.org/mailman/listinfo/amd-gfx <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cchristian.koenig%40amd.com%7C9be244504efb4ff9cc0a08d7772c1630%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637108903547899069&sdata=OL4UKczXinKq5FM3Cid5kYcESkHAl0esXfOB83AUPU8%3D&reserved=0>
>>>>
>>>>
>>>>
>>>
>>> Christian König <ckoenig.leichtzumerken@gmail.com>, 2 Ara 2019 Pzt,
>>> 15:28 tarihinde şunu yazdı:
>>>
>>>> Hi Yusuf,
>>>>
>>>> Am 02.12.19 um 12:41 schrieb Yusuf Altıparmak:
>>>>
>>>> My embedded board is freezing when I put E9171 on PCIe. What is the
>>>> meaning of Unrecoverable Machine Check error about GPU?
>>>>
>>>>
>>>> Well see the explanation on Wikipedia for example:
>>>> https://en.wikipedia.org/wiki/Machine-check_exception
>>>> <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fen.wikipedia.org%2Fwiki%2FMachine-check_exception&data=02%7C01%7Cchristian.koenig%40amd.com%7C9be244504efb4ff9cc0a08d7772c1630%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637108903547909070&sdata=mBxI9LnRBO4iu5oyHc9OrrFj0K10SiOf5%2FztzmNedaI%3D&reserved=0>
>>>>
>>>> In general it means you have messed up something in your hardware
>>>> configuration.
>>>>
>>>> Could PCIe settings in .dts file cause this problem?
>>>>
>>>>
>>>> Possible, but rather unlikely. My best guess is that it is some problem
>>>> with the power supply.
>>>>
>>>> If it is, is there any sample PCIe configuration for E9171?
>>>>
>>>>
>>>> The E9171 is just a PCIe device, so the dtsi is actually rather
>>>> uninteresting. What we really need is a full dmesg and maybe lspci output
>>>> would help as well.
>>>>
>>>> Regards,
>>>> Christian.
>>>>
>>>
>>>
>>> Hi Christian,
>>>
>>> At first, I am using NXP T1042D4RDB-64B which has 256 MB PCIe buffer
>>> according to its. PCIe memory range was arranged to 256 MB in .dts file and
>>> in U-boot configuration file. Driver was giving error with exit code -12
>>> (OUT_OF_MEMORY). But I was able to reach the linux console.
>>>
>>> [    5.512922] [drm] amdgpu kernel modesetting enabled.
>>> [    5.517065] [drm] initializing kernel modesetting (POLARIS12
>>> 0x1002:0x6987 0x1787:0x2389 0x80).
>>> [    5.524507] amdgpu 0001:01:00.0: Fatal error during GPU init
>>> [    5.529296] amdgpu: probe of 0001:01:00.0 failed with error -12
>>>
>>> Then I canged 256 MB to 4GB in .dtsi and U-boot conf file. I also
>>> changed 64KB I/O size to 1MB . When I do this, I wasn't able to reach the
>>> linux console because board was freezing. But driver was successfull at
>>> this time. I already mentioned successfull driver console logs up.
>>>
>>> *this is lspci -v when GPU is plugged and Memory size is 256 MB.*
>>>
>>> root@t1042d4rdb-64b:~# lspci -v
>>> 0000:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev
>>> 11) (prog-if 00 [Normal decode])
>>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe240000
>>> /pcie@0
>>>         Flags: bus master, fast devsel, latency 0, IRQ 20
>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>>         Prefetchable memory behind bridge: None
>>>         Capabilities: [44] Power Management version 3
>>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>         Capabilities: [100] Advanced Error Reporting
>>>         Kernel driver in use: pcieport
>>>
>>> 0001:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev
>>> 11) (prog-if 00 [Normal decode])
>>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe250000
>>> /pcie@0
>>>         Flags: bus master, fast devsel, latency 0, IRQ 21
>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>>         Prefetchable memory behind bridge: None
>>>         Capabilities: [44] Power Management version 3
>>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>         Capabilities: [100] Advanced Error Reporting
>>>         Kernel driver in use: pcieport
>>>
>>> 0001:01:00.0 VGA compatible controller: Advanced Micro Devices, Inc.
>>> [AMD/ATI] Lexa [Radeon E9171 MCM] (rev 80) (prog-if 00 [VGA controller])
>>>         Subsystem: Hightech Information System Ltd. Device 2389
>>>         Flags: fast devsel, IRQ 41
>>>         Memory at c10000000 (64-bit, prefetchable) [size=256M]
>>>         Memory at <ignored> (64-bit, prefetchable)
>>>         I/O ports at 1100 [size=256]
>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>         Expansion ROM at <ignored> [disabled]
>>>         Capabilities: [48] Vendor Specific Information: Len=08 <?>
>>>         Capabilities: [50] Power Management version 3
>>>         Capabilities: [58] Express Legacy Endpoint, MSI 00
>>>         Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
>>>         Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1
>>> Len=010 <?>
>>>         Capabilities: [150] Advanced Error Reporting
>>>         Capabilities: [200] Resizable BAR <?>
>>>         Capabilities: [270] Secondary PCI Express <?>
>>>         Capabilities: [2b0] Address Translation Service (ATS)
>>>         Capabilities: [2c0] Page Request Interface (PRI)
>>>         Capabilities: [2d0] Process Address Space ID (PASID)
>>>         Capabilities: [320] Latency Tolerance Reporting
>>>         Capabilities: [328] Alternative Routing-ID Interpretation (ARI)
>>>         Capabilities: [370] L1 PM Substates
>>>         Kernel modules: amdgpu
>>>
>>> 0001:01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device
>>> aae0
>>>         Subsystem: Hightech Information System Ltd. Device aae0
>>>         Flags: bus master, fast devsel, latency 0, IRQ 17
>>>         Memory at <ignored> (64-bit, non-prefetchable)
>>>         Capabilities: [48] Vendor Specific Information: Len=08 <?>
>>>         Capabilities: [50] Power Management version 3
>>>         Capabilities: [58] Express Legacy Endpoint, MSI 00
>>>         Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
>>>         Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1
>>> Len=010 <?>
>>>         Capabilities: [150] Advanced Error Reporting
>>>         Capabilities: [328] Alternative Routing-ID Interpretation (ARI)
>>>
>>> 0002:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev
>>> 11) (prog-if 00 [Normal decode])
>>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe260000
>>> /pcie@0
>>>         Flags: bus master, fast devsel, latency 0, IRQ 22
>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>>         Prefetchable memory behind bridge: None
>>>         Capabilities: [44] Power Management version 3
>>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>         Capabilities: [100] Advanced Error Reporting
>>>         Kernel driver in use: pcieport
>>>
>>> 0003:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev
>>> 11) (prog-if 00 [Normal decode])
>>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe270000
>>> /pcie@0
>>>         Flags: bus master, fast devsel, latency 0, IRQ 23
>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>>         Prefetchable memory behind bridge: None
>>>         Capabilities: [44] Power Management version 3
>>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>         Capabilities: [100] Advanced Error Reporting
>>>         Kernel driver in use: pcieport
>>>
>>> *AND This is PCIe dmesg message when memory range is 256MB. It's also
>>> giving same message when memory range is arranged as 4GB;*
>>>
>>> PCI host bridge /pcie@ffe240000  ranges:
>>>  MEM 0x0000000c00000000..0x0000000c0fffffff -> 0x00000000e0000000
>>>   IO 0x0000000ff8000000..0x0000000ff800ffff -> 0x0000000000000000
>>> /pcie@ffe240000: PCICSRBAR @ 0xff000000
>>> setup_pci_atmu: end of DRAM 200000000
>>> /pcie@ffe240000: Setup 64-bit PCI DMA window
>>> /pcie@ffe240000: WARNING: Outbound window cfg leaves gaps in memory
>>> map. Adjusting the memory map could reduce unnecessary bounce buffering.
>>> /pcie@ffe240000: DMA window size is 0xe0000000
>>> Found FSL PCI host bridge at 0x0000000ffe250000. Firmware bus number:
>>> 0->1
>>> PCI host bridge /pcie@ffe250000  ranges:
>>>  MEM 0x0000000c10000000..0x0000000c1fffffff -> 0x00000000e0000000
>>>   IO 0x0000000ff8010000..0x0000000ff801ffff -> 0x0000000000000000
>>> /pcie@ffe250000: PCICSRBAR @ 0xff000000
>>> setup_pci_atmu: end of DRAM 200000000
>>> /pcie@ffe250000: Setup 64-bit PCI DMA window
>>> /pcie@ffe250000: WARNING: Outbound window cfg leaves gaps in memory
>>> map. Adjusting the memory map could reduce unnecessary bounce buffering.
>>> /pcie@ffe250000: DMA window size is 0xe0000000
>>> Found FSL PCI host bridge at 0x0000000ffe260000. Firmware bus number:
>>> 0->0
>>> PCI host bridge /pcie@ffe260000  ranges:
>>>  MEM 0x0000000c20000000..0x0000000c2fffffff -> 0x00000000e0000000
>>>   IO 0x0000000ff8020000..0x0000000ff802ffff -> 0x0000000000000000
>>> /pcie@ffe260000: PCICSRBAR @ 0xff000000
>>> setup_pci_atmu: end of DRAM 200000000
>>> /pcie@ffe260000: Setup 64-bit PCI DMA window
>>> /pcie@ffe260000: WARNING: Outbound window cfg leaves gaps in memory
>>> map. Adjusting the memory map could reduce unnecessary bounce buffering.
>>> /pcie@ffe260000: DMA window size is 0xe0000000
>>> Found FSL PCI host bridge at 0x0000000ffe270000. Firmware bus number:
>>> 0->0
>>> PCI host bridge /pcie@ffe270000  ranges:
>>>  MEM 0x0000000c30000000..0x0000000c3fffffff -> 0x00000000e0000000
>>>   IO 0x0000000ff8030000..0x0000000ff803ffff -> 0x0000000000000000
>>> /pcie@ffe270000: PCICSRBAR @ 0xff000000
>>> setup_pci_atmu: end of DRAM 200000000
>>> /pcie@ffe270000: Setup 64-bit PCI DMA window
>>> /pcie@ffe270000: WARNING: Outbound window cfg leaves gaps in memory
>>> map. Adjusting the memory map could reduce unnecessary bounce buffering.
>>> /pcie@ffe270000: DMA window size is 0xe0000000
>>> iommu: Adding device ff6000000.qman-portal to group 0
>>> iommu: Adding device ff6004000.qman-portal to group 1
>>> iommu: Adding device ff6008000.qman-portal to group 2
>>> iommu: Adding device ff600c000.qman-portal to group 3
>>> iommu: Adding device ff6010000.qman-portal to group 4
>>> iommu: Adding device ff6014000.qman-portal to group 5
>>> iommu: Adding device ff6018000.qman-portal to group 6
>>> iommu: Adding device ff601c000.qman-portal to group 7
>>> iommu: Adding device ff6020000.qman-portal to group 8
>>> iommu: Adding device ff6024000.qman-portal to group 9
>>> iommu: Adding device ffe100300.dma to group 10
>>> iommu: Adding device ffe101300.dma to group 11
>>> iommu: Adding device ffe114000.sdhc to group 12
>>> iommu: Adding device ffe210000.usb to group 13
>>> iommu: Adding device ffe211000.usb to group 14
>>> iommu: Adding device ffe220000.sata to group 15
>>> iommu: Adding device ffe221000.sata to group 16
>>> iommu: Adding device ffe318000.qman to group 17
>>> iommu: Adding device ffe31a000.bman to group 18
>>> iommu: Adding device ffe240000.pcie to group 19
>>> iommu: Adding device ffe250000.pcie to group 20
>>> iommu: Adding device ffe260000.pcie to group 21
>>> iommu: Adding device ffe270000.pcie to group 22
>>> iommu: Adding device ffe140000.qe to group 23
>>> software IO TLB: mapped [mem 0xfbfff000-0xfffff000] (64MB)
>>> PCI: Probing PCI hardware
>>> fsl-pci ffe240000.pcie: PCI host bridge to bus 0000:00
>>> pci_bus 0000:00: root bus resource [io
>>>  0x8000080000010000-0x800008000001ffff] (bus address [0x0000-0xffff])
>>> pci_bus 0000:00: root bus resource [mem 0xc00000000-0xc0fffffff] (bus
>>> address [0xe0000000-0xefffffff])
>>> pci_bus 0000:00: root bus resource [bus 00]
>>> iommu: Removing device ffe240000.pcie from group 19
>>> iommu: Adding device 0000:00:00.0 to group 24
>>> pci 0000:00:00.0: bridge configuration invalid ([bus 00-00]),
>>> reconfiguring
>>> pci 0000:00:00.0: PCI bridge to [bus 01-ff]
>>> fsl-pci ffe250000.pcie: PCI host bridge to bus 0001:00
>>> pci_bus 0001:00: root bus resource [io
>>>  0x8000080000021000-0x8000080000030fff] (bus address [0x0000-0xffff])
>>> pci_bus 0001:00: root bus resource [mem 0xc10000000-0xc1fffffff] (bus
>>> address [0xe0000000-0xefffffff])
>>> pci_bus 0001:00: root bus resource [bus 00-01]
>>> iommu: Removing device ffe250000.pcie from group 20
>>> iommu: Adding device 0001:00:00.0 to group 19
>>> pci 0001:01:00.0: enabling Extended Tags
>>> pci 0001:01:00.0: 4.000 Gb/s available PCIe bandwidth, limited by 5 GT/s
>>> x1 link at 0001:00:00.0 (capable of 63.008 Gb/s with 8 GT/s x8 link)
>>> iommu: Adding device 0001:01:00.0 to group 19
>>> pci 0001:01:00.1: enabling Extended Tags
>>> iommu: Adding device 0001:01:00.1 to group 19
>>> pci 0001:00:00.0: PCI bridge to [bus 01-ff]
>>> fsl-pci ffe260000.pcie: PCI host bridge to bus 0002:00
>>> pci_bus 0002:00: root bus resource [io
>>>  0x8000080000032000-0x8000080000041fff] (bus address [0x0000-0xffff])
>>> pci_bus 0002:00: root bus resource [mem 0xc20000000-0xc2fffffff] (bus
>>> address [0xe0000000-0xefffffff])
>>> pci_bus 0002:00: root bus resource [bus 00]
>>> iommu: Removing device ffe260000.pcie from group 21
>>> iommu: Adding device 0002:00:00.0 to group 20
>>> pci 0002:00:00.0: bridge configuration invalid ([bus 00-00]),
>>> reconfiguring
>>> pci 0002:00:00.0: PCI bridge to [bus 01-ff]
>>> fsl-pci ffe270000.pcie: PCI host bridge to bus 0003:00
>>> pci_bus 0003:00: root bus resource [io
>>>  0x8000080000043000-0x8000080000052fff] (bus address [0x0000-0xffff])
>>> pci_bus 0003:00: root bus resource [mem 0xc30000000-0xc3fffffff] (bus
>>> address [0xe0000000-0xefffffff])
>>> pci_bus 0003:00: root bus resource [bus 00]
>>> iommu: Removing device ffe270000.pcie from group 22
>>> iommu: Adding device 0003:00:00.0 to group 21
>>> pci 0003:00:00.0: bridge configuration invalid ([bus 00-00]),
>>> reconfiguring
>>> pci 0003:00:00.0: PCI bridge to [bus 01-ff]
>>> PCI: Cannot allocate resource region 0 of device 0000:00:00.0, will remap
>>> PCI: Cannot allocate resource region 0 of device 0001:00:00.0, will remap
>>> PCI: Cannot allocate resource region 2 of device 0001:01:00.0, will remap
>>> PCI: Cannot allocate resource region 5 of device 0001:01:00.0, will remap
>>> PCI: Cannot allocate resource region 6 of device 0001:01:00.0, will remap
>>> PCI: Cannot allocate resource region 0 of device 0001:01:00.1, will remap
>>> PCI: Cannot allocate resource region 0 of device 0002:00:00.0, will remap
>>> PCI: Cannot allocate resource region 0 of device 0003:00:00.0, will remap
>>> pci 0000:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>> pci 0000:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>> pci 0000:00:00.0: PCI bridge to [bus 01]
>>> pci 0000:00:00.0:   bridge window [io
>>>  0x8000080000010000-0x800008000001ffff]
>>> pci 0000:00:00.0:   bridge window [mem 0xc00000000-0xc0fffffff]
>>> pci_bus 0000:00: Some PCI device resources are unassigned, try booting
>>> with pci=realloc
>>> pci 0001:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>> pci 0001:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>> pci 0001:00:00.0: BAR 9: no space for [mem size 0x00200000 64bit pref]
>>> pci 0001:00:00.0: BAR 9: failed to assign [mem size 0x00200000 64bit
>>> pref]
>>> pci 0001:01:00.0: BAR 2: no space for [mem size 0x00200000 64bit pref]
>>> pci 0001:01:00.0: BAR 2: failed to assign [mem size 0x00200000 64bit
>>> pref]
>>> pci 0001:01:00.0: BAR 5: no space for [mem size 0x00040000]
>>> pci 0001:01:00.0: BAR 5: failed to assign [mem size 0x00040000]
>>> pci 0001:01:00.0: BAR 6: no space for [mem size 0x00020000 pref]
>>> pci 0001:01:00.0: BAR 6: failed to assign [mem size 0x00020000 pref]
>>> pci 0001:01:00.1: BAR 0: no space for [mem size 0x00004000 64bit]
>>> pci 0001:01:00.1: BAR 0: failed to assign [mem size 0x00004000 64bit]
>>> pci 0001:00:00.0: PCI bridge to [bus 01]
>>> pci 0001:00:00.0:   bridge window [io
>>>  0x8000080000021000-0x8000080000030fff]
>>> pci 0001:00:00.0:   bridge window [mem 0xc10000000-0xc1fffffff]
>>> pci_bus 0001:00: Some PCI device resources are unassigned, try booting
>>> with pci=realloc
>>> pci 0002:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>> pci 0002:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>> pci 0002:00:00.0: PCI bridge to [bus 01]
>>> pci 0002:00:00.0:   bridge window [io
>>>  0x8000080000032000-0x8000080000041fff]
>>> pci 0002:00:00.0:   bridge window [mem 0xc20000000-0xc2fffffff]
>>> pci_bus 0002:00: Some PCI device resources are unassigned, try booting
>>> with pci=realloc
>>> pci 0003:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>> pci 0003:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>> pci 0003:00:00.0: PCI bridge to [bus 01]
>>> pci 0003:00:00.0:   bridge window [io
>>>  0x8000080000043000-0x8000080000052fff]
>>> pci 0003:00:00.0:   bridge window [mem 0xc30000000-0xc3fffffff]
>>> pci_bus 0003:00: Some PCI device resources are unassigned, try booting
>>> with pci=realloc
>>>
>>>
>>>
>>>
>> _______________________________________________
>> amd-gfx mailing listamd-gfx@lists.freedesktop.orghttps://lists.freedesktop.org/mailman/listinfo/amd-gfx
>>
>>
>>

[-- Attachment #1.2: Type: text/html, Size: 41323 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [error] Drm -> amdgpu Unrecoverable Machine Check
  2019-12-03 10:31             ` Yusuf Altıparmak
@ 2019-12-03 12:07               ` Christian König
  2019-12-03 12:50                 ` Yusuf Altıparmak
  0 siblings, 1 reply; 12+ messages in thread
From: Christian König @ 2019-12-03 12:07 UTC (permalink / raw)
  To: Yusuf Altıparmak; +Cc: amd-gfx


[-- Attachment #1.1: Type: text/plain, Size: 27425 bytes --]

Hi Yusuf,

> 0001f000
Exactly as I thought. The hardware does support BAR resize, but 
unfortunately 256MB is already the minimum.

Sorry, but there isn't anything I could do from the GPU drivers point of 
view.

The only good news I have is that 256M+2M+512K+128K=260M address space 
should be enough for the GPU to work, maybe that makes things a bit simpler.

But you definitely got an interesting use case here :)

Regards,
Christian.

Am 03.12.19 um 11:31 schrieb Yusuf Altıparmak:
> Hello Christian,
> My "setpci -s 0001:01:00.0 ECAP15+4.l ECAP15+8.l" output is;
>
> 0001f000
> 00000820
>
> Regards.
>
>
>
> Yusuf Altıparmak <yusufalti1997@gmail.com 
> <mailto:yusufalti1997@gmail.com>>, 2 Ara 2019 Pzt, 19:31 tarihinde 
> şunu yazdı:
>
>         Most likely not. There is support for resizing the VRAM BAR,
>         but usually you can only make it larger and not smaller.
>         Please give me the output of "sudo setpci -s 0001:01:00.0
>         ECAP15+4.l ECAP15+8.l" if you want to double check that.
>
>
>     Okay I'll try it tomorrow. What does the " sudo setpci -s
>     0001:01:00.0 ECAP15+4.l ECAP15+8.l" command exactly do ?
>
>         Well you rather need to ask if anybody has sample PCIe
>         configuration for GPUs in general. That problem is not really
>         E9171 related. You might want to ask NXP for that maybe.
>         Sorry, no idea if that is correct or not. You need to ask NXP
>         for help with that.
>
>
>     Okay no problem. At least I know what is the missing point now.
>     The problem is probably because of the .dtsi and u-boot config
>     files. Memory ranges are overwriting like you said. I'll ask nxp
>     to take some sample PCIe configuration for GPUs.
>
>     Thank you for your interest Christian.
>     Regards .
>
>>
>>             Am 02.12.19 um 14:32 schrieb Yusuf Altıparmak:
>>>
>>>
>>>>                 I attached my dts file.
>>>>
>>>>                 System is working fine when GPU is not plugged in.
>>>>                 *
>>>>                 *
>>>>                 *This is the last console log before freeze:*
>>>>                 [drm] amdgpu kernel modesetting enabled.
>>>>                 [drm] initializing kernel modesetting (POLARIS12
>>>>                 0x1002:0x6987 0x1787:0x2389 0x80).
>>>>                 [drm] register mmio base: 0x20200000
>>>>                 fsl-fman-port ffe488000.port fm1-gb0: renamed from
>>>>                 eth0
>>>>                 [drm] register mmio size: 262144
>>>>                 [drm] add ip block number 0 <vi_common>
>>>>                 [drm] add ip block number 1 <gmc_v8_0>
>>>>                 [drm] add ip block number 2 <tonga_ih>
>>>>                 [drm] add ip block number 3 <powerplay>
>>>>                 [drm] add ip block number 4 <dm>
>>>>                 [drm] add ip block number 5 <gfx_v8_0>
>>>>                 [drm] add ip block number 6 <sdma_v3_0>
>>>>                 [drm] add ip block number 7 <uvd_v6_0>
>>>>                 [drm] add ip block number 8 <vce_v3_0>
>>>>                 [drm] UVD is enabled in VM mode
>>>>                 [drm] UVD ENC is enabled in VM mode
>>>>                 [drm] VCE enabled in VM mode
>>>>                 ATOM BIOS: 113-ER16BFC-001
>>>>                 [drm] GPU posting now...
>>>>                 Disabling lock debugging due to kernel taint
>>>>                 Machine check in kernel mode.
>>>>                 Caused by (from MCSR=a000): Load Error Report
>>>>                 Guarded Load Error Report
>>>>                 Kernel panic - not syncing: Unrecoverable Machine
>>>>                 check
>>>>                 CPU: 1 PID: 2023 Comm: udevd Tainted: G   M
>>>>                  4.19.26+gc0c2141 #1
>>>>                 Call Trace:
>>>
>>>>
>>>>
>>>>                 _______________________________________________
>>>>                 amd-gfx mailing list
>>>>                 amd-gfx@lists.freedesktop.org  <mailto:amd-gfx@lists.freedesktop.org>
>>>>                 https://lists.freedesktop.org/mailman/listinfo/amd-gfx  <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cchristian.koenig%40amd.com%7Ca865cbfc1c474712d8ad08d777dc0ab1%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637109659276684179&sdata=K7VRTAsSYAiTUVdf2ijSuEhFxZ755%2Bb0n8A3hOCjaAk%3D&reserved=0>
>>>
>>>
>>>
>>>             Christian König <ckoenig.leichtzumerken@gmail.com
>>>             <mailto:ckoenig.leichtzumerken@gmail.com>>, 2 Ara 2019
>>>             Pzt, 15:28 tarihinde şunu yazdı:
>>>
>>>                 Hi Yusuf,
>>>
>>>                 Am 02.12.19 um 12:41 schrieb Yusuf Altıparmak:
>>>>                 My embedded board is freezing when I put E9171 on
>>>>                 PCIe. What is the meaning of Unrecoverable Machine
>>>>                 Check error about GPU?
>>>
>>>                 Well see the explanation on Wikipedia for example:
>>>                 https://en.wikipedia.org/wiki/Machine-check_exception
>>>                 <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fen.wikipedia.org%2Fwiki%2FMachine-check_exception&data=02%7C01%7Cchristian.koenig%40amd.com%7Ca865cbfc1c474712d8ad08d777dc0ab1%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637109659276694169&sdata=fKC1Ly%2FY24tfgqC0XbMhm94xn6cor%2FanoUu%2B%2Bgk2E7Q%3D&reserved=0>
>>>
>>>                 In general it means you have messed up something in
>>>                 your hardware configuration.
>>>
>>>>                 Could PCIe settings in .dts file cause this problem?
>>>
>>>                 Possible, but rather unlikely. My best guess is that
>>>                 it is some problem with the power supply.
>>>
>>>>                 If it is, is there any sample PCIe configuration
>>>>                 for E9171?
>>>
>>>                 The E9171 is just a PCIe device, so the dtsi is
>>>                 actually rather uninteresting. What we really need
>>>                 is a full dmesg and maybe lspci output would help as
>>>                 well.
>>>
>>>                 Regards,
>>>                 Christian.
>>>
>>>
>>>
>>>             Hi Christian,
>>>
>>>             At first, I am using NXP T1042D4RDB-64B which has 256 MB
>>>             PCIe buffer according to its. PCIe memory range was
>>>             arranged to 256 MB in .dts file and in U-boot
>>>             configuration file. Driver was giving error with exit
>>>             code -12 (OUT_OF_MEMORY). But I was able to reach the
>>>             linux console.
>>>
>>>             [    5.512922] [drm] amdgpu kernel modesetting enabled.
>>>             [    5.517065] [drm] initializing kernel modesetting
>>>             (POLARIS12 0x1002:0x6987 0x1787:0x2389 0x80).
>>>             [    5.524507] amdgpu 0001:01:00.0: Fatal error during
>>>             GPU init
>>>             [    5.529296] amdgpu: probe of 0001:01:00.0 failed with
>>>             error -12
>>>
>>>             Then I canged 256 MB to 4GB in .dtsi and U-boot conf
>>>             file. I also changed 64KB I/O size to 1MB . When I do
>>>             this, I wasn't able to reach the linux console because
>>>             board was freezing. But driver was successfull at this
>>>             time. I already mentioned successfull driver console
>>>             logs up.
>>>
>>>             *this is lspci -v when GPU is plugged and Memory size is
>>>             256 MB.*
>>>
>>>             root@t1042d4rdb-64b:~# lspci -v
>>>             0000:00:00.0 PCI bridge: Freescale Semiconductor Inc
>>>             Device 0824 (rev 11) (prog-if 00 [Normal decode])
>>>                     Device tree node:
>>>             /sys/firmware/devicetree/base/pcie@ffe240000/pcie@0
>>>                     Flags: bus master, fast devsel, latency 0, IRQ 20
>>>                     Memory at <ignored> (32-bit, non-prefetchable)
>>>                     Bus: primary=00, secondary=01, subordinate=01,
>>>             sec-latency=0
>>>                     I/O behind bridge: 00000000-0000ffff [size=64K]
>>>                     Memory behind bridge: e0000000-efffffff [size=256M]
>>>                     Prefetchable memory behind bridge: None
>>>                     Capabilities: [44] Power Management version 3
>>>                     Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>                     Capabilities: [100] Advanced Error Reporting
>>>                     Kernel driver in use: pcieport
>>>
>>>             0001:00:00.0 PCI bridge: Freescale Semiconductor Inc
>>>             Device 0824 (rev 11) (prog-if 00 [Normal decode])
>>>                     Device tree node:
>>>             /sys/firmware/devicetree/base/pcie@ffe250000/pcie@0
>>>                     Flags: bus master, fast devsel, latency 0, IRQ 21
>>>                     Memory at <ignored> (32-bit, non-prefetchable)
>>>                     Bus: primary=00, secondary=01, subordinate=01,
>>>             sec-latency=0
>>>                     I/O behind bridge: 00000000-0000ffff [size=64K]
>>>                     Memory behind bridge: e0000000-efffffff [size=256M]
>>>                     Prefetchable memory behind bridge: None
>>>                     Capabilities: [44] Power Management version 3
>>>                     Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>                     Capabilities: [100] Advanced Error Reporting
>>>                     Kernel driver in use: pcieport
>>>
>>>             0001:01:00.0 VGA compatible controller: Advanced Micro
>>>             Devices, Inc. [AMD/ATI] Lexa [Radeon E9171 MCM] (rev 80)
>>>             (prog-if 00 [VGA controller])
>>>                     Subsystem: Hightech Information System Ltd.
>>>             Device 2389
>>>                     Flags: fast devsel, IRQ 41
>>>                     Memory at c10000000 (64-bit, prefetchable)
>>>             [size=256M]
>>>                     Memory at <ignored> (64-bit, prefetchable)
>>>                     I/O ports at 1100 [size=256]
>>>                     Memory at <ignored> (32-bit, non-prefetchable)
>>>                     Expansion ROM at <ignored> [disabled]
>>>                     Capabilities: [48] Vendor Specific Information:
>>>             Len=08 <?>
>>>                     Capabilities: [50] Power Management version 3
>>>                     Capabilities: [58] Express Legacy Endpoint, MSI 00
>>>                     Capabilities: [a0] MSI: Enable- Count=1/1
>>>             Maskable- 64bit+
>>>                     Capabilities: [100] Vendor Specific Information:
>>>             ID=0001 Rev=1 Len=010 <?>
>>>                     Capabilities: [150] Advanced Error Reporting
>>>                     Capabilities: [200] Resizable BAR <?>
>>>                     Capabilities: [270] Secondary PCI Express <?>
>>>                     Capabilities: [2b0] Address Translation Service
>>>             (ATS)
>>>                     Capabilities: [2c0] Page Request Interface (PRI)
>>>                     Capabilities: [2d0] Process Address Space ID (PASID)
>>>                     Capabilities: [320] Latency Tolerance Reporting
>>>                     Capabilities: [328] Alternative Routing-ID
>>>             Interpretation (ARI)
>>>                     Capabilities: [370] L1 PM Substates
>>>                     Kernel modules: amdgpu
>>>
>>>             0001:01:00.1 Audio device: Advanced Micro Devices, Inc.
>>>             [AMD/ATI] Device aae0
>>>                     Subsystem: Hightech Information System Ltd.
>>>             Device aae0
>>>                     Flags: bus master, fast devsel, latency 0, IRQ 17
>>>                     Memory at <ignored> (64-bit, non-prefetchable)
>>>                     Capabilities: [48] Vendor Specific Information:
>>>             Len=08 <?>
>>>                     Capabilities: [50] Power Management version 3
>>>                     Capabilities: [58] Express Legacy Endpoint, MSI 00
>>>                     Capabilities: [a0] MSI: Enable- Count=1/1
>>>             Maskable- 64bit+
>>>                     Capabilities: [100] Vendor Specific Information:
>>>             ID=0001 Rev=1 Len=010 <?>
>>>                     Capabilities: [150] Advanced Error Reporting
>>>                     Capabilities: [328] Alternative Routing-ID
>>>             Interpretation (ARI)
>>>
>>>             0002:00:00.0 PCI bridge: Freescale Semiconductor Inc
>>>             Device 0824 (rev 11) (prog-if 00 [Normal decode])
>>>                     Device tree node:
>>>             /sys/firmware/devicetree/base/pcie@ffe260000/pcie@0
>>>                     Flags: bus master, fast devsel, latency 0, IRQ 22
>>>                     Memory at <ignored> (32-bit, non-prefetchable)
>>>                     Bus: primary=00, secondary=01, subordinate=01,
>>>             sec-latency=0
>>>                     I/O behind bridge: 00000000-0000ffff [size=64K]
>>>                     Memory behind bridge: e0000000-efffffff [size=256M]
>>>                     Prefetchable memory behind bridge: None
>>>                     Capabilities: [44] Power Management version 3
>>>                     Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>                     Capabilities: [100] Advanced Error Reporting
>>>                     Kernel driver in use: pcieport
>>>
>>>             0003:00:00.0 PCI bridge: Freescale Semiconductor Inc
>>>             Device 0824 (rev 11) (prog-if 00 [Normal decode])
>>>                     Device tree node:
>>>             /sys/firmware/devicetree/base/pcie@ffe270000/pcie@0
>>>                     Flags: bus master, fast devsel, latency 0, IRQ 23
>>>                     Memory at <ignored> (32-bit, non-prefetchable)
>>>                     Bus: primary=00, secondary=01, subordinate=01,
>>>             sec-latency=0
>>>                     I/O behind bridge: 00000000-0000ffff [size=64K]
>>>                     Memory behind bridge: e0000000-efffffff [size=256M]
>>>                     Prefetchable memory behind bridge: None
>>>                     Capabilities: [44] Power Management version 3
>>>                     Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>                     Capabilities: [100] Advanced Error Reporting
>>>                     Kernel driver in use: pcieport
>>>
>>>             *AND This is PCIe dmesg message when memory range is
>>>             256MB. It's also giving same message when memory range
>>>             is arranged as 4GB;*
>>>
>>>             PCI host bridge /pcie@ffe240000  ranges:
>>>              MEM 0x0000000c00000000..0x0000000c0fffffff ->
>>>             0x00000000e0000000
>>>               IO 0x0000000ff8000000..0x0000000ff800ffff ->
>>>             0x0000000000000000
>>>             /pcie@ffe240000: PCICSRBAR @ 0xff000000
>>>             setup_pci_atmu: end of DRAM 200000000
>>>             /pcie@ffe240000: Setup 64-bit PCI DMA window
>>>             /pcie@ffe240000: WARNING: Outbound window cfg leaves
>>>             gaps in memory map. Adjusting the memory map could
>>>             reduce unnecessary bounce buffering.
>>>             /pcie@ffe240000: DMA window size is 0xe0000000
>>>             Found FSL PCI host bridge at 0x0000000ffe250000.
>>>             Firmware bus number: 0->1
>>>             PCI host bridge /pcie@ffe250000  ranges:
>>>              MEM 0x0000000c10000000..0x0000000c1fffffff ->
>>>             0x00000000e0000000
>>>               IO 0x0000000ff8010000..0x0000000ff801ffff ->
>>>             0x0000000000000000
>>>             /pcie@ffe250000: PCICSRBAR @ 0xff000000
>>>             setup_pci_atmu: end of DRAM 200000000
>>>             /pcie@ffe250000: Setup 64-bit PCI DMA window
>>>             /pcie@ffe250000: WARNING: Outbound window cfg leaves
>>>             gaps in memory map. Adjusting the memory map could
>>>             reduce unnecessary bounce buffering.
>>>             /pcie@ffe250000: DMA window size is 0xe0000000
>>>             Found FSL PCI host bridge at 0x0000000ffe260000.
>>>             Firmware bus number: 0->0
>>>             PCI host bridge /pcie@ffe260000  ranges:
>>>              MEM 0x0000000c20000000..0x0000000c2fffffff ->
>>>             0x00000000e0000000
>>>               IO 0x0000000ff8020000..0x0000000ff802ffff ->
>>>             0x0000000000000000
>>>             /pcie@ffe260000: PCICSRBAR @ 0xff000000
>>>             setup_pci_atmu: end of DRAM 200000000
>>>             /pcie@ffe260000: Setup 64-bit PCI DMA window
>>>             /pcie@ffe260000: WARNING: Outbound window cfg leaves
>>>             gaps in memory map. Adjusting the memory map could
>>>             reduce unnecessary bounce buffering.
>>>             /pcie@ffe260000: DMA window size is 0xe0000000
>>>             Found FSL PCI host bridge at 0x0000000ffe270000.
>>>             Firmware bus number: 0->0
>>>             PCI host bridge /pcie@ffe270000  ranges:
>>>              MEM 0x0000000c30000000..0x0000000c3fffffff ->
>>>             0x00000000e0000000
>>>               IO 0x0000000ff8030000..0x0000000ff803ffff ->
>>>             0x0000000000000000
>>>             /pcie@ffe270000: PCICSRBAR @ 0xff000000
>>>             setup_pci_atmu: end of DRAM 200000000
>>>             /pcie@ffe270000: Setup 64-bit PCI DMA window
>>>             /pcie@ffe270000: WARNING: Outbound window cfg leaves
>>>             gaps in memory map. Adjusting the memory map could
>>>             reduce unnecessary bounce buffering.
>>>             /pcie@ffe270000: DMA window size is 0xe0000000
>>>             iommu: Adding device ff6000000.qman-portal to group 0
>>>             iommu: Adding device ff6004000.qman-portal to group 1
>>>             iommu: Adding device ff6008000.qman-portal to group 2
>>>             iommu: Adding device ff600c000.qman-portal to group 3
>>>             iommu: Adding device ff6010000.qman-portal to group 4
>>>             iommu: Adding device ff6014000.qman-portal to group 5
>>>             iommu: Adding device ff6018000.qman-portal to group 6
>>>             iommu: Adding device ff601c000.qman-portal to group 7
>>>             iommu: Adding device ff6020000.qman-portal to group 8
>>>             iommu: Adding device ff6024000.qman-portal to group 9
>>>             iommu: Adding device ffe100300.dma to group 10
>>>             iommu: Adding device ffe101300.dma to group 11
>>>             iommu: Adding device ffe114000.sdhc to group 12
>>>             iommu: Adding device ffe210000.usb to group 13
>>>             iommu: Adding device ffe211000.usb to group 14
>>>             iommu: Adding device ffe220000.sata to group 15
>>>             iommu: Adding device ffe221000.sata to group 16
>>>             iommu: Adding device ffe318000.qman to group 17
>>>             iommu: Adding device ffe31a000.bman to group 18
>>>             iommu: Adding device ffe240000.pcie to group 19
>>>             iommu: Adding device ffe250000.pcie to group 20
>>>             iommu: Adding device ffe260000.pcie to group 21
>>>             iommu: Adding device ffe270000.pcie to group 22
>>>             iommu: Adding device ffe140000.qe to group 23
>>>             software IO TLB: mapped [mem 0xfbfff000-0xfffff000] (64MB)
>>>             PCI: Probing PCI hardware
>>>             fsl-pci ffe240000.pcie: PCI host bridge to bus 0000:00
>>>             pci_bus 0000:00: root bus resource [io
>>>              0x8000080000010000-0x800008000001ffff] (bus address
>>>             [0x0000-0xffff])
>>>             pci_bus 0000:00: root bus resource [mem
>>>             0xc00000000-0xc0fffffff] (bus address
>>>             [0xe0000000-0xefffffff])
>>>             pci_bus 0000:00: root bus resource [bus 00]
>>>             iommu: Removing device ffe240000.pcie from group 19
>>>             iommu: Adding device 0000:00:00.0 to group 24
>>>             pci 0000:00:00.0: bridge configuration invalid ([bus
>>>             00-00]), reconfiguring
>>>             pci 0000:00:00.0: PCI bridge to [bus 01-ff]
>>>             fsl-pci ffe250000.pcie: PCI host bridge to bus 0001:00
>>>             pci_bus 0001:00: root bus resource [io
>>>              0x8000080000021000-0x8000080000030fff] (bus address
>>>             [0x0000-0xffff])
>>>             pci_bus 0001:00: root bus resource [mem
>>>             0xc10000000-0xc1fffffff] (bus address
>>>             [0xe0000000-0xefffffff])
>>>             pci_bus 0001:00: root bus resource [bus 00-01]
>>>             iommu: Removing device ffe250000.pcie from group 20
>>>             iommu: Adding device 0001:00:00.0 to group 19
>>>             pci 0001:01:00.0: enabling Extended Tags
>>>             pci 0001:01:00.0: 4.000 Gb/s available PCIe bandwidth,
>>>             limited by 5 GT/s x1 link at 0001:00:00.0 (capable of
>>>             63.008 Gb/s with 8 GT/s x8 link)
>>>             iommu: Adding device 0001:01:00.0 to group 19
>>>             pci 0001:01:00.1: enabling Extended Tags
>>>             iommu: Adding device 0001:01:00.1 to group 19
>>>             pci 0001:00:00.0: PCI bridge to [bus 01-ff]
>>>             fsl-pci ffe260000.pcie: PCI host bridge to bus 0002:00
>>>             pci_bus 0002:00: root bus resource [io
>>>              0x8000080000032000-0x8000080000041fff] (bus address
>>>             [0x0000-0xffff])
>>>             pci_bus 0002:00: root bus resource [mem
>>>             0xc20000000-0xc2fffffff] (bus address
>>>             [0xe0000000-0xefffffff])
>>>             pci_bus 0002:00: root bus resource [bus 00]
>>>             iommu: Removing device ffe260000.pcie from group 21
>>>             iommu: Adding device 0002:00:00.0 to group 20
>>>             pci 0002:00:00.0: bridge configuration invalid ([bus
>>>             00-00]), reconfiguring
>>>             pci 0002:00:00.0: PCI bridge to [bus 01-ff]
>>>             fsl-pci ffe270000.pcie: PCI host bridge to bus 0003:00
>>>             pci_bus 0003:00: root bus resource [io
>>>              0x8000080000043000-0x8000080000052fff] (bus address
>>>             [0x0000-0xffff])
>>>             pci_bus 0003:00: root bus resource [mem
>>>             0xc30000000-0xc3fffffff] (bus address
>>>             [0xe0000000-0xefffffff])
>>>             pci_bus 0003:00: root bus resource [bus 00]
>>>             iommu: Removing device ffe270000.pcie from group 22
>>>             iommu: Adding device 0003:00:00.0 to group 21
>>>             pci 0003:00:00.0: bridge configuration invalid ([bus
>>>             00-00]), reconfiguring
>>>             pci 0003:00:00.0: PCI bridge to [bus 01-ff]
>>>             PCI: Cannot allocate resource region 0 of device
>>>             0000:00:00.0, will remap
>>>             PCI: Cannot allocate resource region 0 of device
>>>             0001:00:00.0, will remap
>>>             PCI: Cannot allocate resource region 2 of device
>>>             0001:01:00.0, will remap
>>>             PCI: Cannot allocate resource region 5 of device
>>>             0001:01:00.0, will remap
>>>             PCI: Cannot allocate resource region 6 of device
>>>             0001:01:00.0, will remap
>>>             PCI: Cannot allocate resource region 0 of device
>>>             0001:01:00.1, will remap
>>>             PCI: Cannot allocate resource region 0 of device
>>>             0002:00:00.0, will remap
>>>             PCI: Cannot allocate resource region 0 of device
>>>             0003:00:00.0, will remap
>>>             pci 0000:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>>             pci 0000:00:00.0: BAR 0: failed to assign [mem size
>>>             0x01000000]
>>>             pci 0000:00:00.0: PCI bridge to [bus 01]
>>>             pci 0000:00:00.0:   bridge window [io
>>>              0x8000080000010000-0x800008000001ffff]
>>>             pci 0000:00:00.0:   bridge window [mem
>>>             0xc00000000-0xc0fffffff]
>>>             pci_bus 0000:00: Some PCI device resources are
>>>             unassigned, try booting with pci=realloc
>>>             pci 0001:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>>             pci 0001:00:00.0: BAR 0: failed to assign [mem size
>>>             0x01000000]
>>>             pci 0001:00:00.0: BAR 9: no space for [mem size
>>>             0x00200000 64bit pref]
>>>             pci 0001:00:00.0: BAR 9: failed to assign [mem size
>>>             0x00200000 64bit pref]
>>>             pci 0001:01:00.0: BAR 2: no space for [mem size
>>>             0x00200000 64bit pref]
>>>             pci 0001:01:00.0: BAR 2: failed to assign [mem size
>>>             0x00200000 64bit pref]
>>>             pci 0001:01:00.0: BAR 5: no space for [mem size 0x00040000]
>>>             pci 0001:01:00.0: BAR 5: failed to assign [mem size
>>>             0x00040000]
>>>             pci 0001:01:00.0: BAR 6: no space for [mem size
>>>             0x00020000 pref]
>>>             pci 0001:01:00.0: BAR 6: failed to assign [mem size
>>>             0x00020000 pref]
>>>             pci 0001:01:00.1: BAR 0: no space for [mem size
>>>             0x00004000 64bit]
>>>             pci 0001:01:00.1: BAR 0: failed to assign [mem size
>>>             0x00004000 64bit]
>>>             pci 0001:00:00.0: PCI bridge to [bus 01]
>>>             pci 0001:00:00.0:   bridge window [io
>>>              0x8000080000021000-0x8000080000030fff]
>>>             pci 0001:00:00.0:   bridge window [mem
>>>             0xc10000000-0xc1fffffff]
>>>             pci_bus 0001:00: Some PCI device resources are
>>>             unassigned, try booting with pci=realloc
>>>             pci 0002:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>>             pci 0002:00:00.0: BAR 0: failed to assign [mem size
>>>             0x01000000]
>>>             pci 0002:00:00.0: PCI bridge to [bus 01]
>>>             pci 0002:00:00.0:   bridge window [io
>>>              0x8000080000032000-0x8000080000041fff]
>>>             pci 0002:00:00.0:   bridge window [mem
>>>             0xc20000000-0xc2fffffff]
>>>             pci_bus 0002:00: Some PCI device resources are
>>>             unassigned, try booting with pci=realloc
>>>             pci 0003:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>>             pci 0003:00:00.0: BAR 0: failed to assign [mem size
>>>             0x01000000]
>>>             pci 0003:00:00.0: PCI bridge to [bus 01]
>>>             pci 0003:00:00.0:   bridge window [io
>>>              0x8000080000043000-0x8000080000052fff]
>>>             pci 0003:00:00.0:   bridge window [mem
>>>             0xc30000000-0xc3fffffff]
>>>             pci_bus 0003:00: Some PCI device resources are
>>>             unassigned, try booting with pci=realloc
>>>
>>>
>>
>>
>>         _______________________________________________
>>         amd-gfx mailing list
>>         amd-gfx@lists.freedesktop.org  <mailto:amd-gfx@lists.freedesktop.org>
>>         https://lists.freedesktop.org/mailman/listinfo/amd-gfx  <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cchristian.koenig%40amd.com%7Ca865cbfc1c474712d8ad08d777dc0ab1%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637109659276704165&sdata=HJnUkrzWe1YKpMmQh4Ejn4T%2FaIS2UN2TOBnb9xM5BJQ%3D&reserved=0>
>


[-- Attachment #1.2: Type: text/html, Size: 67148 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [error] Drm -> amdgpu Unrecoverable Machine Check
  2019-12-03 12:07               ` Christian König
@ 2019-12-03 12:50                 ` Yusuf Altıparmak
  2019-12-03 17:29                   ` Christian König
  0 siblings, 1 reply; 12+ messages in thread
From: Yusuf Altıparmak @ 2019-12-03 12:50 UTC (permalink / raw)
  To: Christian König; +Cc: amd-gfx


[-- Attachment #1.1: Type: text/plain, Size: 23154 bytes --]

Hi Christian,

> 0001f000
>
> Exactly as I thought. The hardware does support BAR resize, but
> unfortunately 256MB is already the minimum.
>
> Sorry, but there isn't anything I could do from the GPU drivers point of
> view.
>

Yes unfortunately there is nothing remained to about GPU side.

The only good news I have is that 256M+2M+512K+128K=260M address space
> should be enough for the GPU to work, maybe that makes things a bit simpler.
>
>
Right now I am trying to increase MMIO size config to 512 MB, I hope that
should help me. By the way I wonder how can I get at least VGA output from
GPU. Maybe I can get a text console on screen or something like X server?
Do you have any recommendations? I tried this GPU with my own Ubuntu 18.04
and its special driver. Everything worked fine.



> But you definitely got an interesting use case here :)
>

This is the worlds the most interesting use case I think. I can't increase
MMIO size because U-boot freezes. I can't decrease it to 256 MB again
because driver does not accept :D. I am just wondering, does modern gaming
motherboards have more than 4GB PCIe buffer for this job ? :D


Am 03.12.19 um 11:31 schrieb Yusuf Altıparmak:
>
> Hello Christian,
> My "setpci -s 0001:01:00.0 ECAP15+4.l ECAP15+8.l" output is;
>
> 0001f000
>
> 00000820
>
> Regards.
>
>
>
> Yusuf Altıparmak <yusufalti1997@gmail.com>, 2 Ara 2019 Pzt, 19:31
> tarihinde şunu yazdı:
>
>> Most likely not. There is support for resizing the VRAM BAR, but usually
>>> you can only make it larger and not smaller.
>>> Please give me the output of "sudo setpci -s 0001:01:00.0 ECAP15+4.l
>>> ECAP15+8.l" if you want to double check that.
>>>
>>
>> Okay I'll try it tomorrow. What does the " sudo setpci -s 0001:01:00.0
>> ECAP15+4.l ECAP15+8.l" command exactly do ?
>>
>>
>>
>>> Well you rather need to ask if anybody has sample PCIe configuration for
>>> GPUs in general. That problem is not really E9171 related. You might want
>>> to ask NXP for that maybe.
>>> Sorry, no idea if that is correct or not. You need to ask NXP for help
>>> with that.
>>>
>>>
>> Okay no problem. At least I know what is the missing point now. The
>> problem is probably because of the .dtsi and u-boot config files. Memory
>> ranges are overwriting like you said. I'll ask nxp to take some sample PCIe
>> configuration for GPUs.
>>
>> Thank you for your interest Christian.
>> Regards .
>>
>>
>>>
>>> Am 02.12.19 um 14:32 schrieb Yusuf Altıparmak:
>>>>
>>>>
>>>>> I attached my dts file.
>>>>>
>>>>> System is working fine when GPU is not plugged in.
>>>>>
>>>>> *This is the last console log before freeze:*
>>>>> [drm] amdgpu kernel modesetting enabled.
>>>>>
>>>>> [drm] initializing kernel modesetting (POLARIS12 0x1002:0x6987
>>>>> 0x1787:0x2389 0x80).
>>>>> [drm] register mmio base: 0x20200000
>>>>>
>>>>> fsl-fman-port ffe488000.port fm1-gb0: renamed from eth0
>>>>>
>>>>> [drm] register mmio size: 262144
>>>>>
>>>>> [drm] add ip block number 0 <vi_common>
>>>>>
>>>>> [drm] add ip block number 1 <gmc_v8_0>
>>>>>
>>>>> [drm] add ip block number 2 <tonga_ih>
>>>>>
>>>>> [drm] add ip block number 3 <powerplay>
>>>>>
>>>>> [drm] add ip block number 4 <dm>
>>>>>
>>>>> [drm] add ip block number 5 <gfx_v8_0>
>>>>>
>>>>> [drm] add ip block number 6 <sdma_v3_0>
>>>>>
>>>>> [drm] add ip block number 7 <uvd_v6_0>
>>>>>
>>>>> [drm] add ip block number 8 <vce_v3_0>
>>>>>
>>>>> [drm] UVD is enabled in VM mode
>>>>>
>>>>> [drm] UVD ENC is enabled in VM mode
>>>>>
>>>>> [drm] VCE enabled in VM mode
>>>>>
>>>>> ATOM BIOS: 113-ER16BFC-001
>>>>>
>>>>> [drm] GPU posting now...
>>>>>
>>>>> Disabling lock debugging due to kernel taint
>>>>>
>>>>> Machine check in kernel mode.
>>>>>
>>>>> Caused by (from MCSR=a000): Load Error Report
>>>>>
>>>>> Guarded Load Error Report
>>>>>
>>>>> Kernel panic - not syncing: Unrecoverable Machine check
>>>>>
>>>>> CPU: 1 PID: 2023 Comm: udevd Tainted: G   M
>>>>>  4.19.26+gc0c2141 #1
>>>>> Call Trace:
>>>>>
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> amd-gfx mailing listamd-gfx@lists.freedesktop.orghttps://lists.freedesktop.org/mailman/listinfo/amd-gfx <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cchristian.koenig%40amd.com%7Ca865cbfc1c474712d8ad08d777dc0ab1%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637109659276684179&sdata=K7VRTAsSYAiTUVdf2ijSuEhFxZ755%2Bb0n8A3hOCjaAk%3D&reserved=0>
>>>>>
>>>>>
>>>>>
>>>>
>>>> Christian König <ckoenig.leichtzumerken@gmail.com>, 2 Ara 2019 Pzt,
>>>> 15:28 tarihinde şunu yazdı:
>>>>
>>>>> Hi Yusuf,
>>>>>
>>>>> Am 02.12.19 um 12:41 schrieb Yusuf Altıparmak:
>>>>>
>>>>> My embedded board is freezing when I put E9171 on PCIe. What is the
>>>>> meaning of Unrecoverable Machine Check error about GPU?
>>>>>
>>>>>
>>>>> Well see the explanation on Wikipedia for example:
>>>>> https://en.wikipedia.org/wiki/Machine-check_exception
>>>>> <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fen.wikipedia.org%2Fwiki%2FMachine-check_exception&data=02%7C01%7Cchristian.koenig%40amd.com%7Ca865cbfc1c474712d8ad08d777dc0ab1%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637109659276694169&sdata=fKC1Ly%2FY24tfgqC0XbMhm94xn6cor%2FanoUu%2B%2Bgk2E7Q%3D&reserved=0>
>>>>>
>>>>> In general it means you have messed up something in your hardware
>>>>> configuration.
>>>>>
>>>>> Could PCIe settings in .dts file cause this problem?
>>>>>
>>>>>
>>>>> Possible, but rather unlikely. My best guess is that it is some
>>>>> problem with the power supply.
>>>>>
>>>>> If it is, is there any sample PCIe configuration for E9171?
>>>>>
>>>>>
>>>>> The E9171 is just a PCIe device, so the dtsi is actually rather
>>>>> uninteresting. What we really need is a full dmesg and maybe lspci output
>>>>> would help as well.
>>>>>
>>>>> Regards,
>>>>> Christian.
>>>>>
>>>>
>>>>
>>>> Hi Christian,
>>>>
>>>> At first, I am using NXP T1042D4RDB-64B which has 256 MB PCIe buffer
>>>> according to its. PCIe memory range was arranged to 256 MB in .dts file and
>>>> in U-boot configuration file. Driver was giving error with exit code -12
>>>> (OUT_OF_MEMORY). But I was able to reach the linux console.
>>>>
>>>> [    5.512922] [drm] amdgpu kernel modesetting enabled.
>>>> [    5.517065] [drm] initializing kernel modesetting (POLARIS12
>>>> 0x1002:0x6987 0x1787:0x2389 0x80).
>>>> [    5.524507] amdgpu 0001:01:00.0: Fatal error during GPU init
>>>> [    5.529296] amdgpu: probe of 0001:01:00.0 failed with error -12
>>>>
>>>> Then I canged 256 MB to 4GB in .dtsi and U-boot conf file. I also
>>>> changed 64KB I/O size to 1MB . When I do this, I wasn't able to reach the
>>>> linux console because board was freezing. But driver was successfull at
>>>> this time. I already mentioned successfull driver console logs up.
>>>>
>>>> *this is lspci -v when GPU is plugged and Memory size is 256 MB.*
>>>>
>>>> root@t1042d4rdb-64b:~# lspci -v
>>>> 0000:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev
>>>> 11) (prog-if 00 [Normal decode])
>>>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe240000
>>>> /pcie@0
>>>>         Flags: bus master, fast devsel, latency 0, IRQ 20
>>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>>>         Prefetchable memory behind bridge: None
>>>>         Capabilities: [44] Power Management version 3
>>>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>>         Capabilities: [100] Advanced Error Reporting
>>>>         Kernel driver in use: pcieport
>>>>
>>>> 0001:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev
>>>> 11) (prog-if 00 [Normal decode])
>>>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe250000
>>>> /pcie@0
>>>>         Flags: bus master, fast devsel, latency 0, IRQ 21
>>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>>>         Prefetchable memory behind bridge: None
>>>>         Capabilities: [44] Power Management version 3
>>>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>>         Capabilities: [100] Advanced Error Reporting
>>>>         Kernel driver in use: pcieport
>>>>
>>>> 0001:01:00.0 VGA compatible controller: Advanced Micro Devices, Inc.
>>>> [AMD/ATI] Lexa [Radeon E9171 MCM] (rev 80) (prog-if 00 [VGA controller])
>>>>         Subsystem: Hightech Information System Ltd. Device 2389
>>>>         Flags: fast devsel, IRQ 41
>>>>         Memory at c10000000 (64-bit, prefetchable) [size=256M]
>>>>         Memory at <ignored> (64-bit, prefetchable)
>>>>         I/O ports at 1100 [size=256]
>>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>>         Expansion ROM at <ignored> [disabled]
>>>>         Capabilities: [48] Vendor Specific Information: Len=08 <?>
>>>>         Capabilities: [50] Power Management version 3
>>>>         Capabilities: [58] Express Legacy Endpoint, MSI 00
>>>>         Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
>>>>         Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1
>>>> Len=010 <?>
>>>>         Capabilities: [150] Advanced Error Reporting
>>>>         Capabilities: [200] Resizable BAR <?>
>>>>         Capabilities: [270] Secondary PCI Express <?>
>>>>         Capabilities: [2b0] Address Translation Service (ATS)
>>>>         Capabilities: [2c0] Page Request Interface (PRI)
>>>>         Capabilities: [2d0] Process Address Space ID (PASID)
>>>>         Capabilities: [320] Latency Tolerance Reporting
>>>>         Capabilities: [328] Alternative Routing-ID Interpretation (ARI)
>>>>         Capabilities: [370] L1 PM Substates
>>>>         Kernel modules: amdgpu
>>>>
>>>> 0001:01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI]
>>>> Device aae0
>>>>         Subsystem: Hightech Information System Ltd. Device aae0
>>>>         Flags: bus master, fast devsel, latency 0, IRQ 17
>>>>         Memory at <ignored> (64-bit, non-prefetchable)
>>>>         Capabilities: [48] Vendor Specific Information: Len=08 <?>
>>>>         Capabilities: [50] Power Management version 3
>>>>         Capabilities: [58] Express Legacy Endpoint, MSI 00
>>>>         Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
>>>>         Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1
>>>> Len=010 <?>
>>>>         Capabilities: [150] Advanced Error Reporting
>>>>         Capabilities: [328] Alternative Routing-ID Interpretation (ARI)
>>>>
>>>> 0002:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev
>>>> 11) (prog-if 00 [Normal decode])
>>>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe260000
>>>> /pcie@0
>>>>         Flags: bus master, fast devsel, latency 0, IRQ 22
>>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>>>         Prefetchable memory behind bridge: None
>>>>         Capabilities: [44] Power Management version 3
>>>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>>         Capabilities: [100] Advanced Error Reporting
>>>>         Kernel driver in use: pcieport
>>>>
>>>> 0003:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev
>>>> 11) (prog-if 00 [Normal decode])
>>>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe270000
>>>> /pcie@0
>>>>         Flags: bus master, fast devsel, latency 0, IRQ 23
>>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>>>         Prefetchable memory behind bridge: None
>>>>         Capabilities: [44] Power Management version 3
>>>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>>         Capabilities: [100] Advanced Error Reporting
>>>>         Kernel driver in use: pcieport
>>>>
>>>> *AND This is PCIe dmesg message when memory range is 256MB. It's also
>>>> giving same message when memory range is arranged as 4GB;*
>>>>
>>>> PCI host bridge /pcie@ffe240000  ranges:
>>>>  MEM 0x0000000c00000000..0x0000000c0fffffff -> 0x00000000e0000000
>>>>   IO 0x0000000ff8000000..0x0000000ff800ffff -> 0x0000000000000000
>>>> /pcie@ffe240000: PCICSRBAR @ 0xff000000
>>>> setup_pci_atmu: end of DRAM 200000000
>>>> /pcie@ffe240000: Setup 64-bit PCI DMA window
>>>> /pcie@ffe240000: WARNING: Outbound window cfg leaves gaps in memory
>>>> map. Adjusting the memory map could reduce unnecessary bounce buffering.
>>>> /pcie@ffe240000: DMA window size is 0xe0000000
>>>> Found FSL PCI host bridge at 0x0000000ffe250000. Firmware bus number:
>>>> 0->1
>>>> PCI host bridge /pcie@ffe250000  ranges:
>>>>  MEM 0x0000000c10000000..0x0000000c1fffffff -> 0x00000000e0000000
>>>>   IO 0x0000000ff8010000..0x0000000ff801ffff -> 0x0000000000000000
>>>> /pcie@ffe250000: PCICSRBAR @ 0xff000000
>>>> setup_pci_atmu: end of DRAM 200000000
>>>> /pcie@ffe250000: Setup 64-bit PCI DMA window
>>>> /pcie@ffe250000: WARNING: Outbound window cfg leaves gaps in memory
>>>> map. Adjusting the memory map could reduce unnecessary bounce buffering.
>>>> /pcie@ffe250000: DMA window size is 0xe0000000
>>>> Found FSL PCI host bridge at 0x0000000ffe260000. Firmware bus number:
>>>> 0->0
>>>> PCI host bridge /pcie@ffe260000  ranges:
>>>>  MEM 0x0000000c20000000..0x0000000c2fffffff -> 0x00000000e0000000
>>>>   IO 0x0000000ff8020000..0x0000000ff802ffff -> 0x0000000000000000
>>>> /pcie@ffe260000: PCICSRBAR @ 0xff000000
>>>> setup_pci_atmu: end of DRAM 200000000
>>>> /pcie@ffe260000: Setup 64-bit PCI DMA window
>>>> /pcie@ffe260000: WARNING: Outbound window cfg leaves gaps in memory
>>>> map. Adjusting the memory map could reduce unnecessary bounce buffering.
>>>> /pcie@ffe260000: DMA window size is 0xe0000000
>>>> Found FSL PCI host bridge at 0x0000000ffe270000. Firmware bus number:
>>>> 0->0
>>>> PCI host bridge /pcie@ffe270000  ranges:
>>>>  MEM 0x0000000c30000000..0x0000000c3fffffff -> 0x00000000e0000000
>>>>   IO 0x0000000ff8030000..0x0000000ff803ffff -> 0x0000000000000000
>>>> /pcie@ffe270000: PCICSRBAR @ 0xff000000
>>>> setup_pci_atmu: end of DRAM 200000000
>>>> /pcie@ffe270000: Setup 64-bit PCI DMA window
>>>> /pcie@ffe270000: WARNING: Outbound window cfg leaves gaps in memory
>>>> map. Adjusting the memory map could reduce unnecessary bounce buffering.
>>>> /pcie@ffe270000: DMA window size is 0xe0000000
>>>> iommu: Adding device ff6000000.qman-portal to group 0
>>>> iommu: Adding device ff6004000.qman-portal to group 1
>>>> iommu: Adding device ff6008000.qman-portal to group 2
>>>> iommu: Adding device ff600c000.qman-portal to group 3
>>>> iommu: Adding device ff6010000.qman-portal to group 4
>>>> iommu: Adding device ff6014000.qman-portal to group 5
>>>> iommu: Adding device ff6018000.qman-portal to group 6
>>>> iommu: Adding device ff601c000.qman-portal to group 7
>>>> iommu: Adding device ff6020000.qman-portal to group 8
>>>> iommu: Adding device ff6024000.qman-portal to group 9
>>>> iommu: Adding device ffe100300.dma to group 10
>>>> iommu: Adding device ffe101300.dma to group 11
>>>> iommu: Adding device ffe114000.sdhc to group 12
>>>> iommu: Adding device ffe210000.usb to group 13
>>>> iommu: Adding device ffe211000.usb to group 14
>>>> iommu: Adding device ffe220000.sata to group 15
>>>> iommu: Adding device ffe221000.sata to group 16
>>>> iommu: Adding device ffe318000.qman to group 17
>>>> iommu: Adding device ffe31a000.bman to group 18
>>>> iommu: Adding device ffe240000.pcie to group 19
>>>> iommu: Adding device ffe250000.pcie to group 20
>>>> iommu: Adding device ffe260000.pcie to group 21
>>>> iommu: Adding device ffe270000.pcie to group 22
>>>> iommu: Adding device ffe140000.qe to group 23
>>>> software IO TLB: mapped [mem 0xfbfff000-0xfffff000] (64MB)
>>>> PCI: Probing PCI hardware
>>>> fsl-pci ffe240000.pcie: PCI host bridge to bus 0000:00
>>>> pci_bus 0000:00: root bus resource [io
>>>>  0x8000080000010000-0x800008000001ffff] (bus address [0x0000-0xffff])
>>>> pci_bus 0000:00: root bus resource [mem 0xc00000000-0xc0fffffff] (bus
>>>> address [0xe0000000-0xefffffff])
>>>> pci_bus 0000:00: root bus resource [bus 00]
>>>> iommu: Removing device ffe240000.pcie from group 19
>>>> iommu: Adding device 0000:00:00.0 to group 24
>>>> pci 0000:00:00.0: bridge configuration invalid ([bus 00-00]),
>>>> reconfiguring
>>>> pci 0000:00:00.0: PCI bridge to [bus 01-ff]
>>>> fsl-pci ffe250000.pcie: PCI host bridge to bus 0001:00
>>>> pci_bus 0001:00: root bus resource [io
>>>>  0x8000080000021000-0x8000080000030fff] (bus address [0x0000-0xffff])
>>>> pci_bus 0001:00: root bus resource [mem 0xc10000000-0xc1fffffff] (bus
>>>> address [0xe0000000-0xefffffff])
>>>> pci_bus 0001:00: root bus resource [bus 00-01]
>>>> iommu: Removing device ffe250000.pcie from group 20
>>>> iommu: Adding device 0001:00:00.0 to group 19
>>>> pci 0001:01:00.0: enabling Extended Tags
>>>> pci 0001:01:00.0: 4.000 Gb/s available PCIe bandwidth, limited by 5
>>>> GT/s x1 link at 0001:00:00.0 (capable of 63.008 Gb/s with 8 GT/s x8 link)
>>>> iommu: Adding device 0001:01:00.0 to group 19
>>>> pci 0001:01:00.1: enabling Extended Tags
>>>> iommu: Adding device 0001:01:00.1 to group 19
>>>> pci 0001:00:00.0: PCI bridge to [bus 01-ff]
>>>> fsl-pci ffe260000.pcie: PCI host bridge to bus 0002:00
>>>> pci_bus 0002:00: root bus resource [io
>>>>  0x8000080000032000-0x8000080000041fff] (bus address [0x0000-0xffff])
>>>> pci_bus 0002:00: root bus resource [mem 0xc20000000-0xc2fffffff] (bus
>>>> address [0xe0000000-0xefffffff])
>>>> pci_bus 0002:00: root bus resource [bus 00]
>>>> iommu: Removing device ffe260000.pcie from group 21
>>>> iommu: Adding device 0002:00:00.0 to group 20
>>>> pci 0002:00:00.0: bridge configuration invalid ([bus 00-00]),
>>>> reconfiguring
>>>> pci 0002:00:00.0: PCI bridge to [bus 01-ff]
>>>> fsl-pci ffe270000.pcie: PCI host bridge to bus 0003:00
>>>> pci_bus 0003:00: root bus resource [io
>>>>  0x8000080000043000-0x8000080000052fff] (bus address [0x0000-0xffff])
>>>> pci_bus 0003:00: root bus resource [mem 0xc30000000-0xc3fffffff] (bus
>>>> address [0xe0000000-0xefffffff])
>>>> pci_bus 0003:00: root bus resource [bus 00]
>>>> iommu: Removing device ffe270000.pcie from group 22
>>>> iommu: Adding device 0003:00:00.0 to group 21
>>>> pci 0003:00:00.0: bridge configuration invalid ([bus 00-00]),
>>>> reconfiguring
>>>> pci 0003:00:00.0: PCI bridge to [bus 01-ff]
>>>> PCI: Cannot allocate resource region 0 of device 0000:00:00.0, will
>>>> remap
>>>> PCI: Cannot allocate resource region 0 of device 0001:00:00.0, will
>>>> remap
>>>> PCI: Cannot allocate resource region 2 of device 0001:01:00.0, will
>>>> remap
>>>> PCI: Cannot allocate resource region 5 of device 0001:01:00.0, will
>>>> remap
>>>> PCI: Cannot allocate resource region 6 of device 0001:01:00.0, will
>>>> remap
>>>> PCI: Cannot allocate resource region 0 of device 0001:01:00.1, will
>>>> remap
>>>> PCI: Cannot allocate resource region 0 of device 0002:00:00.0, will
>>>> remap
>>>> PCI: Cannot allocate resource region 0 of device 0003:00:00.0, will
>>>> remap
>>>> pci 0000:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>>> pci 0000:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>>> pci 0000:00:00.0: PCI bridge to [bus 01]
>>>> pci 0000:00:00.0:   bridge window [io
>>>>  0x8000080000010000-0x800008000001ffff]
>>>> pci 0000:00:00.0:   bridge window [mem 0xc00000000-0xc0fffffff]
>>>> pci_bus 0000:00: Some PCI device resources are unassigned, try booting
>>>> with pci=realloc
>>>> pci 0001:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>>> pci 0001:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>>> pci 0001:00:00.0: BAR 9: no space for [mem size 0x00200000 64bit pref]
>>>> pci 0001:00:00.0: BAR 9: failed to assign [mem size 0x00200000 64bit
>>>> pref]
>>>> pci 0001:01:00.0: BAR 2: no space for [mem size 0x00200000 64bit pref]
>>>> pci 0001:01:00.0: BAR 2: failed to assign [mem size 0x00200000 64bit
>>>> pref]
>>>> pci 0001:01:00.0: BAR 5: no space for [mem size 0x00040000]
>>>> pci 0001:01:00.0: BAR 5: failed to assign [mem size 0x00040000]
>>>> pci 0001:01:00.0: BAR 6: no space for [mem size 0x00020000 pref]
>>>> pci 0001:01:00.0: BAR 6: failed to assign [mem size 0x00020000 pref]
>>>> pci 0001:01:00.1: BAR 0: no space for [mem size 0x00004000 64bit]
>>>> pci 0001:01:00.1: BAR 0: failed to assign [mem size 0x00004000 64bit]
>>>> pci 0001:00:00.0: PCI bridge to [bus 01]
>>>> pci 0001:00:00.0:   bridge window [io
>>>>  0x8000080000021000-0x8000080000030fff]
>>>> pci 0001:00:00.0:   bridge window [mem 0xc10000000-0xc1fffffff]
>>>> pci_bus 0001:00: Some PCI device resources are unassigned, try booting
>>>> with pci=realloc
>>>> pci 0002:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>>> pci 0002:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>>> pci 0002:00:00.0: PCI bridge to [bus 01]
>>>> pci 0002:00:00.0:   bridge window [io
>>>>  0x8000080000032000-0x8000080000041fff]
>>>> pci 0002:00:00.0:   bridge window [mem 0xc20000000-0xc2fffffff]
>>>> pci_bus 0002:00: Some PCI device resources are unassigned, try booting
>>>> with pci=realloc
>>>> pci 0003:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>>> pci 0003:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>>> pci 0003:00:00.0: PCI bridge to [bus 01]
>>>> pci 0003:00:00.0:   bridge window [io
>>>>  0x8000080000043000-0x8000080000052fff]
>>>> pci 0003:00:00.0:   bridge window [mem 0xc30000000-0xc3fffffff]
>>>> pci_bus 0003:00: Some PCI device resources are unassigned, try booting
>>>> with pci=realloc
>>>>
>>>>
>>>>
>>>>
>>> _______________________________________________
>>> amd-gfx mailing listamd-gfx@lists.freedesktop.orghttps://lists.freedesktop.org/mailman/listinfo/amd-gfx <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cchristian.koenig%40amd.com%7Ca865cbfc1c474712d8ad08d777dc0ab1%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637109659276704165&sdata=HJnUkrzWe1YKpMmQh4Ejn4T%2FaIS2UN2TOBnb9xM5BJQ%3D&reserved=0>
>>>
>>>
>>>
>

[-- Attachment #1.2: Type: text/html, Size: 64040 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [error] Drm -> amdgpu Unrecoverable Machine Check
  2019-12-03 12:50                 ` Yusuf Altıparmak
@ 2019-12-03 17:29                   ` Christian König
       [not found]                     ` <CAGzVRjwVdgojwsM1pOYftcXf6WeQN_46QwfQ1gvVUd6duyXiEQ@mail.gmail.com>
  0 siblings, 1 reply; 12+ messages in thread
From: Christian König @ 2019-12-03 17:29 UTC (permalink / raw)
  To: Yusuf Altıparmak; +Cc: amd-gfx


[-- Attachment #1.1: Type: text/plain, Size: 31839 bytes --]

> Right now I am trying to increase MMIO size config to 512 MB
What you could try as well is to use the size 320MB for the MMIO. Those 
ranges usually don't need to be a power of two (only the BARs itself are 
a power of two) and this way it might even be easier to fit everything 
together.

> By the way I wonder how can I get at least VGA output from GPU. Maybe 
> I can get a text console on screen or something like X server? Do you 
> have any recommendations?
What could maybe work is VGA emulation, which essentially means text 
only. But no guarantee for that this really works as expected.

> I am just wondering, does modern gaming motherboards have more than 
> 4GB PCIe buffer for this job ?
They don't, resources are dynamically assigned instead.

See on x86 you usually have 1GB 32-bit address space where the BIOS 
shuffles all the mandatory devices it sees at boot time into.

Then when the motherboard has multiple PEG slots the BIOS also 
configures a 64-bit address space which is usually rather huge 
(256GB-1TB). Since the the VRAM and the doorbell BAR are 64bit BARs on 
the GPU they can be mapped into that as well.

This way you can easily have 10 GPUs connected to your CPU.

The problem you have here is that U-config doesn't do this resource 
assignment automatically and you need to configure it manually.

Regards,
Christian.

Am 03.12.19 um 13:50 schrieb Yusuf Altıparmak:
>
> Hi Christian,
>
>>     0001f000
>     Exactly as I thought. The hardware does support BAR resize, but
>     unfortunately 256MB is already the minimum.
>
>     Sorry, but there isn't anything I could do from the GPU drivers
>     point of view.
>
>
> Yes unfortunately there is nothing remained to about GPU side.
>
>     The only good news I have is that 256M+2M+512K+128K=260M address
>     space should be enough for the GPU to work, maybe that makes
>     things a bit simpler.
>
>
> Right now I am trying to increase MMIO size config to 512 MB, I hope 
> that should help me. By the way I wonder how can I get at least VGA 
> output from GPU. Maybe I can get a text console on screen or something 
> like X server? Do you have any recommendations? I tried this GPU with 
> my own Ubuntu 18.04 and its special driver. Everything worked fine.
>
>     But you definitely got an interesting use case here :)
>
>
> This is the worlds the most interesting use case I think. I can't 
> increase MMIO size because U-boot freezes. I can't decrease it to 256 
> MB again because driver does not accept :D. I am just wondering, does 
> modern gaming motherboards have more than 4GB PCIe buffer for this job 
> ? :D
>
>
>     Am 03.12.19 um 11:31 schrieb Yusuf Altıparmak:
>>     Hello Christian,
>>     My "setpci -s 0001:01:00.0 ECAP15+4.l ECAP15+8.l" output is;
>>
>>     0001f000
>>     00000820
>>
>>     Regards.
>>
>>
>>
>>     Yusuf Altıparmak <yusufalti1997@gmail.com
>>     <mailto:yusufalti1997@gmail.com>>, 2 Ara 2019 Pzt, 19:31
>>     tarihinde şunu yazdı:
>>
>>             Most likely not. There is support for resizing the VRAM
>>             BAR, but usually you can only make it larger and not smaller.
>>             Please give me the output of "sudo setpci -s 0001:01:00.0
>>             ECAP15+4.l ECAP15+8.l" if you want to double check that.
>>
>>
>>         Okay I'll try it tomorrow. What does the " sudo setpci -s
>>         0001:01:00.0 ECAP15+4.l ECAP15+8.l" command exactly do ?
>>
>>             Well you rather need to ask if anybody has sample PCIe
>>             configuration for GPUs in general. That problem is not
>>             really E9171 related. You might want to ask NXP for that
>>             maybe.
>>             Sorry, no idea if that is correct or not. You need to ask
>>             NXP for help with that.
>>
>>
>>         Okay no problem. At least I know what is the missing point
>>         now. The problem is probably because of the .dtsi and u-boot
>>         config files. Memory ranges are overwriting like you said.
>>         I'll ask nxp to take some sample PCIe configuration for GPUs.
>>
>>         Thank you for your interest Christian.
>>         Regards .
>>
>>>
>>>                 Am 02.12.19 um 14:32 schrieb Yusuf Altıparmak:
>>>>
>>>>
>>>>>                     I attached my dts file.
>>>>>
>>>>>                     System is working fine when GPU is not plugged in.
>>>>>                     *
>>>>>                     *
>>>>>                     *This is the last console log before freeze:*
>>>>>                     [drm] amdgpu kernel modesetting enabled.
>>>>>                     [drm] initializing kernel modesetting
>>>>>                     (POLARIS12 0x1002:0x6987 0x1787:0x2389 0x80).
>>>>>                     [drm] register mmio base: 0x20200000
>>>>>                     fsl-fman-port ffe488000.port fm1-gb0: renamed
>>>>>                     from eth0
>>>>>                     [drm] register mmio size: 262144
>>>>>                     [drm] add ip block number 0 <vi_common>
>>>>>                     [drm] add ip block number 1 <gmc_v8_0>
>>>>>                     [drm] add ip block number 2 <tonga_ih>
>>>>>                     [drm] add ip block number 3 <powerplay>
>>>>>                     [drm] add ip block number 4 <dm>
>>>>>                     [drm] add ip block number 5 <gfx_v8_0>
>>>>>                     [drm] add ip block number 6 <sdma_v3_0>
>>>>>                     [drm] add ip block number 7 <uvd_v6_0>
>>>>>                     [drm] add ip block number 8 <vce_v3_0>
>>>>>                     [drm] UVD is enabled in VM mode
>>>>>                     [drm] UVD ENC is enabled in VM mode
>>>>>                     [drm] VCE enabled in VM mode
>>>>>                     ATOM BIOS: 113-ER16BFC-001
>>>>>                     [drm] GPU posting now...
>>>>>                     Disabling lock debugging due to kernel taint
>>>>>                     Machine check in kernel mode.
>>>>>                     Caused by (from MCSR=a000): Load Error Report
>>>>>                     Guarded Load Error Report
>>>>>                     Kernel panic - not syncing: Unrecoverable
>>>>>                     Machine check
>>>>>                     CPU: 1 PID: 2023 Comm: udevd Tainted: G   M
>>>>>                      4.19.26+gc0c2141 #1
>>>>>                     Call Trace:
>>>>
>>>>>
>>>>>
>>>>>                     _______________________________________________
>>>>>                     amd-gfx mailing list
>>>>>                     amd-gfx@lists.freedesktop.org  <mailto:amd-gfx@lists.freedesktop.org>
>>>>>                     https://lists.freedesktop.org/mailman/listinfo/amd-gfx  <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cchristian.koenig%40amd.com%7C561d9d3206ff46d338ad08d777ef717c%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637109742603335427&sdata=DstGsDW0X7ennQoF1vg%2FYLGAStF4p1dAeMFhzsAFc1I%3D&reserved=0>
>>>>
>>>>
>>>>
>>>>                 Christian König <ckoenig.leichtzumerken@gmail.com
>>>>                 <mailto:ckoenig.leichtzumerken@gmail.com>>, 2 Ara
>>>>                 2019 Pzt, 15:28 tarihinde şunu yazdı:
>>>>
>>>>                     Hi Yusuf,
>>>>
>>>>                     Am 02.12.19 um 12:41 schrieb Yusuf Altıparmak:
>>>>>                     My embedded board is freezing when I put E9171
>>>>>                     on PCIe. What is the meaning of Unrecoverable
>>>>>                     Machine Check error about GPU?
>>>>
>>>>                     Well see the explanation on Wikipedia for
>>>>                     example:
>>>>                     https://en.wikipedia.org/wiki/Machine-check_exception
>>>>                     <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fen.wikipedia.org%2Fwiki%2FMachine-check_exception&data=02%7C01%7Cchristian.koenig%40amd.com%7C561d9d3206ff46d338ad08d777ef717c%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637109742603345420&sdata=NtHiZmL7me4JRZR4L9KHZzpD8Zcn6caURVvtiDSgHsE%3D&reserved=0>
>>>>
>>>>                     In general it means you have messed up
>>>>                     something in your hardware configuration.
>>>>
>>>>>                     Could PCIe settings in .dts file cause this
>>>>>                     problem?
>>>>
>>>>                     Possible, but rather unlikely. My best guess is
>>>>                     that it is some problem with the power supply.
>>>>
>>>>>                     If it is, is there any sample PCIe
>>>>>                     configuration for E9171?
>>>>
>>>>                     The E9171 is just a PCIe device, so the dtsi is
>>>>                     actually rather uninteresting. What we really
>>>>                     need is a full dmesg and maybe lspci output
>>>>                     would help as well.
>>>>
>>>>                     Regards,
>>>>                     Christian.
>>>>
>>>>
>>>>
>>>>                 Hi Christian,
>>>>
>>>>                 At first, I am using NXP T1042D4RDB-64B which has
>>>>                 256 MB PCIe buffer according to its. PCIe memory
>>>>                 range was arranged to 256 MB in .dts file and in
>>>>                 U-boot configuration file. Driver was giving error
>>>>                 with exit code -12 (OUT_OF_MEMORY). But I was able
>>>>                 to reach the linux console.
>>>>
>>>>                 [    5.512922] [drm] amdgpu kernel modesetting enabled.
>>>>                 [    5.517065] [drm] initializing kernel
>>>>                 modesetting (POLARIS12 0x1002:0x6987 0x1787:0x2389
>>>>                 0x80).
>>>>                 [    5.524507] amdgpu 0001:01:00.0: Fatal error
>>>>                 during GPU init
>>>>                 [    5.529296] amdgpu: probe of 0001:01:00.0 failed
>>>>                 with error -12
>>>>
>>>>                 Then I canged 256 MB to 4GB in .dtsi and U-boot
>>>>                 conf file. I also changed 64KB I/O size to 1MB .
>>>>                 When I do this, I wasn't able to reach the linux
>>>>                 console because board was freezing. But driver was
>>>>                 successfull at this time. I already mentioned
>>>>                 successfull driver console logs up.
>>>>
>>>>                 *this is lspci -v when GPU is plugged and Memory
>>>>                 size is 256 MB.*
>>>>
>>>>                 root@t1042d4rdb-64b:~# lspci -v
>>>>                 0000:00:00.0 PCI bridge: Freescale Semiconductor
>>>>                 Inc Device 0824 (rev 11) (prog-if 00 [Normal decode])
>>>>                         Device tree node:
>>>>                 /sys/firmware/devicetree/base/pcie@ffe240000/pcie@0
>>>>                         Flags: bus master, fast devsel, latency 0,
>>>>                 IRQ 20
>>>>                         Memory at <ignored> (32-bit, non-prefetchable)
>>>>                         Bus: primary=00, secondary=01,
>>>>                 subordinate=01, sec-latency=0
>>>>                         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>>                         Memory behind bridge: e0000000-efffffff
>>>>                 [size=256M]
>>>>                         Prefetchable memory behind bridge: None
>>>>                 Capabilities: [44] Power Management version 3
>>>>                 Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>>                 Capabilities: [100] Advanced Error Reporting
>>>>                         Kernel driver in use: pcieport
>>>>
>>>>                 0001:00:00.0 PCI bridge: Freescale Semiconductor
>>>>                 Inc Device 0824 (rev 11) (prog-if 00 [Normal decode])
>>>>                         Device tree node:
>>>>                 /sys/firmware/devicetree/base/pcie@ffe250000/pcie@0
>>>>                         Flags: bus master, fast devsel, latency 0,
>>>>                 IRQ 21
>>>>                         Memory at <ignored> (32-bit, non-prefetchable)
>>>>                         Bus: primary=00, secondary=01,
>>>>                 subordinate=01, sec-latency=0
>>>>                         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>>                         Memory behind bridge: e0000000-efffffff
>>>>                 [size=256M]
>>>>                         Prefetchable memory behind bridge: None
>>>>                 Capabilities: [44] Power Management version 3
>>>>                 Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>>                 Capabilities: [100] Advanced Error Reporting
>>>>                         Kernel driver in use: pcieport
>>>>
>>>>                 0001:01:00.0 VGA compatible controller: Advanced
>>>>                 Micro Devices, Inc. [AMD/ATI] Lexa [Radeon E9171
>>>>                 MCM] (rev 80) (prog-if 00 [VGA controller])
>>>>                         Subsystem: Hightech Information System Ltd.
>>>>                 Device 2389
>>>>                         Flags: fast devsel, IRQ 41
>>>>                         Memory at c10000000 (64-bit, prefetchable)
>>>>                 [size=256M]
>>>>                         Memory at <ignored> (64-bit, prefetchable)
>>>>                         I/O ports at 1100 [size=256]
>>>>                         Memory at <ignored> (32-bit, non-prefetchable)
>>>>                         Expansion ROM at <ignored> [disabled]
>>>>                 Capabilities: [48] Vendor Specific Information:
>>>>                 Len=08 <?>
>>>>                 Capabilities: [50] Power Management version 3
>>>>                 Capabilities: [58] Express Legacy Endpoint, MSI 00
>>>>                 Capabilities: [a0] MSI: Enable- Count=1/1 Maskable-
>>>>                 64bit+
>>>>                 Capabilities: [100] Vendor Specific Information:
>>>>                 ID=0001 Rev=1 Len=010 <?>
>>>>                 Capabilities: [150] Advanced Error Reporting
>>>>                 Capabilities: [200] Resizable BAR <?>
>>>>                 Capabilities: [270] Secondary PCI Express <?>
>>>>                 Capabilities: [2b0] Address Translation Service (ATS)
>>>>                 Capabilities: [2c0] Page Request Interface (PRI)
>>>>                 Capabilities: [2d0] Process Address Space ID (PASID)
>>>>                 Capabilities: [320] Latency Tolerance Reporting
>>>>                 Capabilities: [328] Alternative Routing-ID
>>>>                 Interpretation (ARI)
>>>>                 Capabilities: [370] L1 PM Substates
>>>>                         Kernel modules: amdgpu
>>>>
>>>>                 0001:01:00.1 Audio device: Advanced Micro Devices,
>>>>                 Inc. [AMD/ATI] Device aae0
>>>>                         Subsystem: Hightech Information System Ltd.
>>>>                 Device aae0
>>>>                         Flags: bus master, fast devsel, latency 0,
>>>>                 IRQ 17
>>>>                         Memory at <ignored> (64-bit, non-prefetchable)
>>>>                 Capabilities: [48] Vendor Specific Information:
>>>>                 Len=08 <?>
>>>>                 Capabilities: [50] Power Management version 3
>>>>                 Capabilities: [58] Express Legacy Endpoint, MSI 00
>>>>                 Capabilities: [a0] MSI: Enable- Count=1/1 Maskable-
>>>>                 64bit+
>>>>                 Capabilities: [100] Vendor Specific Information:
>>>>                 ID=0001 Rev=1 Len=010 <?>
>>>>                 Capabilities: [150] Advanced Error Reporting
>>>>                 Capabilities: [328] Alternative Routing-ID
>>>>                 Interpretation (ARI)
>>>>
>>>>                 0002:00:00.0 PCI bridge: Freescale Semiconductor
>>>>                 Inc Device 0824 (rev 11) (prog-if 00 [Normal decode])
>>>>                         Device tree node:
>>>>                 /sys/firmware/devicetree/base/pcie@ffe260000/pcie@0
>>>>                         Flags: bus master, fast devsel, latency 0,
>>>>                 IRQ 22
>>>>                         Memory at <ignored> (32-bit, non-prefetchable)
>>>>                         Bus: primary=00, secondary=01,
>>>>                 subordinate=01, sec-latency=0
>>>>                         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>>                         Memory behind bridge: e0000000-efffffff
>>>>                 [size=256M]
>>>>                         Prefetchable memory behind bridge: None
>>>>                 Capabilities: [44] Power Management version 3
>>>>                 Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>>                 Capabilities: [100] Advanced Error Reporting
>>>>                         Kernel driver in use: pcieport
>>>>
>>>>                 0003:00:00.0 PCI bridge: Freescale Semiconductor
>>>>                 Inc Device 0824 (rev 11) (prog-if 00 [Normal decode])
>>>>                         Device tree node:
>>>>                 /sys/firmware/devicetree/base/pcie@ffe270000/pcie@0
>>>>                         Flags: bus master, fast devsel, latency 0,
>>>>                 IRQ 23
>>>>                         Memory at <ignored> (32-bit, non-prefetchable)
>>>>                         Bus: primary=00, secondary=01,
>>>>                 subordinate=01, sec-latency=0
>>>>                         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>>                         Memory behind bridge: e0000000-efffffff
>>>>                 [size=256M]
>>>>                         Prefetchable memory behind bridge: None
>>>>                 Capabilities: [44] Power Management version 3
>>>>                 Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>>                 Capabilities: [100] Advanced Error Reporting
>>>>                         Kernel driver in use: pcieport
>>>>
>>>>                 *AND This is PCIe dmesg message when memory range
>>>>                 is 256MB. It's also giving same message when memory
>>>>                 range is arranged as 4GB;*
>>>>
>>>>                 PCI host bridge /pcie@ffe240000  ranges:
>>>>                  MEM 0x0000000c00000000..0x0000000c0fffffff ->
>>>>                 0x00000000e0000000
>>>>                   IO 0x0000000ff8000000..0x0000000ff800ffff ->
>>>>                 0x0000000000000000
>>>>                 /pcie@ffe240000: PCICSRBAR @ 0xff000000
>>>>                 setup_pci_atmu: end of DRAM 200000000
>>>>                 /pcie@ffe240000: Setup 64-bit PCI DMA window
>>>>                 /pcie@ffe240000: WARNING: Outbound window cfg
>>>>                 leaves gaps in memory map. Adjusting the memory map
>>>>                 could reduce unnecessary bounce buffering.
>>>>                 /pcie@ffe240000: DMA window size is 0xe0000000
>>>>                 Found FSL PCI host bridge at 0x0000000ffe250000.
>>>>                 Firmware bus number: 0->1
>>>>                 PCI host bridge /pcie@ffe250000  ranges:
>>>>                  MEM 0x0000000c10000000..0x0000000c1fffffff ->
>>>>                 0x00000000e0000000
>>>>                   IO 0x0000000ff8010000..0x0000000ff801ffff ->
>>>>                 0x0000000000000000
>>>>                 /pcie@ffe250000: PCICSRBAR @ 0xff000000
>>>>                 setup_pci_atmu: end of DRAM 200000000
>>>>                 /pcie@ffe250000: Setup 64-bit PCI DMA window
>>>>                 /pcie@ffe250000: WARNING: Outbound window cfg
>>>>                 leaves gaps in memory map. Adjusting the memory map
>>>>                 could reduce unnecessary bounce buffering.
>>>>                 /pcie@ffe250000: DMA window size is 0xe0000000
>>>>                 Found FSL PCI host bridge at 0x0000000ffe260000.
>>>>                 Firmware bus number: 0->0
>>>>                 PCI host bridge /pcie@ffe260000  ranges:
>>>>                  MEM 0x0000000c20000000..0x0000000c2fffffff ->
>>>>                 0x00000000e0000000
>>>>                   IO 0x0000000ff8020000..0x0000000ff802ffff ->
>>>>                 0x0000000000000000
>>>>                 /pcie@ffe260000: PCICSRBAR @ 0xff000000
>>>>                 setup_pci_atmu: end of DRAM 200000000
>>>>                 /pcie@ffe260000: Setup 64-bit PCI DMA window
>>>>                 /pcie@ffe260000: WARNING: Outbound window cfg
>>>>                 leaves gaps in memory map. Adjusting the memory map
>>>>                 could reduce unnecessary bounce buffering.
>>>>                 /pcie@ffe260000: DMA window size is 0xe0000000
>>>>                 Found FSL PCI host bridge at 0x0000000ffe270000.
>>>>                 Firmware bus number: 0->0
>>>>                 PCI host bridge /pcie@ffe270000  ranges:
>>>>                  MEM 0x0000000c30000000..0x0000000c3fffffff ->
>>>>                 0x00000000e0000000
>>>>                   IO 0x0000000ff8030000..0x0000000ff803ffff ->
>>>>                 0x0000000000000000
>>>>                 /pcie@ffe270000: PCICSRBAR @ 0xff000000
>>>>                 setup_pci_atmu: end of DRAM 200000000
>>>>                 /pcie@ffe270000: Setup 64-bit PCI DMA window
>>>>                 /pcie@ffe270000: WARNING: Outbound window cfg
>>>>                 leaves gaps in memory map. Adjusting the memory map
>>>>                 could reduce unnecessary bounce buffering.
>>>>                 /pcie@ffe270000: DMA window size is 0xe0000000
>>>>                 iommu: Adding device ff6000000.qman-portal to group 0
>>>>                 iommu: Adding device ff6004000.qman-portal to group 1
>>>>                 iommu: Adding device ff6008000.qman-portal to group 2
>>>>                 iommu: Adding device ff600c000.qman-portal to group 3
>>>>                 iommu: Adding device ff6010000.qman-portal to group 4
>>>>                 iommu: Adding device ff6014000.qman-portal to group 5
>>>>                 iommu: Adding device ff6018000.qman-portal to group 6
>>>>                 iommu: Adding device ff601c000.qman-portal to group 7
>>>>                 iommu: Adding device ff6020000.qman-portal to group 8
>>>>                 iommu: Adding device ff6024000.qman-portal to group 9
>>>>                 iommu: Adding device ffe100300.dma to group 10
>>>>                 iommu: Adding device ffe101300.dma to group 11
>>>>                 iommu: Adding device ffe114000.sdhc to group 12
>>>>                 iommu: Adding device ffe210000.usb to group 13
>>>>                 iommu: Adding device ffe211000.usb to group 14
>>>>                 iommu: Adding device ffe220000.sata to group 15
>>>>                 iommu: Adding device ffe221000.sata to group 16
>>>>                 iommu: Adding device ffe318000.qman to group 17
>>>>                 iommu: Adding device ffe31a000.bman to group 18
>>>>                 iommu: Adding device ffe240000.pcie to group 19
>>>>                 iommu: Adding device ffe250000.pcie to group 20
>>>>                 iommu: Adding device ffe260000.pcie to group 21
>>>>                 iommu: Adding device ffe270000.pcie to group 22
>>>>                 iommu: Adding device ffe140000.qe to group 23
>>>>                 software IO TLB: mapped [mem 0xfbfff000-0xfffff000]
>>>>                 (64MB)
>>>>                 PCI: Probing PCI hardware
>>>>                 fsl-pci ffe240000.pcie: PCI host bridge to bus 0000:00
>>>>                 pci_bus 0000:00: root bus resource [io
>>>>                  0x8000080000010000-0x800008000001ffff] (bus
>>>>                 address [0x0000-0xffff])
>>>>                 pci_bus 0000:00: root bus resource [mem
>>>>                 0xc00000000-0xc0fffffff] (bus address
>>>>                 [0xe0000000-0xefffffff])
>>>>                 pci_bus 0000:00: root bus resource [bus 00]
>>>>                 iommu: Removing device ffe240000.pcie from group 19
>>>>                 iommu: Adding device 0000:00:00.0 to group 24
>>>>                 pci 0000:00:00.0: bridge configuration invalid
>>>>                 ([bus 00-00]), reconfiguring
>>>>                 pci 0000:00:00.0: PCI bridge to [bus 01-ff]
>>>>                 fsl-pci ffe250000.pcie: PCI host bridge to bus 0001:00
>>>>                 pci_bus 0001:00: root bus resource [io
>>>>                  0x8000080000021000-0x8000080000030fff] (bus
>>>>                 address [0x0000-0xffff])
>>>>                 pci_bus 0001:00: root bus resource [mem
>>>>                 0xc10000000-0xc1fffffff] (bus address
>>>>                 [0xe0000000-0xefffffff])
>>>>                 pci_bus 0001:00: root bus resource [bus 00-01]
>>>>                 iommu: Removing device ffe250000.pcie from group 20
>>>>                 iommu: Adding device 0001:00:00.0 to group 19
>>>>                 pci 0001:01:00.0: enabling Extended Tags
>>>>                 pci 0001:01:00.0: 4.000 Gb/s available PCIe
>>>>                 bandwidth, limited by 5 GT/s x1 link at
>>>>                 0001:00:00.0 (capable of 63.008 Gb/s with 8 GT/s x8
>>>>                 link)
>>>>                 iommu: Adding device 0001:01:00.0 to group 19
>>>>                 pci 0001:01:00.1: enabling Extended Tags
>>>>                 iommu: Adding device 0001:01:00.1 to group 19
>>>>                 pci 0001:00:00.0: PCI bridge to [bus 01-ff]
>>>>                 fsl-pci ffe260000.pcie: PCI host bridge to bus 0002:00
>>>>                 pci_bus 0002:00: root bus resource [io
>>>>                  0x8000080000032000-0x8000080000041fff] (bus
>>>>                 address [0x0000-0xffff])
>>>>                 pci_bus 0002:00: root bus resource [mem
>>>>                 0xc20000000-0xc2fffffff] (bus address
>>>>                 [0xe0000000-0xefffffff])
>>>>                 pci_bus 0002:00: root bus resource [bus 00]
>>>>                 iommu: Removing device ffe260000.pcie from group 21
>>>>                 iommu: Adding device 0002:00:00.0 to group 20
>>>>                 pci 0002:00:00.0: bridge configuration invalid
>>>>                 ([bus 00-00]), reconfiguring
>>>>                 pci 0002:00:00.0: PCI bridge to [bus 01-ff]
>>>>                 fsl-pci ffe270000.pcie: PCI host bridge to bus 0003:00
>>>>                 pci_bus 0003:00: root bus resource [io
>>>>                  0x8000080000043000-0x8000080000052fff] (bus
>>>>                 address [0x0000-0xffff])
>>>>                 pci_bus 0003:00: root bus resource [mem
>>>>                 0xc30000000-0xc3fffffff] (bus address
>>>>                 [0xe0000000-0xefffffff])
>>>>                 pci_bus 0003:00: root bus resource [bus 00]
>>>>                 iommu: Removing device ffe270000.pcie from group 22
>>>>                 iommu: Adding device 0003:00:00.0 to group 21
>>>>                 pci 0003:00:00.0: bridge configuration invalid
>>>>                 ([bus 00-00]), reconfiguring
>>>>                 pci 0003:00:00.0: PCI bridge to [bus 01-ff]
>>>>                 PCI: Cannot allocate resource region 0 of device
>>>>                 0000:00:00.0, will remap
>>>>                 PCI: Cannot allocate resource region 0 of device
>>>>                 0001:00:00.0, will remap
>>>>                 PCI: Cannot allocate resource region 2 of device
>>>>                 0001:01:00.0, will remap
>>>>                 PCI: Cannot allocate resource region 5 of device
>>>>                 0001:01:00.0, will remap
>>>>                 PCI: Cannot allocate resource region 6 of device
>>>>                 0001:01:00.0, will remap
>>>>                 PCI: Cannot allocate resource region 0 of device
>>>>                 0001:01:00.1, will remap
>>>>                 PCI: Cannot allocate resource region 0 of device
>>>>                 0002:00:00.0, will remap
>>>>                 PCI: Cannot allocate resource region 0 of device
>>>>                 0003:00:00.0, will remap
>>>>                 pci 0000:00:00.0: BAR 0: no space for [mem size
>>>>                 0x01000000]
>>>>                 pci 0000:00:00.0: BAR 0: failed to assign [mem size
>>>>                 0x01000000]
>>>>                 pci 0000:00:00.0: PCI bridge to [bus 01]
>>>>                 pci 0000:00:00.0: bridge window [io
>>>>                  0x8000080000010000-0x800008000001ffff]
>>>>                 pci 0000:00:00.0: bridge window [mem
>>>>                 0xc00000000-0xc0fffffff]
>>>>                 pci_bus 0000:00: Some PCI device resources are
>>>>                 unassigned, try booting with pci=realloc
>>>>                 pci 0001:00:00.0: BAR 0: no space for [mem size
>>>>                 0x01000000]
>>>>                 pci 0001:00:00.0: BAR 0: failed to assign [mem size
>>>>                 0x01000000]
>>>>                 pci 0001:00:00.0: BAR 9: no space for [mem size
>>>>                 0x00200000 64bit pref]
>>>>                 pci 0001:00:00.0: BAR 9: failed to assign [mem size
>>>>                 0x00200000 64bit pref]
>>>>                 pci 0001:01:00.0: BAR 2: no space for [mem size
>>>>                 0x00200000 64bit pref]
>>>>                 pci 0001:01:00.0: BAR 2: failed to assign [mem size
>>>>                 0x00200000 64bit pref]
>>>>                 pci 0001:01:00.0: BAR 5: no space for [mem size
>>>>                 0x00040000]
>>>>                 pci 0001:01:00.0: BAR 5: failed to assign [mem size
>>>>                 0x00040000]
>>>>                 pci 0001:01:00.0: BAR 6: no space for [mem size
>>>>                 0x00020000 pref]
>>>>                 pci 0001:01:00.0: BAR 6: failed to assign [mem size
>>>>                 0x00020000 pref]
>>>>                 pci 0001:01:00.1: BAR 0: no space for [mem size
>>>>                 0x00004000 64bit]
>>>>                 pci 0001:01:00.1: BAR 0: failed to assign [mem size
>>>>                 0x00004000 64bit]
>>>>                 pci 0001:00:00.0: PCI bridge to [bus 01]
>>>>                 pci 0001:00:00.0: bridge window [io
>>>>                  0x8000080000021000-0x8000080000030fff]
>>>>                 pci 0001:00:00.0: bridge window [mem
>>>>                 0xc10000000-0xc1fffffff]
>>>>                 pci_bus 0001:00: Some PCI device resources are
>>>>                 unassigned, try booting with pci=realloc
>>>>                 pci 0002:00:00.0: BAR 0: no space for [mem size
>>>>                 0x01000000]
>>>>                 pci 0002:00:00.0: BAR 0: failed to assign [mem size
>>>>                 0x01000000]
>>>>                 pci 0002:00:00.0: PCI bridge to [bus 01]
>>>>                 pci 0002:00:00.0: bridge window [io
>>>>                  0x8000080000032000-0x8000080000041fff]
>>>>                 pci 0002:00:00.0: bridge window [mem
>>>>                 0xc20000000-0xc2fffffff]
>>>>                 pci_bus 0002:00: Some PCI device resources are
>>>>                 unassigned, try booting with pci=realloc
>>>>                 pci 0003:00:00.0: BAR 0: no space for [mem size
>>>>                 0x01000000]
>>>>                 pci 0003:00:00.0: BAR 0: failed to assign [mem size
>>>>                 0x01000000]
>>>>                 pci 0003:00:00.0: PCI bridge to [bus 01]
>>>>                 pci 0003:00:00.0: bridge window [io
>>>>                  0x8000080000043000-0x8000080000052fff]
>>>>                 pci 0003:00:00.0: bridge window [mem
>>>>                 0xc30000000-0xc3fffffff]
>>>>                 pci_bus 0003:00: Some PCI device resources are
>>>>                 unassigned, try booting with pci=realloc
>>>>
>>>>
>>>
>>>
>>>             _______________________________________________
>>>             amd-gfx mailing list
>>>             amd-gfx@lists.freedesktop.org  <mailto:amd-gfx@lists.freedesktop.org>
>>>             https://lists.freedesktop.org/mailman/listinfo/amd-gfx  <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cchristian.koenig%40amd.com%7C561d9d3206ff46d338ad08d777ef717c%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637109742603345420&sdata=PO02mQtDBnkAdNfCn%2Fp7QmKX8IbJ7zke8edrfhsM6Mg%3D&reserved=0>
>>
>


[-- Attachment #1.2: Type: text/html, Size: 102885 bytes --]

[-- Attachment #2: Type: text/plain, Size: 153 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [error] Drm -> amdgpu Unrecoverable Machine Check
       [not found]                     ` <CAGzVRjwVdgojwsM1pOYftcXf6WeQN_46QwfQ1gvVUd6duyXiEQ@mail.gmail.com>
@ 2020-01-02  8:41                       ` Yusuf Altıparmak
  0 siblings, 0 replies; 12+ messages in thread
From: Yusuf Altıparmak @ 2020-01-02  8:41 UTC (permalink / raw)
  To: Christian König; +Cc: amd-gfx


[-- Attachment #1.1: Type: text/plain, Size: 28635 bytes --]

Hello Christian,

I solved this problem weeks ago. The problem was, the system I use could
only give 256 MB address range but GPU was demanding more. Even if I give 4
GB, PCIe slot is only having 256 MB, nothing more.  I put a empty area that
is between PCIe2 ( GPU was connected to this) and PCIe3 and everything
worked fine ( I moved forward the start adress of next PCIe device) . If
anyone encounters same problem, here is a sample Device Tree Source for
linux;

pci0: pcie@ffe240000 {
reg = <0xf 0xfe240000 0 0x10000>;
ranges = <0x02000000 0 0xe0000000 0x2 0x0 0x0 0x10000000
 0x01000000 0 0x0 0xf 0xf8000000 0x0 0x00010000>;
pcie@0 {
ranges = <0x02000000 0 0xe0000000
 0x02000000 0 0xe0000000
 0 0x10000000

 0x01000000 0 0x00000000
 0x01000000 0 0x00000000
 0 0x00010000>;
};
};

pci1: pcie@ffe250000 { // GPU CONNECTED TO THIS ONE
reg = <0xf 0xfe250000 0 0x10000>;
ranges = <0x02000000 0 0xe0000000 0x2 0x10000000 0x1 0x00000000
 0x01000000 0 0 0xf 0xf8010000 0 0x00010000>;
pcie@0 {
ranges = <0x02000000 0 0xe0000000
 0x02000000 0 0xe0000000
 0x1 0x00000000

 0x01000000 0 0x00000000
 0x01000000 0 0x00000000
 0 0x00010000>;
};
};

pci2: pcie@ffe260000 {
reg = <0xf 0xfe260000 0 0x10000>;
ranges = <0x02000000 0 0xe0000000 0x3 0x20000000 0 0x10000000 // 0x3
0x20000000 actually it must be 0x3 0x10000000 because I gave 4 GB to pci1
but I also added 256 MB empty area between them. So it started from 0x3
0x20000000
 0x01000000 0 0x00000000 0xf 0xf8020000 0 0x00010000>;
pcie@0 {
ranges = <0x02000000 0 0xe0000000
 0x02000000 0 0xe0000000
 0 0x10000000

 0x01000000 0 0x00000000
 0x01000000 0 0x00000000
 0 0x00010000>;
};
};

pci3: pcie@ffe270000 {
reg = <0xf 0xfe270000 0 0x10000>;
ranges = <0x02000000 0 0xe0000000 0x3 0x30000000 0 0x10000000
 0x01000000 0 0x00000000 0xf 0xf8030000 0 0x00010000>;
pcie@0 {
ranges = <0x02000000 0 0xe0000000
 0x02000000 0 0xe0000000
 0 0x10000000

 0x01000000 0 0x00000000
 0x01000000 0 0x00000000
 0 0x00010000>;
};
};



Yusuf Altıparmak <yusufalti1997@gmail.com>, 3 Ara 2019 Sal, 22:20 tarihinde
şunu yazdı:

>
> What you could try as well is to use the size 320MB for the MMIO. Those
>> ranges usually don't need to be a power of two (only the BARs itself are a
>> power of two) and this way it might even be easier to fit everything
>> together.
>>
>
> Hmm this makes my job easier it seems.
>
>
>> By the way I wonder how can I get at least VGA output from GPU. Maybe I
>> can get a text console on screen or something like X server? Do you have
>> any recommendations?
>>
>> What could maybe work is VGA emulation, which essentially means text
>> only. But no guarantee for that this really works as expected.
>>
>> It's a well known board and U-boot is the most popular bootloader in
> embedded world it seems. I think I am not the only one who tries to connect
> a GPU from PCIe so I think there must be some config variables that enables
> VGA emulation, or some kind of packages.
>
>
>
>> I am just wondering, does modern gaming motherboards have more than 4GB
>> PCIe buffer for this job ?
>>
>> They don't, resources are dynamically assigned instead.
>>
>> See on x86 you usually have 1GB 32-bit address space where the BIOS
>> shuffles all the mandatory devices it sees at boot time into.
>>
>> Then when the motherboard has multiple PEG slots the BIOS also configures
>> a 64-bit address space which is usually rather huge (256GB-1TB). Since the
>> the VRAM and the doorbell BAR are 64bit BARs on the GPU they can be mapped
>> into that as well.
>>
>> This way you can easily have 10 GPUs connected to your CPU.
>>
>> Ah that was a clear answer. So the adress that CPU uses after mapping is
> actully an imaginary/virtual adress. It depends on the operating systems
> bit configuration. If I am not wrong, those addresses are adding on
> previous one meanwhile PCIe is mapping with endpoint device.
>
>
>
>> The problem you have here is that U-config doesn't do this resource
>> assignment automatically and you need to configure it manually.
>>
>
> Yes. By the way, thanks for your answers Christian. I am a newbie to
> embedded world. I have been dealing with these stuffs for 3 months. I
> couldn't get the answers I seek from google. Your answers were more clear
> and understandable.
>
> Best Regards.
>
>
>
>> Am 03.12.19 um 13:50 schrieb Yusuf Altıparmak:
>>
>>
>> Hi Christian,
>>
>>> 0001f000
>>>
>>> Exactly as I thought. The hardware does support BAR resize, but
>>> unfortunately 256MB is already the minimum.
>>>
>>> Sorry, but there isn't anything I could do from the GPU drivers point of
>>> view.
>>>
>>
>> Yes unfortunately there is nothing remained to about GPU side.
>>
>> The only good news I have is that 256M+2M+512K+128K=260M address space
>>> should be enough for the GPU to work, maybe that makes things a bit simpler.
>>>
>>>
>> Right now I am trying to increase MMIO size config to 512 MB, I hope that
>> should help me. By the way I wonder how can I get at least VGA output from
>> GPU. Maybe I can get a text console on screen or something like X server?
>> Do you have any recommendations? I tried this GPU with my own Ubuntu 18.04
>> and its special driver. Everything worked fine.
>>
>>
>>
>>> But you definitely got an interesting use case here :)
>>>
>>
>> This is the worlds the most interesting use case I think. I can't
>> increase MMIO size because U-boot freezes. I can't decrease it to 256 MB
>> again because driver does not accept :D. I am just wondering, does modern
>> gaming motherboards have more than 4GB PCIe buffer for this job ? :D
>>
>>
>> Am 03.12.19 um 11:31 schrieb Yusuf Altıparmak:
>>>
>>> Hello Christian,
>>> My "setpci -s 0001:01:00.0 ECAP15+4.l ECAP15+8.l" output is;
>>>
>>> 0001f000
>>>
>>> 00000820
>>>
>>> Regards.
>>>
>>>
>>>
>>> Yusuf Altıparmak <yusufalti1997@gmail.com>, 2 Ara 2019 Pzt, 19:31
>>> tarihinde şunu yazdı:
>>>
>>>> Most likely not. There is support for resizing the VRAM BAR, but
>>>>> usually you can only make it larger and not smaller.
>>>>> Please give me the output of "sudo setpci -s 0001:01:00.0 ECAP15+4.l
>>>>> ECAP15+8.l" if you want to double check that.
>>>>>
>>>>
>>>> Okay I'll try it tomorrow. What does the " sudo setpci -s 0001:01:00.0
>>>> ECAP15+4.l ECAP15+8.l" command exactly do ?
>>>>
>>>>
>>>>
>>>>> Well you rather need to ask if anybody has sample PCIe configuration
>>>>> for GPUs in general. That problem is not really E9171 related. You might
>>>>> want to ask NXP for that maybe.
>>>>> Sorry, no idea if that is correct or not. You need to ask NXP for help
>>>>> with that.
>>>>>
>>>>>
>>>> Okay no problem. At least I know what is the missing point now. The
>>>> problem is probably because of the .dtsi and u-boot config files. Memory
>>>> ranges are overwriting like you said. I'll ask nxp to take some sample PCIe
>>>> configuration for GPUs.
>>>>
>>>> Thank you for your interest Christian.
>>>> Regards .
>>>>
>>>>
>>>>>
>>>>> Am 02.12.19 um 14:32 schrieb Yusuf Altıparmak:
>>>>>>
>>>>>>
>>>>>>> I attached my dts file.
>>>>>>>
>>>>>>> System is working fine when GPU is not plugged in.
>>>>>>>
>>>>>>> *This is the last console log before freeze:*
>>>>>>> [drm] amdgpu kernel modesetting enabled.
>>>>>>>
>>>>>>> [drm] initializing kernel modesetting (POLARIS12 0x1002:0x6987
>>>>>>> 0x1787:0x2389 0x80).
>>>>>>> [drm] register mmio base: 0x20200000
>>>>>>>
>>>>>>> fsl-fman-port ffe488000.port fm1-gb0: renamed from eth0
>>>>>>>
>>>>>>> [drm] register mmio size: 262144
>>>>>>>
>>>>>>> [drm] add ip block number 0 <vi_common>
>>>>>>>
>>>>>>> [drm] add ip block number 1 <gmc_v8_0>
>>>>>>>
>>>>>>> [drm] add ip block number 2 <tonga_ih>
>>>>>>>
>>>>>>> [drm] add ip block number 3 <powerplay>
>>>>>>>
>>>>>>> [drm] add ip block number 4 <dm>
>>>>>>>
>>>>>>> [drm] add ip block number 5 <gfx_v8_0>
>>>>>>>
>>>>>>> [drm] add ip block number 6 <sdma_v3_0>
>>>>>>>
>>>>>>> [drm] add ip block number 7 <uvd_v6_0>
>>>>>>>
>>>>>>> [drm] add ip block number 8 <vce_v3_0>
>>>>>>>
>>>>>>> [drm] UVD is enabled in VM mode
>>>>>>>
>>>>>>> [drm] UVD ENC is enabled in VM mode
>>>>>>>
>>>>>>> [drm] VCE enabled in VM mode
>>>>>>>
>>>>>>> ATOM BIOS: 113-ER16BFC-001
>>>>>>>
>>>>>>> [drm] GPU posting now...
>>>>>>>
>>>>>>> Disabling lock debugging due to kernel taint
>>>>>>>
>>>>>>> Machine check in kernel mode.
>>>>>>>
>>>>>>> Caused by (from MCSR=a000): Load Error Report
>>>>>>>
>>>>>>> Guarded Load Error Report
>>>>>>>
>>>>>>> Kernel panic - not syncing: Unrecoverable Machine check
>>>>>>>
>>>>>>> CPU: 1 PID: 2023 Comm: udevd Tainted: G   M
>>>>>>>  4.19.26+gc0c2141 #1
>>>>>>> Call Trace:
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> _______________________________________________
>>>>>>> amd-gfx mailing listamd-gfx@lists.freedesktop.orghttps://lists.freedesktop.org/mailman/listinfo/amd-gfx <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cchristian.koenig%40amd.com%7C561d9d3206ff46d338ad08d777ef717c%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637109742603335427&sdata=DstGsDW0X7ennQoF1vg%2FYLGAStF4p1dAeMFhzsAFc1I%3D&reserved=0>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>
>>>>>> Christian König <ckoenig.leichtzumerken@gmail.com>, 2 Ara 2019 Pzt,
>>>>>> 15:28 tarihinde şunu yazdı:
>>>>>>
>>>>>>> Hi Yusuf,
>>>>>>>
>>>>>>> Am 02.12.19 um 12:41 schrieb Yusuf Altıparmak:
>>>>>>>
>>>>>>> My embedded board is freezing when I put E9171 on PCIe. What is the
>>>>>>> meaning of Unrecoverable Machine Check error about GPU?
>>>>>>>
>>>>>>>
>>>>>>> Well see the explanation on Wikipedia for example:
>>>>>>> https://en.wikipedia.org/wiki/Machine-check_exception
>>>>>>> <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fen.wikipedia.org%2Fwiki%2FMachine-check_exception&data=02%7C01%7Cchristian.koenig%40amd.com%7C561d9d3206ff46d338ad08d777ef717c%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637109742603345420&sdata=NtHiZmL7me4JRZR4L9KHZzpD8Zcn6caURVvtiDSgHsE%3D&reserved=0>
>>>>>>>
>>>>>>> In general it means you have messed up something in your hardware
>>>>>>> configuration.
>>>>>>>
>>>>>>> Could PCIe settings in .dts file cause this problem?
>>>>>>>
>>>>>>>
>>>>>>> Possible, but rather unlikely. My best guess is that it is some
>>>>>>> problem with the power supply.
>>>>>>>
>>>>>>> If it is, is there any sample PCIe configuration for E9171?
>>>>>>>
>>>>>>>
>>>>>>> The E9171 is just a PCIe device, so the dtsi is actually rather
>>>>>>> uninteresting. What we really need is a full dmesg and maybe lspci output
>>>>>>> would help as well.
>>>>>>>
>>>>>>> Regards,
>>>>>>> Christian.
>>>>>>>
>>>>>>
>>>>>>
>>>>>> Hi Christian,
>>>>>>
>>>>>> At first, I am using NXP T1042D4RDB-64B which has 256 MB PCIe buffer
>>>>>> according to its. PCIe memory range was arranged to 256 MB in .dts file and
>>>>>> in U-boot configuration file. Driver was giving error with exit code -12
>>>>>> (OUT_OF_MEMORY). But I was able to reach the linux console.
>>>>>>
>>>>>> [    5.512922] [drm] amdgpu kernel modesetting enabled.
>>>>>> [    5.517065] [drm] initializing kernel modesetting (POLARIS12
>>>>>> 0x1002:0x6987 0x1787:0x2389 0x80).
>>>>>> [    5.524507] amdgpu 0001:01:00.0: Fatal error during GPU init
>>>>>> [    5.529296] amdgpu: probe of 0001:01:00.0 failed with error -12
>>>>>>
>>>>>> Then I canged 256 MB to 4GB in .dtsi and U-boot conf file. I also
>>>>>> changed 64KB I/O size to 1MB . When I do this, I wasn't able to reach the
>>>>>> linux console because board was freezing. But driver was successfull at
>>>>>> this time. I already mentioned successfull driver console logs up.
>>>>>>
>>>>>> *this is lspci -v when GPU is plugged and Memory size is 256 MB.*
>>>>>>
>>>>>> root@t1042d4rdb-64b:~# lspci -v
>>>>>> 0000:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev
>>>>>> 11) (prog-if 00 [Normal decode])
>>>>>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe240000
>>>>>> /pcie@0
>>>>>>         Flags: bus master, fast devsel, latency 0, IRQ 20
>>>>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>>>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>>>>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>>>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>>>>>         Prefetchable memory behind bridge: None
>>>>>>         Capabilities: [44] Power Management version 3
>>>>>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>>>>         Capabilities: [100] Advanced Error Reporting
>>>>>>         Kernel driver in use: pcieport
>>>>>>
>>>>>> 0001:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev
>>>>>> 11) (prog-if 00 [Normal decode])
>>>>>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe250000
>>>>>> /pcie@0
>>>>>>         Flags: bus master, fast devsel, latency 0, IRQ 21
>>>>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>>>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>>>>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>>>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>>>>>         Prefetchable memory behind bridge: None
>>>>>>         Capabilities: [44] Power Management version 3
>>>>>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>>>>         Capabilities: [100] Advanced Error Reporting
>>>>>>         Kernel driver in use: pcieport
>>>>>>
>>>>>> 0001:01:00.0 VGA compatible controller: Advanced Micro Devices, Inc.
>>>>>> [AMD/ATI] Lexa [Radeon E9171 MCM] (rev 80) (prog-if 00 [VGA controller])
>>>>>>         Subsystem: Hightech Information System Ltd. Device 2389
>>>>>>         Flags: fast devsel, IRQ 41
>>>>>>         Memory at c10000000 (64-bit, prefetchable) [size=256M]
>>>>>>         Memory at <ignored> (64-bit, prefetchable)
>>>>>>         I/O ports at 1100 [size=256]
>>>>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>>>>         Expansion ROM at <ignored> [disabled]
>>>>>>         Capabilities: [48] Vendor Specific Information: Len=08 <?>
>>>>>>         Capabilities: [50] Power Management version 3
>>>>>>         Capabilities: [58] Express Legacy Endpoint, MSI 00
>>>>>>         Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
>>>>>>         Capabilities: [100] Vendor Specific Information: ID=0001
>>>>>> Rev=1 Len=010 <?>
>>>>>>         Capabilities: [150] Advanced Error Reporting
>>>>>>         Capabilities: [200] Resizable BAR <?>
>>>>>>         Capabilities: [270] Secondary PCI Express <?>
>>>>>>         Capabilities: [2b0] Address Translation Service (ATS)
>>>>>>         Capabilities: [2c0] Page Request Interface (PRI)
>>>>>>         Capabilities: [2d0] Process Address Space ID (PASID)
>>>>>>         Capabilities: [320] Latency Tolerance Reporting
>>>>>>         Capabilities: [328] Alternative Routing-ID Interpretation
>>>>>> (ARI)
>>>>>>         Capabilities: [370] L1 PM Substates
>>>>>>         Kernel modules: amdgpu
>>>>>>
>>>>>> 0001:01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI]
>>>>>> Device aae0
>>>>>>         Subsystem: Hightech Information System Ltd. Device aae0
>>>>>>         Flags: bus master, fast devsel, latency 0, IRQ 17
>>>>>>         Memory at <ignored> (64-bit, non-prefetchable)
>>>>>>         Capabilities: [48] Vendor Specific Information: Len=08 <?>
>>>>>>         Capabilities: [50] Power Management version 3
>>>>>>         Capabilities: [58] Express Legacy Endpoint, MSI 00
>>>>>>         Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
>>>>>>         Capabilities: [100] Vendor Specific Information: ID=0001
>>>>>> Rev=1 Len=010 <?>
>>>>>>         Capabilities: [150] Advanced Error Reporting
>>>>>>         Capabilities: [328] Alternative Routing-ID Interpretation
>>>>>> (ARI)
>>>>>>
>>>>>> 0002:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev
>>>>>> 11) (prog-if 00 [Normal decode])
>>>>>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe260000
>>>>>> /pcie@0
>>>>>>         Flags: bus master, fast devsel, latency 0, IRQ 22
>>>>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>>>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>>>>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>>>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>>>>>         Prefetchable memory behind bridge: None
>>>>>>         Capabilities: [44] Power Management version 3
>>>>>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>>>>         Capabilities: [100] Advanced Error Reporting
>>>>>>         Kernel driver in use: pcieport
>>>>>>
>>>>>> 0003:00:00.0 PCI bridge: Freescale Semiconductor Inc Device 0824 (rev
>>>>>> 11) (prog-if 00 [Normal decode])
>>>>>>         Device tree node: /sys/firmware/devicetree/base/pcie@ffe270000
>>>>>> /pcie@0
>>>>>>         Flags: bus master, fast devsel, latency 0, IRQ 23
>>>>>>         Memory at <ignored> (32-bit, non-prefetchable)
>>>>>>         Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
>>>>>>         I/O behind bridge: 00000000-0000ffff [size=64K]
>>>>>>         Memory behind bridge: e0000000-efffffff [size=256M]
>>>>>>         Prefetchable memory behind bridge: None
>>>>>>         Capabilities: [44] Power Management version 3
>>>>>>         Capabilities: [4c] Express Root Port (Slot-), MSI 00
>>>>>>         Capabilities: [100] Advanced Error Reporting
>>>>>>         Kernel driver in use: pcieport
>>>>>>
>>>>>> *AND This is PCIe dmesg message when memory range is 256MB. It's also
>>>>>> giving same message when memory range is arranged as 4GB;*
>>>>>>
>>>>>> PCI host bridge /pcie@ffe240000  ranges:
>>>>>>  MEM 0x0000000c00000000..0x0000000c0fffffff -> 0x00000000e0000000
>>>>>>   IO 0x0000000ff8000000..0x0000000ff800ffff -> 0x0000000000000000
>>>>>> /pcie@ffe240000: PCICSRBAR @ 0xff000000
>>>>>> setup_pci_atmu: end of DRAM 200000000
>>>>>> /pcie@ffe240000: Setup 64-bit PCI DMA window
>>>>>> /pcie@ffe240000: WARNING: Outbound window cfg leaves gaps in memory
>>>>>> map. Adjusting the memory map could reduce unnecessary bounce buffering.
>>>>>> /pcie@ffe240000: DMA window size is 0xe0000000
>>>>>> Found FSL PCI host bridge at 0x0000000ffe250000. Firmware bus number:
>>>>>> 0->1
>>>>>> PCI host bridge /pcie@ffe250000  ranges:
>>>>>>  MEM 0x0000000c10000000..0x0000000c1fffffff -> 0x00000000e0000000
>>>>>>   IO 0x0000000ff8010000..0x0000000ff801ffff -> 0x0000000000000000
>>>>>> /pcie@ffe250000: PCICSRBAR @ 0xff000000
>>>>>> setup_pci_atmu: end of DRAM 200000000
>>>>>> /pcie@ffe250000: Setup 64-bit PCI DMA window
>>>>>> /pcie@ffe250000: WARNING: Outbound window cfg leaves gaps in memory
>>>>>> map. Adjusting the memory map could reduce unnecessary bounce buffering.
>>>>>> /pcie@ffe250000: DMA window size is 0xe0000000
>>>>>> Found FSL PCI host bridge at 0x0000000ffe260000. Firmware bus number:
>>>>>> 0->0
>>>>>> PCI host bridge /pcie@ffe260000  ranges:
>>>>>>  MEM 0x0000000c20000000..0x0000000c2fffffff -> 0x00000000e0000000
>>>>>>   IO 0x0000000ff8020000..0x0000000ff802ffff -> 0x0000000000000000
>>>>>> /pcie@ffe260000: PCICSRBAR @ 0xff000000
>>>>>> setup_pci_atmu: end of DRAM 200000000
>>>>>> /pcie@ffe260000: Setup 64-bit PCI DMA window
>>>>>> /pcie@ffe260000: WARNING: Outbound window cfg leaves gaps in memory
>>>>>> map. Adjusting the memory map could reduce unnecessary bounce buffering.
>>>>>> /pcie@ffe260000: DMA window size is 0xe0000000
>>>>>> Found FSL PCI host bridge at 0x0000000ffe270000. Firmware bus number:
>>>>>> 0->0
>>>>>> PCI host bridge /pcie@ffe270000  ranges:
>>>>>>  MEM 0x0000000c30000000..0x0000000c3fffffff -> 0x00000000e0000000
>>>>>>   IO 0x0000000ff8030000..0x0000000ff803ffff -> 0x0000000000000000
>>>>>> /pcie@ffe270000: PCICSRBAR @ 0xff000000
>>>>>> setup_pci_atmu: end of DRAM 200000000
>>>>>> /pcie@ffe270000: Setup 64-bit PCI DMA window
>>>>>> /pcie@ffe270000: WARNING: Outbound window cfg leaves gaps in memory
>>>>>> map. Adjusting the memory map could reduce unnecessary bounce buffering.
>>>>>> /pcie@ffe270000: DMA window size is 0xe0000000
>>>>>> iommu: Adding device ff6000000.qman-portal to group 0
>>>>>> iommu: Adding device ff6004000.qman-portal to group 1
>>>>>> iommu: Adding device ff6008000.qman-portal to group 2
>>>>>> iommu: Adding device ff600c000.qman-portal to group 3
>>>>>> iommu: Adding device ff6010000.qman-portal to group 4
>>>>>> iommu: Adding device ff6014000.qman-portal to group 5
>>>>>> iommu: Adding device ff6018000.qman-portal to group 6
>>>>>> iommu: Adding device ff601c000.qman-portal to group 7
>>>>>> iommu: Adding device ff6020000.qman-portal to group 8
>>>>>> iommu: Adding device ff6024000.qman-portal to group 9
>>>>>> iommu: Adding device ffe100300.dma to group 10
>>>>>> iommu: Adding device ffe101300.dma to group 11
>>>>>> iommu: Adding device ffe114000.sdhc to group 12
>>>>>> iommu: Adding device ffe210000.usb to group 13
>>>>>> iommu: Adding device ffe211000.usb to group 14
>>>>>> iommu: Adding device ffe220000.sata to group 15
>>>>>> iommu: Adding device ffe221000.sata to group 16
>>>>>> iommu: Adding device ffe318000.qman to group 17
>>>>>> iommu: Adding device ffe31a000.bman to group 18
>>>>>> iommu: Adding device ffe240000.pcie to group 19
>>>>>> iommu: Adding device ffe250000.pcie to group 20
>>>>>> iommu: Adding device ffe260000.pcie to group 21
>>>>>> iommu: Adding device ffe270000.pcie to group 22
>>>>>> iommu: Adding device ffe140000.qe to group 23
>>>>>> software IO TLB: mapped [mem 0xfbfff000-0xfffff000] (64MB)
>>>>>> PCI: Probing PCI hardware
>>>>>> fsl-pci ffe240000.pcie: PCI host bridge to bus 0000:00
>>>>>> pci_bus 0000:00: root bus resource [io
>>>>>>  0x8000080000010000-0x800008000001ffff] (bus address [0x0000-0xffff])
>>>>>> pci_bus 0000:00: root bus resource [mem 0xc00000000-0xc0fffffff] (bus
>>>>>> address [0xe0000000-0xefffffff])
>>>>>> pci_bus 0000:00: root bus resource [bus 00]
>>>>>> iommu: Removing device ffe240000.pcie from group 19
>>>>>> iommu: Adding device 0000:00:00.0 to group 24
>>>>>> pci 0000:00:00.0: bridge configuration invalid ([bus 00-00]),
>>>>>> reconfiguring
>>>>>> pci 0000:00:00.0: PCI bridge to [bus 01-ff]
>>>>>> fsl-pci ffe250000.pcie: PCI host bridge to bus 0001:00
>>>>>> pci_bus 0001:00: root bus resource [io
>>>>>>  0x8000080000021000-0x8000080000030fff] (bus address [0x0000-0xffff])
>>>>>> pci_bus 0001:00: root bus resource [mem 0xc10000000-0xc1fffffff] (bus
>>>>>> address [0xe0000000-0xefffffff])
>>>>>> pci_bus 0001:00: root bus resource [bus 00-01]
>>>>>> iommu: Removing device ffe250000.pcie from group 20
>>>>>> iommu: Adding device 0001:00:00.0 to group 19
>>>>>> pci 0001:01:00.0: enabling Extended Tags
>>>>>> pci 0001:01:00.0: 4.000 Gb/s available PCIe bandwidth, limited by 5
>>>>>> GT/s x1 link at 0001:00:00.0 (capable of 63.008 Gb/s with 8 GT/s x8 link)
>>>>>> iommu: Adding device 0001:01:00.0 to group 19
>>>>>> pci 0001:01:00.1: enabling Extended Tags
>>>>>> iommu: Adding device 0001:01:00.1 to group 19
>>>>>> pci 0001:00:00.0: PCI bridge to [bus 01-ff]
>>>>>> fsl-pci ffe260000.pcie: PCI host bridge to bus 0002:00
>>>>>> pci_bus 0002:00: root bus resource [io
>>>>>>  0x8000080000032000-0x8000080000041fff] (bus address [0x0000-0xffff])
>>>>>> pci_bus 0002:00: root bus resource [mem 0xc20000000-0xc2fffffff] (bus
>>>>>> address [0xe0000000-0xefffffff])
>>>>>> pci_bus 0002:00: root bus resource [bus 00]
>>>>>> iommu: Removing device ffe260000.pcie from group 21
>>>>>> iommu: Adding device 0002:00:00.0 to group 20
>>>>>> pci 0002:00:00.0: bridge configuration invalid ([bus 00-00]),
>>>>>> reconfiguring
>>>>>> pci 0002:00:00.0: PCI bridge to [bus 01-ff]
>>>>>> fsl-pci ffe270000.pcie: PCI host bridge to bus 0003:00
>>>>>> pci_bus 0003:00: root bus resource [io
>>>>>>  0x8000080000043000-0x8000080000052fff] (bus address [0x0000-0xffff])
>>>>>> pci_bus 0003:00: root bus resource [mem 0xc30000000-0xc3fffffff] (bus
>>>>>> address [0xe0000000-0xefffffff])
>>>>>> pci_bus 0003:00: root bus resource [bus 00]
>>>>>> iommu: Removing device ffe270000.pcie from group 22
>>>>>> iommu: Adding device 0003:00:00.0 to group 21
>>>>>> pci 0003:00:00.0: bridge configuration invalid ([bus 00-00]),
>>>>>> reconfiguring
>>>>>> pci 0003:00:00.0: PCI bridge to [bus 01-ff]
>>>>>> PCI: Cannot allocate resource region 0 of device 0000:00:00.0, will
>>>>>> remap
>>>>>> PCI: Cannot allocate resource region 0 of device 0001:00:00.0, will
>>>>>> remap
>>>>>> PCI: Cannot allocate resource region 2 of device 0001:01:00.0, will
>>>>>> remap
>>>>>> PCI: Cannot allocate resource region 5 of device 0001:01:00.0, will
>>>>>> remap
>>>>>> PCI: Cannot allocate resource region 6 of device 0001:01:00.0, will
>>>>>> remap
>>>>>> PCI: Cannot allocate resource region 0 of device 0001:01:00.1, will
>>>>>> remap
>>>>>> PCI: Cannot allocate resource region 0 of device 0002:00:00.0, will
>>>>>> remap
>>>>>> PCI: Cannot allocate resource region 0 of device 0003:00:00.0, will
>>>>>> remap
>>>>>> pci 0000:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>>>>> pci 0000:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>>>>> pci 0000:00:00.0: PCI bridge to [bus 01]
>>>>>> pci 0000:00:00.0:   bridge window [io
>>>>>>  0x8000080000010000-0x800008000001ffff]
>>>>>> pci 0000:00:00.0:   bridge window [mem 0xc00000000-0xc0fffffff]
>>>>>> pci_bus 0000:00: Some PCI device resources are unassigned, try
>>>>>> booting with pci=realloc
>>>>>> pci 0001:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>>>>> pci 0001:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>>>>> pci 0001:00:00.0: BAR 9: no space for [mem size 0x00200000 64bit pref]
>>>>>> pci 0001:00:00.0: BAR 9: failed to assign [mem size 0x00200000 64bit
>>>>>> pref]
>>>>>> pci 0001:01:00.0: BAR 2: no space for [mem size 0x00200000 64bit pref]
>>>>>> pci 0001:01:00.0: BAR 2: failed to assign [mem size 0x00200000 64bit
>>>>>> pref]
>>>>>> pci 0001:01:00.0: BAR 5: no space for [mem size 0x00040000]
>>>>>> pci 0001:01:00.0: BAR 5: failed to assign [mem size 0x00040000]
>>>>>> pci 0001:01:00.0: BAR 6: no space for [mem size 0x00020000 pref]
>>>>>> pci 0001:01:00.0: BAR 6: failed to assign [mem size 0x00020000 pref]
>>>>>> pci 0001:01:00.1: BAR 0: no space for [mem size 0x00004000 64bit]
>>>>>> pci 0001:01:00.1: BAR 0: failed to assign [mem size 0x00004000 64bit]
>>>>>> pci 0001:00:00.0: PCI bridge to [bus 01]
>>>>>> pci 0001:00:00.0:   bridge window [io
>>>>>>  0x8000080000021000-0x8000080000030fff]
>>>>>> pci 0001:00:00.0:   bridge window [mem 0xc10000000-0xc1fffffff]
>>>>>> pci_bus 0001:00: Some PCI device resources are unassigned, try
>>>>>> booting with pci=realloc
>>>>>> pci 0002:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>>>>> pci 0002:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>>>>> pci 0002:00:00.0: PCI bridge to [bus 01]
>>>>>> pci 0002:00:00.0:   bridge window [io
>>>>>>  0x8000080000032000-0x8000080000041fff]
>>>>>> pci 0002:00:00.0:   bridge window [mem 0xc20000000-0xc2fffffff]
>>>>>> pci_bus 0002:00: Some PCI device resources are unassigned, try
>>>>>> booting with pci=realloc
>>>>>> pci 0003:00:00.0: BAR 0: no space for [mem size 0x01000000]
>>>>>> pci 0003:00:00.0: BAR 0: failed to assign [mem size 0x01000000]
>>>>>> pci 0003:00:00.0: PCI bridge to [bus 01]
>>>>>> pci 0003:00:00.0:   bridge window [io
>>>>>>  0x8000080000043000-0x8000080000052fff]
>>>>>> pci 0003:00:00.0:   bridge window [mem 0xc30000000-0xc3fffffff]
>>>>>> pci_bus 0003:00: Some PCI device resources are unassigned, try
>>>>>> booting with pci=realloc
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>> _______________________________________________
>>>>> amd-gfx mailing listamd-gfx@lists.freedesktop.orghttps://lists.freedesktop.org/mailman/listinfo/amd-gfx <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.freedesktop.org%2Fmailman%2Flistinfo%2Famd-gfx&data=02%7C01%7Cchristian.koenig%40amd.com%7C561d9d3206ff46d338ad08d777ef717c%7C3dd8961fe4884e608e11a82d994e183d%7C0%7C0%7C637109742603345420&sdata=PO02mQtDBnkAdNfCn%2Fp7QmKX8IbJ7zke8edrfhsM6Mg%3D&reserved=0>
>>>>>
>>>>>
>>>>>
>>>
>>

[-- Attachment #1.2: Type: text/html, Size: 102798 bytes --]

[-- Attachment #2: Type: text/plain, Size: 154 bytes --]

_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx

^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2020-01-02  8:41 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2019-12-02 11:41 [error] Drm -> amdgpu Unrecoverable Machine Check Yusuf Altıparmak
2019-12-02 12:28 ` Christian König
2019-12-02 13:32   ` Yusuf Altıparmak
2019-12-02 13:41     ` Christian König
2019-12-02 14:20       ` Yusuf Altıparmak
2019-12-02 14:58         ` Christian König
2019-12-02 16:31           ` Yusuf Altıparmak
2019-12-03 10:31             ` Yusuf Altıparmak
2019-12-03 12:07               ` Christian König
2019-12-03 12:50                 ` Yusuf Altıparmak
2019-12-03 17:29                   ` Christian König
     [not found]                     ` <CAGzVRjwVdgojwsM1pOYftcXf6WeQN_46QwfQ1gvVUd6duyXiEQ@mail.gmail.com>
2020-01-02  8:41                       ` Yusuf Altıparmak

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.