From mboxrd@z Thu Jan 1 00:00:00 1970 From: bugzilla-daemon@bugzilla.kernel.org Subject: [Bug 81841] amd-iommu: kernel BUG & lockup after shutting down KVM guest using PCI passthrough/PCIe bridge Date: Tue, 12 Aug 2014 15:20:34 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 7bit To: kvm@vger.kernel.org Return-path: Received: from mail.kernel.org ([198.145.19.201]:46621 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752593AbaHLPUi (ORCPT ); Tue, 12 Aug 2014 11:20:38 -0400 Received: from mail.kernel.org (localhost [127.0.0.1]) by mail.kernel.org (Postfix) with ESMTP id DDF93201B4 for ; Tue, 12 Aug 2014 15:20:35 +0000 (UTC) Received: from bugzilla2.web.kernel.org (bugzilla2.web.kernel.org [172.20.200.52]) by mail.kernel.org (Postfix) with ESMTP id A0FF82011D for ; Tue, 12 Aug 2014 15:20:34 +0000 (UTC) In-Reply-To: Sender: kvm-owner@vger.kernel.org List-ID: https://bugzilla.kernel.org/show_bug.cgi?id=81841 --- Comment #17 from Marti Raudsepp --- It's an ASRock FM2A88X Extreme6+ motherboard with the AMD A88X (Bolton-D4) chipset. There are 12 IOMMU groups on the system. The problematic group for me is number 9 because the legacy PCI bridge (14.4) gets mixed in with other southbridge devices (all 14.*). /sys/kernel/iommu_groups/0/devices: 0000:00:00.0 -> ../../../../devices/pci0000:00/0000:00:00.0 /sys/kernel/iommu_groups/1/devices: 0000:00:01.0 -> ../../../../devices/pci0000:00/0000:00:01.0 0000:00:01.1 -> ../../../../devices/pci0000:00/0000:00:01.1 /sys/kernel/iommu_groups/2/devices: 0000:00:02.0 -> ../../../../devices/pci0000:00/0000:00:02.0 /sys/kernel/iommu_groups/3/devices: 0000:00:03.0 -> ../../../../devices/pci0000:00/0000:00:03.0 /sys/kernel/iommu_groups/4/devices: 0000:00:04.0 -> ../../../../devices/pci0000:00/0000:00:04.0 /sys/kernel/iommu_groups/5/devices: 0000:00:10.0 -> ../../../../devices/pci0000:00/0000:00:10.0 0000:00:10.1 -> ../../../../devices/pci0000:00/0000:00:10.1 /sys/kernel/iommu_groups/6/devices: 0000:00:11.0 -> ../../../../devices/pci0000:00/0000:00:11.0 /sys/kernel/iommu_groups/7/devices: 0000:00:12.0 -> ../../../../devices/pci0000:00/0000:00:12.0 0000:00:12.2 -> ../../../../devices/pci0000:00/0000:00:12.2 /sys/kernel/iommu_groups/8/devices: 0000:00:13.0 -> ../../../../devices/pci0000:00/0000:00:13.0 0000:00:13.2 -> ../../../../devices/pci0000:00/0000:00:13.2 /sys/kernel/iommu_groups/9/devices: 0000:00:14.0 -> ../../../../devices/pci0000:00/0000:00:14.0 0000:00:14.1 -> ../../../../devices/pci0000:00/0000:00:14.1 0000:00:14.2 -> ../../../../devices/pci0000:00/0000:00:14.2 0000:00:14.3 -> ../../../../devices/pci0000:00/0000:00:14.3 0000:00:14.4 -> ../../../../devices/pci0000:00/0000:00:14.4 0000:00:14.5 -> ../../../../devices/pci0000:00/0000:00:14.5 0000:01:05.0 -> ../../../../devices/pci0000:00/0000:00:14.4/0000:01:05.0 [When I plug in a card to the other legacy PCI slot, it also appears here as pci0000:00/0000:00:14.4/0000:01:06.0] /sys/kernel/iommu_groups/10/devices: 0000:00:15.0 -> ../../../../devices/pci0000:00/0000:00:15.0 0000:00:15.2 -> ../../../../devices/pci0000:00/0000:00:15.2 0000:00:15.3 -> ../../../../devices/pci0000:00/0000:00:15.3 0000:03:00.0 -> ../../../../devices/pci0000:00/0000:00:15.2/0000:03:00.0 0000:04:00.0 -> ../../../../devices/pci0000:00/0000:00:15.3/0000:04:00.0 /sys/kernel/iommu_groups/11/devices: 0000:00:18.0 -> ../../../../devices/pci0000:00/0000:00:18.0 0000:00:18.1 -> ../../../../devices/pci0000:00/0000:00:18.1 0000:00:18.2 -> ../../../../devices/pci0000:00/0000:00:18.2 0000:00:18.3 -> ../../../../devices/pci0000:00/0000:00:18.3 0000:00:18.4 -> ../../../../devices/pci0000:00/0000:00:18.4 0000:00:18.5 -> ../../../../devices/pci0000:00/0000:00:18.5 (In reply to Joel Schopp from comment #15) > It's not clear to me which devices were being put in the same group. Here's > some of my notes on your lspci output Other than the 14.* devices everything seems to be as you describe. > +-14.0 Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller > +-14.1 Advanced Micro Devices, Inc. [AMD] FCH IDE Controller > +-14.2 Advanced Micro Devices, Inc. [AMD] FCH Azalia Controller > +-14.3 Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge > I do not think the SMBus/IDE/Azalia/LPC are isolated from eachother, but > they are isolated from the other devices I have identified. Ok, that's not a problem. > +-14.4-[01]----05.0 Dialogic Corporation PRI > The legacy PCI should be isolated from the other devices identified. Not > sure what is going on here. Yep, currently shares group 9. > +-14.5 Advanced Micro Devices, Inc. [AMD] FCH USB OHCI Controller > This OHCI Controller should also be isolated from the other devices. Also shares group 9. > +-15.0-[02]-- > +-15.2-[03]----00.0 ASMedia Technology Inc. ASM1042 SuperSpeed > USB Host Controller > Is this in a PCI-e slot or otherwise attached to the PCI-e? Nope, this is integrated on the motherboard. The only used PCI slot is the Dialogic card. > +-15.3-[04]----00.0 Qualcomm Atheros QCA8171 Gigabit Ethernet > Is this in a PCI-e slot or otherwise attached to the PCI-e? Integrated Ethernet. -- You are receiving this mail because: You are watching the assignee of the bug.