From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752060AbcGMPsw (ORCPT ); Wed, 13 Jul 2016 11:48:52 -0400 Received: from mail-bl2nam02on0121.outbound.protection.outlook.com ([104.47.38.121]:64107 "EHLO NAM02-BL2-obe.outbound.protection.outlook.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1750739AbcGMPsl (ORCPT ); Wed, 13 Jul 2016 11:48:41 -0400 Authentication-Results: spf=none (sender IP is ) smtp.mailfrom=linda.knippers@hpe.com; Subject: Re: IOMMU+DMAR causing NMIs-s To: Alex Williamson , Joerg Roedel References: <5630546.0MIYeGNSYG@vostro.rjw.lan> <20160713081817.GJ12639@8bytes.org> <20160713090332.GK12639@8bytes.org> <20160713094917.GM12639@8bytes.org> <20160713101859.GO12639@8bytes.org> <20160713084830.09453584@t450s.home> CC: Meelis Roos , David Woodhouse , , Linux Kernel list From: Linda Knippers Message-ID: <1a4c4b65-c716-08a5-af89-1b0f551024ed@hpe.com> Date: Wed, 13 Jul 2016 11:13:59 -0400 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:45.0) Gecko/20100101 Thunderbird/45.1.1 MIME-Version: 1.0 In-Reply-To: <20160713084830.09453584@t450s.home> Content-Type: text/plain; charset="windows-1252" Content-Transfer-Encoding: 7bit X-Originating-IP: [66.187.233.207] X-ClientProxiedBy: BN3PR0601CA0020.namprd06.prod.outlook.com (10.162.30.30) To DF4PR84MB0235.NAMPRD84.PROD.OUTLOOK.COM (10.162.193.152) X-MS-Office365-Filtering-Correlation-Id: 56e5c703-5bc2-479b-9ef5-08d3ab305a4f X-Microsoft-Exchange-Diagnostics: 1;DF4PR84MB0235;2:Pec3XGeagqEiXXwlWY1P6TBaSLKh17YLV7GwEz747AcqVREPPwmVxs2qui5EuapTYsFLKyHacgjMCcnwtSzLa6xMgSogmHb+JgXuxwe72F8Tt68bkEoM/tkS8UdDhX7Txzl7TvD5iKe40WNsoG0fvvkt3rsd6BSm9k7OSos8qvWphE5/pJ+h73YDLbGaZKmS;3:AXjiPjRGVOxrRlvVC16rIxjKB0qXwtrAlHTac/UrE9Qw2Ksna4MVeVTbR1DYFe9Uxk68WWtbf3k6sz+fqUwTnTou1kPFWNA8hZ0RTMG04VfZHgCtmu0XS8FeojdgzIPP X-Microsoft-Antispam: UriScan:;BCL:0;PCL:0;RULEID:;SRVR:DF4PR84MB0235; X-Microsoft-Exchange-Diagnostics: 1;DF4PR84MB0235;25:WVQOT78sXDdF8D8+AypQUe8+0dk6WN888QTqkBQSypBAtOMUG9/m4UaCJJWFK/J0Vbryv1CnLBBeg3q6hd1GHKcfnYm/WMYmC9q6ST/JchvUDlQqaOkl9ueRi4M7eixQcvEo/qLUinhk1lA9sQNgXMAhyjsAPbTnUes7YwdWYM3m8T5nrTCwYo4kEV0JyummXNNMc7KYrz9HSxefmotlXt24rxg86CwuS6PjCBqGDMcOp9pWaaAzvcPoGfZwVyNU9iKn/AT4sopQ/by0SqoLfMz68s/Hk7HsmsKK4TtoWL/4iiUPM1CtzfZ25DpDW/DGplQSwVy6Gd4OXv5ecarCbNuqf62ahAgygrFCe94d7IzITHwXkCgheINCHjFsVvrx8TsS+xjdquBbCswcArUDkf8VDpCpw4csiFAuloa4Z3kIwfd9sut3ol35QMarUS+Gx0l4WyzrpWvus0T+QvI30O1NB52GEXa0a4EbVXHrxxfW6ktqpq8xCCvaRzyqOTPa1RqQ7VvhutCUC6fA/skqHGeWMLN51ck8QPfPKvHzMg0sO6Nf0R3dcpGlPJN4Z4zZ9iIqnkAf5f85aOL7Rfye/16QN7/gSGwx2NvGRtrTKYI6U9fK4UwJErYSGBlxT0DqJ3/jjTOhMsBVYsQGs8FxK0r9RkM02RwSHgt3J0XDHUQkbM2ytumehA51WIt7ZVRVEAWxD24EEbC3JjenKug5bVRCXfNTSHi9atqG9CwZOKF7eMltQcWsdGBhdijDHZkKZ7NcDrzhTn1k4Rh4f86trjOaKLgxvo+8KZ+ayhmwsxQ/wftM4BCOwrgALeUPGNmw X-Microsoft-Exchange-Diagnostics: 1;DF4PR84MB0235;31:UhUC0Xa5pyPIvy9uRmWE6R6Ks/z6NQbB5vXkOEebOKMZBARy2U+t+JXf3YdHycravEsviDFiCXiuS4t4qPY3j1+aMFZH41zLhxhT79oXuRbBGIqCSR8m4cATnH2DmYn3wnuDTdJiEWU2E/EcbD5Z9PUKvW7YoXE/H5JWYO2sKP0jUfXyhAi/pOtomO+mHUcHY2RFNqFPQTduE3cw/1wBrQ==;20:HWwWCYvvNgcvygj70EUlgGLfU1PhZplKr86L3FBMAszJWtvrJhMX+DHPj51eEK+0D5lCkfce2ilrCUlZeYduE9cSZQdoFCLul4x7U05qFoT3EHnli992b1T+isDFkR3eMkBbZUDFyQt/6idZbVaL1dvQnIxplk4VIOfMUVQMHsQuK9rc7D7xfuHu/gPwbVI8cN2Qi1tDXrbd8cozbLRQvtldPsvLRdbtM/y6MbHBAPFukIpgVpgfOonR2O7EEWnlZaNRhZdm/4uyM73n3Kq2ITK8xhtk0IXUF29HXyX7BdZBRPZoe7R5Qar6DQcyT2+1bocz6nC01z3/WPefjUuth/ri8p/XuYzsiu1IhHv/A+z+Z0xDf4yDVy9XLIsC+o3o1TSNRJuJAbjOcjya0Gx+wmCbYbqAwpOs7/5J0sHW10y9r7D+uEZ8ZruXZGD9ATPJ6ZJOMqkCEMkfDVGsRFx99tXGto+zzouhXYpR9BIuqv9Nr6W5XVWMAF470qBEz4lI X-Microsoft-Antispam-PRVS: X-Exchange-Antispam-Report-Test: UriScan:; X-Exchange-Antispam-Report-CFA-Test: BCL:0;PCL:0;RULEID:(601004)(2401047)(8121501046)(5005006)(10201501046)(3002001)(6055026);SRVR:DF4PR84MB0235;BCL:0;PCL:0;RULEID:;SRVR:DF4PR84MB0235; X-Microsoft-Exchange-Diagnostics: 1;DF4PR84MB0235;4:97KVKpHyI4f4GTvKXHLPILPdP8lrTmD+qcSB5tPiuAznhGJ6fvQioefD3FAUD+Jtx8GwFerOM6Y5fXiUcPhX5myoIWvTxNck63su5yDkpB4HJ548B+B70Z7upAhfuDMId/nfR2F8K4aLknscpHIOb88jG4mRjG950QiwPYY1z4dskgPAX0zbuvhsoBmli9x53sCsZODa0oXG2knoeyaf30RqXj0bdW2aKyBMDiXm/J453bpN6olWUqSngXbf/XtEuZxobHupvFJ4qL1GfcRf9izt9i6BVKCr37TgPehe1K1scNjp6xDhHlBJIleFCg5GdBOuLuJr7Pr9RqG+kYiqclxLe1MAsdTJxvNc5KUUTJjvdF7JIh50jXr11ItfxqXCJOHGIilKSnxTJm7dvyUYAw== X-Forefront-PRVS: 000227DA0C X-Forefront-Antispam-Report: SFV:NSPM;SFS:(10019020)(4630300001)(6049001)(6009001)(7916002)(24454002)(189002)(199003)(377454003)(101416001)(305945005)(66066001)(31686004)(50466002)(36756003)(81166006)(2950100001)(47776003)(64126003)(15975445007)(54356999)(65806001)(50986999)(76176999)(230700001)(7736002)(42186005)(83506001)(7846002)(65956001)(92566002)(81156014)(77096005)(19580395003)(19580405001)(31696002)(2906002)(6116002)(189998001)(68736007)(105586002)(3846002)(4326007)(86362001)(5001770100001)(97736004)(4001350100001)(93886004)(8676002)(106356001)(23746002)(33646002)(586003)(7099028)(65826006);DIR:OUT;SFP:1102;SCL:1;SRVR:DF4PR84MB0235;H:[10.193.20.189];FPR:;SPF:None;PTR:InfoNoRecords;MX:1;A:1;LANG:en; X-Microsoft-Exchange-Diagnostics: =?Windows-1252?Q?1;DF4PR84MB0235;23:qIm0wedEIKfLTrR4bBrLBUn+sn7laWxXA89Mm?= =?Windows-1252?Q?h+n8JSeWFdfavmiJ3qCXJ7oLBNLOQb4xCuq7hAhowbu5TeiCHgnvkce7?= =?Windows-1252?Q?X6haza4gy98syKsBomUNCI9bebhJCdCjSnzPaZaYEhVxngvO/6bCxUIJ?= =?Windows-1252?Q?/HGR+FinEqnH4dQQinzFzkpKQ2DaZbsxwUY151+K6ycqzXQ7daYIdIXq?= =?Windows-1252?Q?YKYL8j3S9CkzjY2a4M8WHLRYINWImnIcNrUK3HmhV/oHUptMp3/l9KtV?= =?Windows-1252?Q?79Nw7Wo4VriR0+A7xIOK/6q+zeIivl75VYulwKL7Mw/zrvC3yWtZWdwD?= =?Windows-1252?Q?/4d3Y/+uhWs+yPTrJ6In5E87J5eAOWdSMsT1PbXvYgqzFdOelK549T3m?= =?Windows-1252?Q?rrvUjleNzVRD4OuUolJilrw65KguhcsQvoMCUmUD4GFUaPzKPtIByARO?= =?Windows-1252?Q?ZDfC+2Yofk+Wc00WbL1dBtv/VSh8KLX/WdOofO7/2QYw9C3P1B2QCD6u?= =?Windows-1252?Q?j1zFb0CZZOMde6QTWHPUB2oMJmZAJk116VPB+WU6ME6Y9rdPgCHRKjF0?= =?Windows-1252?Q?6dhEGiYPRNw43Z3tcqCAspiyHkzU4+6h0AeNDGSlYsUpiV7VEHVHQTCA?= =?Windows-1252?Q?ugNmUE8Cdc//9iA9m3NnFwakinkZ25sca7lPubWliB8yreoo+cCZ8Pu7?= =?Windows-1252?Q?W8xAjCcib1egqslMtw1BC7VfTk4AIxBviHc5KTKs2P9RK0VkwfK3mEoQ?= =?Windows-1252?Q?9ZPOvHp5g6NBPTstacxO0V/RhjnhOFk/LHmZBjHeEbsqtJfPlBdqTPkJ?= =?Windows-1252?Q?qGrLyNOPsN5Z8o0FeorNdpnz7pQEDRfFmCdpt2+uXiV/TuXBLqGM5Cif?= =?Windows-1252?Q?Lxgo7lzubG6cfF2sN3P1sz9pr4cD++bpAyeZP5OzsnNgVPq4ql3I1k7v?= =?Windows-1252?Q?r/S4YAlhZa624bLZu9wWV/F7eEMDh3uQuRa0cSLIpVaqlXKGjcdvaLWp?= =?Windows-1252?Q?yaEA+e0dNJ+RuU91h0cf4bac9kDfAAjSE8DWo4mnYXqHZP1m4/2oDZQ8?= =?Windows-1252?Q?Ja2EL6pwkyug26Ccu/y2hfonXl+4cNcJxtAW2b5kl6tsxoHwGxMysi4f?= =?Windows-1252?Q?TaA+33EWdYs/19lOkG6iDQypiR8x8tZ7cbZUx7h3SEW33dcZpbrsM8B9?= =?Windows-1252?Q?QXFnXP8J1LxOJ3lXwFLd5KlVzThGSZS9zcEo9I3nloKMHQp3ZzEBMDko?= =?Windows-1252?Q?vBTyLaWjGhSUQqmmZF83nZjJVG4KyyAe7+o8gqZVu/hJlQuoQWFHqszK?= =?Windows-1252?Q?q8kZOgFgo8VOUFoMo2TRbbV3gRbBz20JwSV6a2C7bKXSGlwrcaGiC5wN?= =?Windows-1252?Q?M8IUs/VR0voOXH49/KufRDIU9e832JqdMpgMNMxlRU7Y5viSuUcMp5jJ?= =?Windows-1252?Q?9+hpgv0U9dooygeP/Ij?= X-Microsoft-Exchange-Diagnostics: 1;DF4PR84MB0235;6:00yNuovOWPivE2zraMJ8wpC4ph+KO5vbtnDiNc7FPemC8qfn0khzTbI7AlWYNjCHtWO1VvlOZRThjy7NR+eu8tqDptVH0Y4HbG864plHSB1SlwFqjUIqyxyNqaLjhqquHIyiWS3sJoRp4XiEt2FdtEDJuKI4uNJEo76B6AhVG45GegF59okMSIzB+oGL+a/Dk+4O1Q578uT27bzQtYI5H760OE+ubhvXmad6R8ca5IYHmFxTJT6D3h5UzyIsfwfU7QUvpoyfECaVZEXqNs4RVNIXVEonLl2GmT4TrULkurnkjNi9JRvycl9Ea55MnfLMe+SPyMDQ8lsVw21QbgVPSA==;5:wwuCLJY5xMi+ly8wnW13wa3cYG+6dj7jn5C+qQKR4Dbq26fdEcpkJ9+0fCPNJQaF539LqRbhPO0he07b0u4fo1YbjNy0CMr8UPptBZdcEPPW4pXosi1HWpFsA4d21zDxWM2CqDIpWImRbosyJzCReA==;24:cilfJV8K5Nw4ONckM+4gjtwJvE3VcuP70OYuCU1t1v5QkFe+TD5LrLtpWwvpbyEHSStEGWKC5umKFdlLY5SyyG3jrpH5iyiNGqY5UnoRh8I=;7:wLCeM8LaKkMBwThNoJdvYwdqD3HZEbG4gzK0eJ+b7T75uK/egM3RePD45washHyq9XzlXR6PaTE09x/SrwW6ZnmQtlpHrx7G2U+Z+0o/xCjY+0tdbSBRFO1iWUpPZEgH1I2mi9JLdrqcB9qCgNNK4s+s5oAhzm9PD9qcqn4H2OhlmY4wZgmrOL1C8CG0S02zs4GmbetG6vuw6fXu7kUOREy9IE33PV3cBU8YsZ2Pntmr5hC8ZLfWBeP+p2ZVpVSB SpamDiagnosticOutput: 1:99 SpamDiagnosticMetadata: NSPM X-OriginatorOrg: hpe.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 13 Jul 2016 15:14:13.6315 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-Transport-CrossTenantHeadersStamped: DF4PR84MB0235 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 7/13/2016 10:48 AM, Alex Williamson wrote: > On Wed, 13 Jul 2016 12:18:59 +0200 > Joerg Roedel wrote: > >> On Wed, Jul 13, 2016 at 12:58:24PM +0300, Meelis Roos wrote: >>>>> Just got http://kodu.ut.ee/~mroos/4.6-dmar-fault2.png when playing with >>>>> BIOS settings (disabling NUMA). It is the first time I see at least some >>>>> info in NMI decode. >>>> >>>> This looks interesting. Can you please post output of 'lspci -vvv' and >>>> 'lspci -t'? >>> >>> Here. >> >> Thanks. So device 00:1e.0 is a PCI-bridge which has some 32-bit >> PCI-devices behind it. One of these devices tries to read address >> 0xb000, which is blocked by the IOMMU and causes the fault seen in the >> screen-shot. The fault also causes a PCI-error which is then reported >> through the NMI, causing your kernel panic. >> >> So the 32bit PCI devices behind the bridge are: >> >> 01:03.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] ES1000 (rev 02) (prog-if 00 [VGA controller]) >> 01:04.0 System peripheral: Compaq Computer Corporation Integrated Lights Out Controller (rev 03) >> 01:04.2 System peripheral: Compaq Computer Corporation Integrated Lights Out Processor (rev 03) >> 01:04.4 USB controller: Hewlett-Packard Company Integrated Lights-Out Standard Virtual USB Controller (prog-if 00 [UHCI]) >> 01:04.6 IPMI SMIC interface: Hewlett-Packard Company Integrated Lights-Out Standard KCS Interface (prog-if 01) >> >> Can you try to disable this 'Lights Out' processor? Maybe it is causing >> the issues. On the other side, the radeon driver for the ATI card is >> also know for causing faults from time to time. Can you capture the >> kernel messages right before a crash too? > > IIRC, blacklisting the hpwdt module can defuse those NMIs and might > help us see more of the actual DMAR faults. Blacklist in modprobe.d > and rebuild initrd. Thanks, > > Alex > > PS - never assume BIOS release notes are actually complete I agree. I'd do the BIOS update and also make sure the iLO FW is current. -- ljk > _______________________________________________ > iommu mailing list > iommu@lists.linux-foundation.org > https://lists.linuxfoundation.org/mailman/listinfo/iommu >