From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933364AbbLHQAD (ORCPT ); Tue, 8 Dec 2015 11:00:03 -0500 Received: from mga02.intel.com ([134.134.136.20]:63694 "EHLO mga02.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932665AbbLHQAA (ORCPT ); Tue, 8 Dec 2015 11:00:00 -0500 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.20,400,1444719600"; d="scan'208";a="702642647" From: "Luck, Tony" To: Borislav Petkov , "Raj, Ashok" CC: "linux-kernel@vger.kernel.org" , "linux-edac@vger.kernel.org" Subject: RE: [Patch V2] x86, mce: Ensure offline CPU's don't participate in mce rendezvous process. Thread-Topic: [Patch V2] x86, mce: Ensure offline CPU's don't participate in mce rendezvous process. Thread-Index: AQHRLuujvHeocrJoDE+SCXAFkO36pp7AfBCA//96brCAAIsHgP//ltRggACOyICAABQtAP//+g8AAATC0IAAD/FugAAC5E9g Date: Tue, 8 Dec 2015 15:59:58 +0000 Message-ID: <3908561D78D1C84285E8C5FCA982C28F39F7CF67@ORSMSX114.amr.corp.intel.com> References: <20151205002930.GA24005@otc-brkl-03.jf.intel.com> <20151207200019.GH22248@pd.tnic> <3908561D78D1C84285E8C5FCA982C28F39F7C24B@ORSMSX114.amr.corp.intel.com> <20151207201951.GI22248@pd.tnic> <3908561D78D1C84285E8C5FCA982C28F39F7C3A4@ORSMSX114.amr.corp.intel.com> <20151207223427.GJ22248@pd.tnic> <20151207234639.GA81526@otc-brkl-03.jf.intel.com> <20151207232524.GK22248@pd.tnic> <20151208014142.GA82345@otc-brkl-03.jf.intel.com> <20151208091812.GA27180@pd.tnic> In-Reply-To: <20151208091812.GA27180@pd.tnic> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [10.22.254.138] Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from base64 to 8bit by mail.home.local id tB8G1hra017900 > No, the system did panic in both times. The "strange" observation is > that the MCE gets reported only on the cores on node 0. Or at least only > the printks from mce_panic() on the cores on node0 reach the serial > console. You only see messages and logs from node0, because the cpus there are the only ones that see any errors logged in their banks. The cpus on node 1, 2, 3 scan all banks and find nothing, so say nothing. There are no system-wide banks ... just core-wide (in recent generations banks 0-3) and socket-wide (banks >=4). But don't code those numbers into any generic code ... we will change them sooner or later. -Tony {.n++%ݶw{.n+{G{ayʇڙ,jfhz_(階ݢj"mG?&~iOzv^m ?I