From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752617AbdBMLZc (ORCPT ); Mon, 13 Feb 2017 06:25:32 -0500 Received: from Galois.linutronix.de ([146.0.238.70]:39208 "EHLO Galois.linutronix.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752172AbdBMLZb (ORCPT ); Mon, 13 Feb 2017 06:25:31 -0500 Date: Mon, 13 Feb 2017 12:25:28 +0100 (CET) From: Thomas Gleixner To: lkml@pengaru.com cc: Pavel Machek , Woody Suwalski , "Rafael J. Wysocki" , kernel list , mingo@redhat.com, hpa@zytor.com Subject: Re: 4.10-rc1: thinkpad x60: who ate my cpu? In-Reply-To: <20170213101851.GX3712@shells.gnugeneration.com> Message-ID: References: <41553b16-c527-d99b-b56b-31d6a08a7e8a@gmail.com> <20170114113054.GA22012@amd> <20170115095656.GA16524@amd> <1614c21c-3626-074e-e3c3-26e9cd200454@gmail.com> <3c3d35ac-e4e4-6a6c-a78e-b0478ff39726@gmail.com> <20170212195742.GA6264@amd> <161a3edb-84f1-769d-d8e9-04102ddbbe58@gmail.com> <20170213080215.GA3730@amd> <20170213094236.GA2924@amd> <20170213101851.GX3712@shells.gnugeneration.com> User-Agent: Alpine 2.20 (DEB 67 2015-01-07) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 13 Feb 2017, lkml@pengaru.com wrote: > On Mon, Feb 13, 2017 at 10:42:36AM +0100, Pavel Machek wrote: > > Hi! > > > > On Mon 2017-02-13 09:48:41, Thomas Gleixner wrote: > > > On Mon, 13 Feb 2017, Pavel Machek wrote: > > > > cd /sys/devices/system/cpu/cpu1 > > > > while true; do echo 0 > online; echo 1 > online; done > > > > > > > > ...crashes x60 with 4.10-rc in few minutes. [Which is bad -- it should > > > > not die, but also good -- this is easier to reproduce then running 100 > > > > suspend cycles.] > > > > > > Can you tell where it crashes? > > > > I did not expect a crash, so I was in X... I have a feeling that this > > will be reproducible on a lot of hardware, but let me try. > > FYI: Lockup reproduced with 4.10.0-rc7 with an X61s. > > Caught a glimpse of something about an RCU stall timeout before the system shut > off. Prior to that, during the loop execution, a bunch of systemd processes > were experiencing watchdog timeouts, and procps `top` would start but > never refresh, leaving the CPU column all "nan". Does the machine use intel_idle by chance? Thanks, tglx