Date: Thu, 28 Dec 2017 18:29:05 +0100 (CET)
From: Thomas Gleixner
To: Alexandru Chirvasitu
cc: Dou Liyang, Pavel Machek, kernel list, Ingo Molnar,
    "Maciej W. Rozycki", Mikael Pettersson, Josh Poulson, Mihai Costache,
    Stephen Hemminger, Marc Zyngier, linux-pci@vger.kernel.org,
    Haiyang Zhang, Dexuan Cui, Simon Xiao, Saeed Mahameed, Jork Loeser,
    Bjorn Helgaas, devel@linuxdriverproject.org, KY Srinivasan
Subject: Re: PROBLEM: 4.15.0-rc3 APIC causes lockups on Core 2 Duo laptop
In-Reply-To: <20171228172250.GD10658@chirva-slack.chirva-slack>
References: <20171219083421.GB24638@arch-chirva.localdomain>
    <20171220131929.GC24638@arch-chirva.localdomain>
    <20171228142117.GA10658@chirva-slack.chirva-slack>
    <20171228154835.GB10658@chirva-slack.chirva-slack>
    <20171228160522.GC10658@chirva-slack.chirva-slack>
    <20171228172250.GD10658@chirva-slack.chirva-slack>
X-Mailing-List: linux-kernel@vger.kernel.org

On Thu, 28 Dec 2017, Alexandru Chirvasitu wrote:
> On Thu, Dec 28, 2017 at 05:10:28PM +0100, Thomas Gleixner wrote:
> > On Thu, 28 Dec 2017, Alexandru Chirvasitu wrote:
> > > Actually, it decided to cooperate for just long enough for me to get
> > > the dmesg out. Attached.
> > >
> > > This is from the kernel you asked about: Dou's patch + yours, i.e. the
> > > latest one in that git log I just sent, booted up with 'apic=debug'.
> >
> > Ok. As I suspected that warning does not trigger. I would have been
> > massively surprised if that happened. So Dou's patch is just a red herring
> > and just might change the timing enough to make the problem 'hide'.
> >
> > Can you try something completely different please?
> >
> > Just use plain Linus tree without any additional patches on top and disable
> > CONFIG_NO_HZ_IDLE, i.e. select CONFIG_HZ_PERIODIC.
> >
> > If that works, then reenable it and add 'nohz=off' to the kernel command
> > line.
> >
>
> No go here I'm afraid:
>
> Linus' clean 4.15-rc5 compiled with CONFIG_HZ_PERIODIC exhibits the
> familiar behaviour: lockups, sometimes instant upon trying to log in,
> sometimes logging me in and freaking out seconds later.

Ok. So it's not the issue I had in mind.

Back to some of the interesting bits in the logs:

[ 36.017942] spurious APIC interrupt through vector ff on CPU#0, should never happen.

Does that message ever show up in 4.14 or 4.9?

Thanks,

	tglx