From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
 id S1751286AbdH1NxO convert rfc822-to-8bit (ORCPT );
 Mon, 28 Aug 2017 09:53:14 -0400
Received: from mout.gmx.net ([212.227.17.20]:50791 "EHLO mout.gmx.net"
 rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
 id S1751170AbdH1NxN (ORCPT ); Mon, 28 Aug 2017 09:53:13 -0400
Message-ID: <1503928376.5709.11.camel@gmx.de>
Subject: Re: 4.13-rc7: WARNING at arch/x86/kvm/mmu.c:717 (and a crash thereafter)
From: Mike Galbraith
To: Takashi Iwai , Paolo Bonzini
Cc: kvm@vger.kernel.org, x86@kernel.org, linux-kernel@vger.kernel.org
Date: Mon, 28 Aug 2017 15:52:56 +0200
In-Reply-To:
References:
Content-Type: text/plain; charset="UTF-8"
X-Mailer: Evolution 3.20.5
Mime-Version: 1.0
Content-Transfer-Encoding: 8BIT
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

On Mon, 2017-08-28 at 14:26 +0200, Takashi Iwai wrote:
> Hi,
>
> I seem to get a kernel warning when running KVM on Dell desktop with
> IvyBridge like below. As you can see, a bad page BUG is triggered
> after that, too. The problem is not triggered always, but it happens
> occasionally.
>
> I haven't seen this on 4.13-rc4 at all, and IIRC, it started happening
> since rc5. So this might be a regression at rc5. But, as it doesn't
> happen always, I can't be 100% sure about it, and it's quite difficult
> to bisect (the test case isn't reliable), unfortunately.
>
> Any hint for further debugging this?

Maybe a way to make failure more likely. This is an RT kernel, but
trying to build a fat kernel over NFS from a KVM clone of my
workstation (full topology, half of ram) didn't survive one build.

[ 2583.153312] WARNING: CPU: 7 PID: 9323 at arch/x86/kvm/mmu.c:717 mmu_spte_clear_track_bits+0x82/0x100 [kvm]
[ 2583.153899] WARNING: CPU: 7 PID: 9323 at arch/x86/kvm/mmu.c:717 mmu_spte_clear_track_bits+0x82/0x100 [kvm]
[ 2583.154016] WARNING: CPU: 7 PID: 9323 at arch/x86/kvm/mmu.c:717 mmu_spte_clear_track_bits+0x82/0x100 [kvm]
[ 2583.158810] WARNING: CPU: 7 PID: 9323 at arch/x86/kvm/mmu.c:717 mmu_spte_clear_track_bits+0x82/0x100 [kvm]
[ 2768.419797] BUG: Bad page state in process as pfn:048b3
[ 2768.419932] BUG: Bad page state in process as pfn:04983
[ 2775.097980] BUG: Bad page state in process cc1 pfn:04982
[ 2782.487748] BUG: Bad page state in process cc1 pfn:04980
[ 2782.622636] BUG: Bad page state in process cc1 pfn:048b0
[ 2782.622899] BUG: Bad page state in process cc1 pfn:04984
[ 2782.623053] BUG: Bad page state in process cc1 pfn:04986
[ 2782.673705] BUG: Bad page state in process cc1 pfn:048b4
[ 2782.673903] BUG: Bad page state in process cc1 pfn:048b6
[ 2782.674044] BUG: Bad page state in process cc1 pfn:04989
[ 2782.674185] BUG: Bad page state in process cc1 pfn:0498a
[ 2784.895701] BUG: Bad page state in process cc1 pfn:04990
[ 2784.895921] BUG: Bad page state in process cc1 pfn:04992
[ 2784.896100] BUG: Bad page state in process cc1 pfn:04994
[ 2784.896255] BUG: Bad page state in process cc1 pfn:04996
[ 2784.905232] BUG: Bad page state in process cc1 pfn:0499c
[ 2784.905501] BUG: Bad page state in process cc1 pfn:0499e
[ 2785.762044] BUG: Bad page state in process cc1 pfn:040cb
[ 2787.052976] BUG: Bad page state in process cc1 pfn:048ca
[ 2787.208480] BUG: Bad page state in process kdesu pfn:048a8
[ 2787.208694] BUG: Bad page state in process kdesu pfn:048aa
[ 2787.208862] BUG: Bad page state in process kdesu pfn:048ac
[ 2787.208957] BUG: Bad page state in process kdesu pfn:048ae
[ 2787.211725] BUG: Bad page state in process cc1 pfn:04884
[ 2787.219784] BUG: Bad page state in process kdesu pfn:04888
[ 2787.226212] BUG: Bad page state in process cc1 pfn:049a0
[ 2788.955108] BUG: Bad page state in process cc1 pfn:048e9
[ 2788.959686] BUG: Bad page state in process cc1 pfn:048f1
[ 2788.959882] BUG: Bad page state in process cc1 pfn:048f2
[ 2788.977485] BUG: Bad page state in process cc1 pfn:048fe
[ 2789.295335] BUG: Bad page state in process cc1 pfn:04807
[ 2794.661501] BUG: Bad page state in process cc1 pfn:04819
[ 2794.661658] BUG: Bad page state in process cc1 pfn:0481b
[ 2794.661747] BUG: Bad page state in process cc1 pfn:0481d
[ 2794.680432] BUG: Bad page state in process cc1 pfn:0482a
[ 2794.692849] BUG: Bad page state in process cc1 pfn:04834
[ 2794.705438] BUG: Bad page state in process cc1 pfn:0483c
[ 2794.784882] BUG: Bad page state in process gcc pfn:0485c
[ 2794.785105] BUG: Bad page state in process gcc pfn:0485e
[ 2796.541058] BUG: Bad page state in process Xorg pfn:04011
[ 2808.425625] BUG: Bad page state in process Xorg pfn:04a09
[ 3605.187591] BUG: unable to handle kernel paging request at 000000000001bcf4
[ 3605.202446] BUG: sleeping function called from invalid context at ./include/linux/percpu-rwsem.h:33
[ 3605.203322] BUG: stack guard page was hit at ffffc9000e483ff8 (stack is ffffc9000e484000..ffffc9000e487fff)
[ 3605.279108] BUG: scheduling while atomic: ld/5485/0x00000002

>