From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-2.3 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_PASS,USER_AGENT_MUTT autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id E659FC6778A for ; Sun, 1 Jul 2018 12:23:00 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id 92D5C2449D for ; Sun, 1 Jul 2018 12:23:00 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 92D5C2449D Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=linux.vnet.ibm.com Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752328AbeGAMW7 (ORCPT ); Sun, 1 Jul 2018 08:22:59 -0400 Received: from mx0a-001b2d01.pphosted.com ([148.163.156.1]:59808 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752124AbeGAMWz (ORCPT ); Sun, 1 Jul 2018 08:22:55 -0400 Received: from pps.filterd (m0098393.ppops.net [127.0.0.1]) by mx0a-001b2d01.pphosted.com (8.16.0.22/8.16.0.22) with SMTP id w61CIrc0142471 for ; Sun, 1 Jul 2018 08:22:55 -0400 Received: from e06smtp01.uk.ibm.com (e06smtp01.uk.ibm.com [195.75.94.97]) by mx0a-001b2d01.pphosted.com with ESMTP id 2jxpyttg3k-1 (version=TLSv1.2 cipher=AES256-GCM-SHA384 bits=256 verify=NOT) for ; Sun, 01 Jul 2018 08:22:55 -0400 Received: from localhost by e06smtp01.uk.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Sun, 1 Jul 2018 13:22:52 +0100 Received: from b06cxnps4076.portsmouth.uk.ibm.com (9.149.109.198) by e06smtp01.uk.ibm.com (192.168.101.131) with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted; (version=TLSv1/SSLv3 cipher=AES256-GCM-SHA384 bits=256/256) Sun, 1 Jul 2018 13:22:49 +0100 Received: from d06av24.portsmouth.uk.ibm.com (d06av24.portsmouth.uk.ibm.com [9.149.105.60]) by b06cxnps4076.portsmouth.uk.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id w61CMm6336765838 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=FAIL); Sun, 1 Jul 2018 12:22:48 GMT Received: from d06av24.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 1A6D442041; Sun, 1 Jul 2018 13:22:34 +0100 (BST) Received: from d06av24.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 6DF0D4203F; Sun, 1 Jul 2018 13:22:33 +0100 (BST) Received: from rapoport-lnx (unknown [9.148.8.245]) by d06av24.portsmouth.uk.ibm.com (Postfix) with ESMTPS; Sun, 1 Jul 2018 13:22:33 +0100 (BST) Date: Sun, 1 Jul 2018 15:22:46 +0300 From: Mike Rapoport To: Rob Herring Cc: mhocko@kernel.org, linux-mm@kvack.org, Johannes Weiner , Andrew Morton , "open list:GENERIC INCLUDE/ASM HEADER FILES" , "linux-kernel@vger.kernel.org" , Yoshinori Sato Subject: h8300: BUG: Bad page state in process swapper (was: Re: why do we still need bootmem allocator?) References: <20180625140754.GB29102@dhcp22.suse.cz> <20180627112655.GD4291@rapoport-lnx> <20180627160206.GB19182@rapoport-lnx> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180627160206.GB19182@rapoport-lnx> User-Agent: Mutt/1.5.24 (2015-08-30) X-TM-AS-GCONF: 00 x-cbid: 18070112-4275-0000-0000-00000293E49D X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 18070112-4276-0000-0000-0000379B5AC3 Message-Id: <20180701122245.GA28969@rapoport-lnx> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:,, definitions=2018-07-01_04:,, signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 priorityscore=1501 malwarescore=0 suspectscore=2 phishscore=0 bulkscore=0 spamscore=0 clxscore=1015 lowpriorityscore=0 mlxscore=0 impostorscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1806210000 definitions=main-1807010146 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org (added Yoshinori Sato, here's the beginning of the discussion: https://lore.kernel.org/lkml/20180625140754.GB29102@dhcp22.suse.cz/) On Wed, Jun 27, 2018 at 07:02:06PM +0300, Mike Rapoport wrote: > On Wed, Jun 27, 2018 at 07:33:55AM -0600, Rob Herring wrote: > > On Wed, Jun 27, 2018 at 5:27 AM Mike Rapoport wrote: > > > > > > I've tried running the current upstream on h8300 gdb simulator and it > > > failed: > > > > It seems my patch[1] is still not applied. The maintainer said he applied it. > > I've applied it manually. Without it unflatten_and_copy_device_tree() fails > to allocate memory. It indeed can be fixed with moving bootmem_init() > before, as you've noted in the commit message. > > I'll try to dig deeper into it. > > > > [ 0.000000] BUG: Bad page state in process swapper pfn:00004 > > > [ 0.000000] page:007ed080 count:0 mapcount:-128 mapping:00000000 > > > index:0x0 > > > [ 0.000000] flags: 0x0() > > > [ 0.000000] raw: 00000000 0040bdac 0040bdac 00000000 00000000 00000002 > > > ffffff7f 00000000 > > > [ 0.000000] page dumped because: nonzero mapcount > > > ---Type to continue, or q to quit--- > > > [ 0.000000] CPU: 0 PID: 0 Comm: swapper Not tainted 4.18.0-rc2+ #50 > > > [ 0.000000] Stack from 00401f2c: > > > [ 0.000000] 00401f2c 001116cb 007ed080 00401f40 000e20e6 00401f54 > > > 0004df14 00000000 > > > [ 0.000000] 007ed080 007ed000 00401f5c 0004df8c 00401f90 0004e982 > > > 00000044 00401fd1 > > > [ 0.000000] 007ed000 007ed000 00000000 00000004 00000008 00000000 > > > 00000003 00000011 > > > [ 0.000000] > > > [ 0.000000] Call Trace: > > > [ 0.000000] [<000e20e6>] [<0004df14>] [<0004df8c>] [<0004e982>] > > > [ 0.000000] [<00051a28>] [<00001000>] [<00000100>] > > > [ 0.000000] Disabling lock debugging due to kernel taint > > > > > > With v4.13 I was able to get to "no valid init found". > > > > > > I had a quick look at h8300 memory initialization and it seems it has > > > starting pfn set to 0 while fdt defines memory start at 4M. > > > > Perhaps there's another issue. In my setup this is caused by __ffs() clobbering start pfn in nobootmem.c::__free_pages_memory(). If I change the __ffs() implementation from the inline assembly to generic bitops everything is fine. I'm using gcc 8.1.0 from [1] and gdb 8.1.0.20180625-git [1] http://cdn.kernel.org/pub/tools/crosstool/files/bin/x86_64/ -- Sincerely yours, From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mike Rapoport Subject: h8300: BUG: Bad page state in process swapper (was: Re: why do we still need bootmem allocator?) Date: Sun, 1 Jul 2018 15:22:46 +0300 Message-ID: <20180701122245.GA28969@rapoport-lnx> References: <20180625140754.GB29102@dhcp22.suse.cz> <20180627112655.GD4291@rapoport-lnx> <20180627160206.GB19182@rapoport-lnx> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Return-path: Content-Disposition: inline In-Reply-To: <20180627160206.GB19182@rapoport-lnx> Sender: linux-kernel-owner@vger.kernel.org To: Rob Herring Cc: mhocko@kernel.org, linux-mm@kvack.org, Johannes Weiner , Andrew Morton , "open list:GENERIC INCLUDE/ASM HEADER FILES" , "linux-kernel@vger.kernel.org" , Yoshinori Sato List-Id: linux-arch.vger.kernel.org (added Yoshinori Sato, here's the beginning of the discussion: https://lore.kernel.org/lkml/20180625140754.GB29102@dhcp22.suse.cz/) On Wed, Jun 27, 2018 at 07:02:06PM +0300, Mike Rapoport wrote: > On Wed, Jun 27, 2018 at 07:33:55AM -0600, Rob Herring wrote: > > On Wed, Jun 27, 2018 at 5:27 AM Mike Rapoport wrote: > > > > > > I've tried running the current upstream on h8300 gdb simulator and it > > > failed: > > > > It seems my patch[1] is still not applied. The maintainer said he applied it. > > I've applied it manually. Without it unflatten_and_copy_device_tree() fails > to allocate memory. It indeed can be fixed with moving bootmem_init() > before, as you've noted in the commit message. > > I'll try to dig deeper into it. > > > > [ 0.000000] BUG: Bad page state in process swapper pfn:00004 > > > [ 0.000000] page:007ed080 count:0 mapcount:-128 mapping:00000000 > > > index:0x0 > > > [ 0.000000] flags: 0x0() > > > [ 0.000000] raw: 00000000 0040bdac 0040bdac 00000000 00000000 00000002 > > > ffffff7f 00000000 > > > [ 0.000000] page dumped because: nonzero mapcount > > > ---Type to continue, or q to quit--- > > > [ 0.000000] CPU: 0 PID: 0 Comm: swapper Not tainted 4.18.0-rc2+ #50 > > > [ 0.000000] Stack from 00401f2c: > > > [ 0.000000] 00401f2c 001116cb 007ed080 00401f40 000e20e6 00401f54 > > > 0004df14 00000000 > > > [ 0.000000] 007ed080 007ed000 00401f5c 0004df8c 00401f90 0004e982 > > > 00000044 00401fd1 > > > [ 0.000000] 007ed000 007ed000 00000000 00000004 00000008 00000000 > > > 00000003 00000011 > > > [ 0.000000] > > > [ 0.000000] Call Trace: > > > [ 0.000000] [<000e20e6>] [<0004df14>] [<0004df8c>] [<0004e982>] > > > [ 0.000000] [<00051a28>] [<00001000>] [<00000100>] > > > [ 0.000000] Disabling lock debugging due to kernel taint > > > > > > With v4.13 I was able to get to "no valid init found". > > > > > > I had a quick look at h8300 memory initialization and it seems it has > > > starting pfn set to 0 while fdt defines memory start at 4M. > > > > Perhaps there's another issue. In my setup this is caused by __ffs() clobbering start pfn in nobootmem.c::__free_pages_memory(). If I change the __ffs() implementation from the inline assembly to generic bitops everything is fine. I'm using gcc 8.1.0 from [1] and gdb 8.1.0.20180625-git [1] http://cdn.kernel.org/pub/tools/crosstool/files/bin/x86_64/ -- Sincerely yours,