From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752576AbeDJIzf (ORCPT ); Tue, 10 Apr 2018 04:55:35 -0400 Received: from mx2.suse.de ([195.135.220.15]:46290 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752182AbeDJIze (ORCPT ); Tue, 10 Apr 2018 04:55:34 -0400 Date: Tue, 10 Apr 2018 10:55:31 +0200 From: Jan Kara To: Michal Hocko Cc: Minchan Kim , Andrew Morton , linux-mm , LKML , Johannes Weiner , Jan Kara , Chris Fries Subject: Re: [PATCH] mm: workingset: fix NULL ptr dereference Message-ID: <20180410085531.m2xvzi7nenbrgbve@quack2.suse.cz> References: <20180409015815.235943-1-minchan@kernel.org> <20180410082243.GW21835@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180410082243.GW21835@dhcp22.suse.cz> User-Agent: NeoMutt/20170421 (1.8.2) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue 10-04-18 10:22:43, Michal Hocko wrote: > On Mon 09-04-18 10:58:15, Minchan Kim wrote: > > Recently, I got a report like below. > > > > [ 7858.792946] [] __list_del_entry+0x30/0xd0 > > [ 7858.792951] [] list_lru_del+0xac/0x1ac > > [ 7858.792957] [] page_cache_tree_insert+0xd8/0x110 > > [ 7858.792962] [] __add_to_page_cache_locked+0xf8/0x4e0 > > [ 7858.792967] [] add_to_page_cache_lru+0x50/0x1ac > > [ 7858.792972] [] pagecache_get_page+0x468/0x57c > > [ 7858.792979] [] __get_node_page+0x84/0x764 > > [ 7858.792986] [] f2fs_iget+0x264/0xdc8 > > [ 7858.792991] [] f2fs_lookup+0x3b4/0x660 > > [ 7858.792998] [] lookup_slow+0x1e4/0x348 > > [ 7858.793003] [] walk_component+0x21c/0x320 > > [ 7858.793008] [] path_lookupat+0x90/0x1bc > > [ 7858.793013] [] filename_lookup+0x8c/0x1a0 > > [ 7858.793018] [] vfs_fstatat+0x84/0x10c > > [ 7858.793023] [] SyS_newfstatat+0x28/0x64 > > > > v4.9 kenrel already has the d3798ae8c6f3,("mm: filemap: don't > > plant shadow entries without radix tree node") so I thought > > it should be okay. When I was googling, I found others report > > such problem and I think current kernel still has the problem. > > > > https://bugzilla.redhat.com/show_bug.cgi?id=1431567 > > https://bugzilla.redhat.com/show_bug.cgi?id=1420335 > > > > It assumes shadow entry of radix tree relies on the init state > > that node->private_list allocated should be list_empty state. > > Currently, it's initailized in SLAB constructor which means > > node of radix tree would be initialized only when *slub allocates > > new page*, not *new object*. So, if some FS or subsystem pass > > gfp_mask to __GFP_ZERO, slub allocator will do memset blindly. > > That means allocated node can have !list_empty(node->private_list). > > It ends up calling NULL deference at workingset_update_node by > > failing list_empty check. > > > > This patch should fix it. > > > > Fixes: 449dd6984d0e ("mm: keep page cache radix tree nodes in check") > > Reported-by: Chris Fries > > Cc: Johannes Weiner > > Cc: Jan Kara > > Signed-off-by: Minchan Kim > > Regardless of whether it makes sense to use __GFP_ZERO from the upper > layer or not, it is subtle as hell to rely on the pre-existing state > for a newly allocated object. So yes this makes perfect sense. > > Do we want CC: stable? > Acked-by: Michal Hocko Well, for hot allocations we do rely on previous state a lot. After all that's what slab constructor was created for. Whether radix tree node allocation is such a hot path is a question for debate, I agree. Honza -- Jan Kara SUSE Labs, CR