Date: Mon, 11 Dec 2017 09:04:49 -0800
From: "Paul E. McKenney"
To: "Huang, Ying"
Cc: Andrew Morton, Minchan Kim, linux-mm@kvack.org,
	linux-kernel@vger.kernel.org, Hugh Dickins, Johannes Weiner,
	Tim Chen, Shaohua Li, Mel Gorman, Jérôme Glisse, Michal Hocko,
	Andrea Arcangeli, David Rientjes, Rik van Riel, Jan Kara,
	Dave Jiang, Aaron Lu
Subject: Re: [PATCH -mm] mm, swap: Fix race between swapoff and some swap operations
Reply-To: paulmck@linux.vnet.ibm.com
References: <20171207011426.1633-1-ying.huang@intel.com>
	<20171207162937.6a179063a7c92ecac77e44af@linux-foundation.org>
	<20171208014346.GA8915@bbox>
	<87po7pg4jt.fsf@yhuang-dev.intel.com>
	<20171208082644.GA14361@bbox>
	<87k1xxbohp.fsf@yhuang-dev.intel.com>
	<20171208140909.4e31ba4f1235b638ae68fd5c@linux-foundation.org>
	<87609dvnl0.fsf@yhuang-dev.intel.com>
In-Reply-To: <87609dvnl0.fsf@yhuang-dev.intel.com>
Message-Id: <20171211170449.GS7829@linux.vnet.ibm.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Mon, Dec 11, 2017 at 01:30:03PM +0800, Huang, Ying wrote:
> Andrew Morton writes:
>
> > On Fri, 08 Dec 2017 16:41:38 +0800 "Huang\, Ying" wrote:
> >
> >> > Why do we need srcu here? Is it enough with rcu like below?
> >> >
> >> > It might have a bug/room to be optimized about performance/naming.
> >> > I just wanted to show my intention.
> >>
> >> Yes. rcu should work too. But if we use rcu, it may need to be called
> >> several times to make sure the swap device under us doesn't go away, for
> >> example, when checking si->max in __swp_swapcount() and
> >> add_swap_count_continuation(). And I found we need rcu to protect swap
> >> cache radix tree array too. So I think it may be better to use one
> >> calling to srcu_read_lock/unlock() instead of multiple callings to
> >> rcu_read_lock/unlock().
> >
> > Or use stop_machine() ;) It's very crude but it sure is simple. Does
> > anyone have a swapoff-intensive workload?
>
> Sorry, I don't know how to solve the problem with stop_machine().
>
> The problem we try to resolved is that, we have a swap entry, but that
> swap entry can become invalid because of swappoff between we check it
> and we use it. So we need to prevent swapoff to be run between checking
> and using.
>
> I don't know how to use stop_machine() in swapoff to wait for all users
> of swap entry to finish. Anyone can help me on this?

You can think of stop_machine() as being sort of like a reader-writer
lock. The readers can be any section of code with preemption disabled,
and the writer is the function passed to stop_machine().

Users running real-time applications on Linux don't tend to like
stop_machine() much, but perhaps it is nevertheless the right tool
for this particular job.

							Thanx, Paul