From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755068AbcHVWqb (ORCPT ); Mon, 22 Aug 2016 18:46:31 -0400 Received: from mx0b-00003501.pphosted.com ([67.231.152.68]:18794 "EHLO mx0a-000cda01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1751640AbcHVWq3 (ORCPT ); Mon, 22 Aug 2016 18:46:29 -0400 Authentication-Results: seagate.com; dkim=pass header.s="google" header.d=seagate.com MIME-Version: 1.0 In-Reply-To: References: From: Andriy Tkachuk Date: Mon, 22 Aug 2016 23:46:26 +0100 Message-ID: Subject: Re: mm: kswapd struggles reclaiming the pages on 64GB server To: linux-kernel@vger.kernel.org Cc: Mel Gorman Content-Type: text/plain; charset=UTF-8 X-Proofpoint-PolicyRoute: Outbound X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:,, definitions=2016-08-22_13:,, signatures=0 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 priorityscore=1501 suspectscore=3 malwarescore=0 phishscore=0 bulkscore=0 spamscore=0 clxscore=1015 impostorscore=0 lowpriorityscore=0 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1604210000 definitions=main-1608220226 X-Proofpoint-Spam-Policy: Default Domain Policy Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Aug 22, 2016 at 7:37 PM, Andriy Tkachuk wrote: > > The following patch resolved the problem: > ... Sorry, I was too hurry in sending good news. As appeared - the problem is still there: 06:10:01 PM pgpgin/s pgpgout/s fault/s majflt/s pgfree/s pgscank/s pgscand/s pgsteal/s %vmeff 06:20:01 PM 78.72 1408298.79 3394.73 0.54 384043.36 204.96 10.88 148.06 68.60 06:30:01 PM 51.71 1408936.46 3101.97 0.31 384366.70 211.51 19.30 148.59 64.38 06:40:01 PM 65.29 1366755.43 3506.76 0.65 372628.96 383.94 27.11 154.71 37.64 06:50:01 PM 66.89 1376770.04 3544.59 0.78 375676.38 344.94 12.48 149.84 41.92 07:00:01 PM 15.87 1364029.92 3030.66 0.31 371903.73 307.63 13.32 132.75 41.36 07:10:01 PM 134.24 1370087.74 3397.32 1.06 373435.85 394.16 21.16 166.62 40.12 07:20:01 PM 95.27 1379686.06 3462.59 0.74 376050.98 356.59 29.23 159.00 41.21 07:30:01 PM 12.66 1392306.35 2929.64 0.25 379662.45 395.07 28.66 138.58 32.71 07:40:01 PM 142.06 1368613.00 3486.30 0.98 372912.77 575.74 40.40 174.45 28.31 07:50:02 PM 119.77 1389287.62 3562.60 1.02 378661.56 584.00 29.36 167.92 27.38 08:00:01 PM 7.31 1368140.71 3082.65 0.50 372828.37 711.84 58.08 130.82 16.99 08:10:01 PM 174.74 1381379.24 3529.78 1.65 376567.77 1023.07 46.96 182.60 17.07 08:20:01 PM 205.05 1353845.50 3546.95 1.88 368828.37 1807.24 57.29 183.46 9.84 08:30:01 PM 86.50 1375125.95 3182.56 1.41 374676.32 10590.52 217.62 148.83 1.38 08:40:01 PM 359.38 1345209.72 4185.55 3.45 366630.54 1419665.54 43307.32 220.06 0.02 08:50:01 PM 277.27 1324076.64 3889.98 2.51 360915.41 1779132.87 410.38 200.49 0.01 09:00:02 PM 96.48 1364092.11 3235.53 1.69 372031.87 1973266.99 167.74 163.43 0.01 09:10:01 PM 298.87 1324763.68 3914.11 2.90 360957.82 1772066.29 435.17 204.08 0.01 And I also see the inactive file list bigger than the active file: # cat /proc/vmstat | egrep 'Cached|nr_.*active_|pgsteal_.*_normal|pgscan_kswapd_normal|pgrefill_normal|nr_vmscan_write|nr_swap|pgact' nr_inactive_anon 695534 nr_active_anon 14427464 nr_inactive_file 2786 nr_active_file 2698 nr_vmscan_write 1740097 pgactivate 115697891 pgrefill_normal 33345818 pgsteal_kswapd_normal 367908859 pgsteal_direct_normal 681266 pgscan_kswapd_normal 10255454426 So it looks like the time to check Minchan's patch really (disable the check that inactive file list is bigger than active file)... Andriy