From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-wm0-f71.google.com (mail-wm0-f71.google.com [74.125.82.71]) by kanga.kvack.org (Postfix) with ESMTP id EF9936B4C24 for ; Wed, 29 Aug 2018 10:54:45 -0400 (EDT) Received: by mail-wm0-f71.google.com with SMTP id s18-v6so3063432wmc.5 for ; Wed, 29 Aug 2018 07:54:45 -0700 (PDT) Received: from mail-sor-f41.google.com (mail-sor-f41.google.com. [209.85.220.41]) by mx.google.com with SMTPS id m2-v6sor2985429wrj.18.2018.08.29.07.54.44 for (Google Transport Security); Wed, 29 Aug 2018 07:54:44 -0700 (PDT) MIME-Version: 1.0 References: <20180806120042.GL19540@dhcp22.suse.cz> <010001650fe29e66-359ffa28-9290-4e83-a7e2-b6d1d8d2ee1d-000000@email.amazonses.com> <20180806181638.GE10003@dhcp22.suse.cz> <20180821064911.GW29735@dhcp22.suse.cz> <11b4f8cd-6253-262f-4ae6-a14062c58039@suse.cz> <6ef03395-6baa-a6e5-0d5a-63d4721e6ec0@suse.cz> <20180823122111.GG29735@dhcp22.suse.cz> <76c6e92b-df49-d4b5-27f7-5f2013713727@suse.cz> <8b211f35-0722-cd94-1360-a2dd9fba351e@suse.cz> In-Reply-To: <8b211f35-0722-cd94-1360-a2dd9fba351e@suse.cz> From: Marinko Catovic Date: Wed, 29 Aug 2018 16:54:32 +0200 Message-ID: Subject: Re: Caching/buffers become useless after some time Content-Type: multipart/alternative; boundary="0000000000009238ef0574942314" Sender: owner-linux-mm@kvack.org List-ID: To: Vlastimil Babka Cc: Michal Hocko , Christopher Lameter , linux-mm@kvack.org --0000000000009238ef0574942314 Content-Type: text/plain; charset="UTF-8" > > shall I switch it to defer and observe (all hosts are running fine by > > just now) or > > switch to defer while it is in the bad state? > > You could do it immediately and see if no problems appear for long > enough, OTOH... > well cat /sys/kernel/mm/transparent_hugepage/defrag always [defer] defer+madvise madvise never was active now since your reply, however, I can not tell that it helped. This was set on 2 hosts, one has 20GB of unused RAM now. Yesterday there was a similar picture for both, with several GB, one with up to 10GB unused, I just checked once, this is what I recall. tell me if one would like to login remotely, I can set up teamviewer or something for this at any time, just drop a message here and I'll contact you. I have hopes that one can investigate things even on that host that has 20GB unused, it's just a matter of time until this gets to the low values, surely the problem here already kicked in. Also if the remote login is not an option, I'm always happy to provide whatever info you need. --0000000000009238ef0574942314 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable

> shall I switch it to defer and observe (all hosts are running fine by<= br> > just now) or
> switch to defer while it is in the bad state?

You could do it immediately and see if no problems appear for long
enough, OTOH...

well cat /sys/kernel/mm= /transparent_hugepage/defrag
always [defer] defer+madvise madvise never<= /div>
was active now since your reply, however, I can not tell that it = helped.

This was set on 2 hosts, one has 20GB of u= nused RAM now.
Yesterday there was a similar picture for both, wi= th several GB, one with up to 10GB unused,
I just checked once, t= his is what I recall.

tell me if one would like to= login remotely, I can set up teamviewer or something for this
at= any time, just drop a message here and I'll contact you.
I h= ave hopes that one can investigate things even on that host that has 20GB u= nused, it's just
a matter of time until this gets to the low = values, surely the problem here already kicked in.

Also if the remote login is not an option, I'm always happy to provide= whatever info you need.



=
--0000000000009238ef0574942314--