From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753235AbdCMNnI (ORCPT ); Mon, 13 Mar 2017 09:43:08 -0400 Received: from mx1.redhat.com ([209.132.183.28]:40084 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753018AbdCMNmt (ORCPT ); Mon, 13 Mar 2017 09:42:49 -0400 From: Vitaly Kuznetsov To: Michal Hocko Cc: Igor Mammedov , Heiko Carstens , linux-mm@kvack.org, Andrew Morton , Greg KH , "K. Y. Srinivasan" , David Rientjes , Daniel Kiper , linux-api@vger.kernel.org, LKML , linux-s390@vger.kernel.org, xen-devel@lists.xenproject.org, linux-acpi@vger.kernel.org, qiuxishi@huawei.com, toshi.kani@hpe.com, xieyisheng1@huawei.com, slaoub@gmail.com, iamjoonsoo.kim@lge.com, vbabka@suse.cz Subject: Re: [RFC PATCH] mm, hotplug: get rid of auto_online_blocks References: <20170302142816.GK1404@dhcp22.suse.cz> <20170302180315.78975d4b@nial.brq.redhat.com> <20170303082723.GB31499@dhcp22.suse.cz> <20170303183422.6358ee8f@nial.brq.redhat.com> <20170306145417.GG27953@dhcp22.suse.cz> <20170307134004.58343e14@nial.brq.redhat.com> <20170309125400.GI11592@dhcp22.suse.cz> <20170313115554.41d16b1f@nial.brq.redhat.com> <20170313122825.GO31518@dhcp22.suse.cz> <87a88pgwv0.fsf@vitty.brq.redhat.com> <20170313131924.GP31518@dhcp22.suse.cz> Date: Mon, 13 Mar 2017 14:42:37 +0100 In-Reply-To: <20170313131924.GP31518@dhcp22.suse.cz> (Michal Hocko's message of "Mon, 13 Mar 2017 14:19:25 +0100") Message-ID: <87pohlfg36.fsf@vitty.brq.redhat.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.31]); Mon, 13 Mar 2017 13:42:44 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Michal Hocko writes: > On Mon 13-03-17 13:54:59, Vitaly Kuznetsov wrote: >> Michal Hocko writes: >> >> > On Mon 13-03-17 11:55:54, Igor Mammedov wrote: >> >> > > >> >> > > - suggested RFC is not acceptable from virt point of view >> >> > > as it regresses guests on top of x86 kvm/vmware which >> >> > > both use ACPI based memory hotplug. >> >> > > >> >> > > - udev/userspace solution doesn't work in practice as it's >> >> > > too slow and unreliable when system is under load which >> >> > > is quite common in virt usecase. That's why auto online >> >> > > has been introduced in the first place. >> >> > >> >> > Please try to be more specific why "too slow" is a problem. Also how >> >> > much slower are we talking about? >> >> >> >> In virt case on host with lots VMs, userspace handler >> >> processing could be scheduled late enough to trigger a race >> >> between (guest memory going away/OOM handler) and memory >> >> coming online. >> > >> > Either you are mixing two things together or this doesn't really make >> > much sense. So is this a balloning based on memory hotplug (aka active >> > memory hotadd initiated between guest and host automatically) or a guest >> > asking for additional memory by other means (pay more for memory etc.)? >> > Because if this is an administrative operation then I seriously question >> > this reasoning. >> >> I'm probably repeating myself but it seems this point was lost: >> >> This is not really a 'ballooning', it is just a pure memory >> hotplug. People may have any tools monitoring their VM memory usage and >> when a VM is running low on memory they may want to hotplug more memory >> to it. > > What is the API those guests ask for the memory? And who is actually > responsible to ask for that memory? Is it a kernel or userspace > solution? Whatever, this can even be a system administrator running 'free'. Hyper-V driver sends si_mem_available() and vm_memory_committed() metrics to the host every second and this can be later queried by any tool (e.g. powershell script). > >> With udev-style memory onlining they should be aware of page >> tables and other in-kernel structures which require allocation so they >> need to add memory slowly and gradually or they risk running into OOM >> (at least getting some processes killed and these processes may be >> important). With in-kernel memory hotplug everything happens >> synchronously and no 'slowly and gradually' algorithm is required in >> all tools which may trigger memory hotplug. > > What prevents those APIs being used reasonably and only asks so much > memory as they can afford? I mean 1.5% available memory necessary for > the hotplug is not all that much. Or more precisely what prevents to ask > for this additional memory in a synchronous way? The knowledge about the fact that we need to add memory slowly and wait till it gets onlined is not obvious. AFAIR when you hotplug memory to Windows VMs there is no such thing as 'onlining', and no brain is required, a simple script 'low memory -> add mory memory' always works. Asking all these script writers to think twice before issuing a memory add command memory sounds like too much (to me). -- Vitaly