From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-2.6 required=3.0 tests=DKIM_SIGNED,DKIM_VALID, DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,SPF_PASS, URIBL_BLOCKED,USER_AGENT_MUTT autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 232B4C43142 for ; Thu, 2 Aug 2018 23:48:25 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id A5284215EA for ; Thu, 2 Aug 2018 23:48:24 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=pass (1024-bit key) header.d=chromium.org header.i=@chromium.org header.b="AO6zdM9r" DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org A5284215EA Authentication-Results: mail.kernel.org; dmarc=fail (p=reject dis=none) header.from=chromium.org Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1732119AbeHCBlu (ORCPT ); Thu, 2 Aug 2018 21:41:50 -0400 Received: from mail-pg1-f195.google.com ([209.85.215.195]:41540 "EHLO mail-pg1-f195.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727047AbeHCBlu (ORCPT ); Thu, 2 Aug 2018 21:41:50 -0400 Received: by mail-pg1-f195.google.com with SMTP id z8-v6so1941896pgu.8 for ; Thu, 02 Aug 2018 16:48:21 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=chromium.org; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:content-transfer-encoding:in-reply-to :user-agent; bh=l77VMm7S3rwhgpv/zdaP5bTvATovWuGBx2zyazHA8Y8=; b=AO6zdM9rHhe/myfv7VBUzmbmir+wIeqoEeDlFVERUJUPkiFuexWHyCrXTtq1JQpSCn ExOEje/NJa1IkRi93hS2Mi68aSoKzJk8JjQV2LEDxODEEwPi78k1TGex8ea1LuWYF4cC /cp5STs1uqVLnDory05R8GGJDjc3KpoEaLO1M= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:content-transfer-encoding :in-reply-to:user-agent; bh=l77VMm7S3rwhgpv/zdaP5bTvATovWuGBx2zyazHA8Y8=; b=N6pAm1hB2jk3gvBAYLBZy3vvnxBYsuazpjOCgsL+SLalCIvJTK+4INfNY7sOrC7dVh PoOmFqA5NcIWQM1UZt+gQDOcquX4qpDAaxuXnvl+4TAlqQJYL38Fn4X0d+V7UnpDrxyV PxbtkAd/EP1t2+mqYKH12xFtH9u+3Wp+/ZQGOvb0qssFXj/olSo2JR6fymqH3ywYXtif FJjFUduOgtELXawdio3C1eDFwZ2Ex/c4U6IwmZStgt3P8FA/DBicwWpsN+/m/oGlomYa 494QryN9eajp63IBArBX9abIkx6SheKyaaREPHFpoymBVJ89uvDn4KG8dUV+BkecFZEh OxJQ== X-Gm-Message-State: AOUpUlFXinMgwNCGVjraQ8FZok9zkuH3cKgeg6blFctZwW95WLCT1Vr2 RRhWGIZGzqYOVORwyLC5PZZfkw== X-Google-Smtp-Source: AAOMgpeeKJPVO7vbxfx9KGxzWIF5eTDc5kQiRSU3S0suDeL6sCO4fcURnOWRlr2lAxGhkEB2oWMcfg== X-Received: by 2002:a62:3703:: with SMTP id e3-v6mr1653530pfa.117.1533253701334; Thu, 02 Aug 2018 16:48:21 -0700 (PDT) Received: from localhost ([2620:15c:202:1:b6af:f85:ed6c:ac6a]) by smtp.gmail.com with ESMTPSA id l127-v6sm4705509pfc.55.2018.08.02.16.48.20 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Thu, 02 Aug 2018 16:48:20 -0700 (PDT) Date: Thu, 2 Aug 2018 16:48:20 -0700 From: Matthias Kaehlcke To: Chanwoo Choi Cc: MyungJoo Ham , Kyungmin Park , Arnd Bergmann , Greg Kroah-Hartman , Rob Herring , Mark Rutland , linux-pm@vger.kernel.org, devicetree@vger.kernel.org, linux-kernel@vger.kernel.org, Brian Norris , Douglas Anderson , Enric Balletbo i Serra , "Rafael J . Wysocki" , Viresh Kumar , Lee Jones , Benson Leung , Olof Johansson Subject: Re: [PATCH v5 05/12] PM / devfreq: Add support for policy notifiers Message-ID: <20180802234820.GU68975@google.com> References: <20180703234705.227473-6-mka@chromium.org> <5B3C6C2A.1070205@samsung.com> <20180706175301.GG129942@google.com> <5399c191-e140-e2b8-629b-72ddfbf99b0f@samsung.com> <20180716175050.GZ129942@google.com> <20180731193953.GH68975@google.com> <5B610B48.4030802@samsung.com> <20180801170824.GJ68975@google.com> <5B626563.1090302@samsung.com> <20180802231343.GS68975@google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20180802231343.GS68975@google.com> User-Agent: Mutt/1.9.2 (2017-12-15) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Aug 02, 2018 at 04:13:43PM -0700, Matthias Kaehlcke wrote: > Hi Chanwoo, > > On Thu, Aug 02, 2018 at 10:58:59AM +0900, Chanwoo Choi wrote: > > Hi Matthias, > > > > On 2018년 08월 02일 02:08, Matthias Kaehlcke wrote: > > > Hi Chanwoo, > > > > > > On Wed, Aug 01, 2018 at 10:22:16AM +0900, Chanwoo Choi wrote: > > >> On 2018년 08월 01일 04:39, Matthias Kaehlcke wrote: > > >>> On Mon, Jul 16, 2018 at 10:50:50AM -0700, Matthias Kaehlcke wrote: > > >>>> On Thu, Jul 12, 2018 at 05:44:33PM +0900, Chanwoo Choi wrote: > > >>>>> Hi Matthias, > > >>>>> > > >>>>> On 2018년 07월 07일 02:53, Matthias Kaehlcke wrote: > > >>>>>> Hi Chanwoo, > > >>>>>> > > >>>>>> On Wed, Jul 04, 2018 at 03:41:46PM +0900, Chanwoo Choi wrote: > > >>>>>> > > >>>>>>> Firstly, > > >>>>>>> I'm not sure why devfreq needs the devfreq_verify_within_limits() function. > > >>>>>>> > > >>>>>>> devfreq already used the OPP interface as default. It means that > > >>>>>>> the outside of 'drivers/devfreq' can disable/enable the frequency > > >>>>>>> such as drivers/thermal/devfreq_cooling.c. Also, when some device > > >>>>>>> drivers disable/enable the specific frequency, the devfreq core > > >>>>>>> consider them. > > >>>>>>> > > >>>>>>> So, devfreq doesn't need to devfreq_verify_within_limits() because > > >>>>>>> already support some interface to change the minimum/maximum frequency > > >>>>>>> of devfreq device. > > >>>>>>> > > >>>>>>> In case of cpufreq subsystem, cpufreq only provides 'cpufreq_verify_with_limits()' > > >>>>>>> to change the minimum/maximum frequency of cpu. some device driver cannot > > >>>>>>> change the minimum/maximum frequency through OPP interface. > > >>>>>>> > > >>>>>>> But, in case of devfreq subsystem, as I explained already, devfreq support > > >>>>>>> the OPP interface as default way. devfreq subsystem doesn't need to add > > >>>>>>> other way to change the minimum/maximum frequency. > > >>>>>> > > >>>>>> Using the OPP interface exclusively works as long as a > > >>>>>> enabling/disabling of OPPs is limited to a single driver > > >>>>>> (drivers/thermal/devfreq_cooling.c). When multiple drivers are > > >>>>>> involved you need a way to resolve conflicts, that's the purpose of > > >>>>>> devfreq_verify_within_limits(). Please let me know if there are > > >>>>>> existing mechanisms for conflict resolution that I overlooked. > > >>>>>> > > >>>>>> Possibly drivers/thermal/devfreq_cooling.c could be migrated to use > > >>>>>> devfreq_verify_within_limits() instead of the OPP interface if > > >>>>>> desired, however this seems beyond the scope of this series. > > >>>>> > > >>>>> Actually, if we uses this approach, it doesn't support the multiple drivers too. > > >>>>> If non throttler drivers uses devfreq_verify_within_limits(), the conflict > > >>>>> happen. > > >>>> > > >>>> As long as drivers limit the max freq there is no conflict, the lowest > > >>>> max freq wins. I expect this to be the usual case, apparently it > > >>>> worked for cpufreq for 10+ years. > > >>>> > > >>>> However it is correct that there would be a conflict if a driver > > >>>> requests a min freq that is higher than the max freq requested by > > >>>> another. In this case devfreq_verify_within_limits() resolves the > > >>>> conflict by raising p->max to the min freq. Not sure if this is > > >>>> something that would ever occur in practice though. > > >>>> > > >>>> If we are really concerned about this case it would also be an option > > >>>> to limit the adjustment to the max frequency. > > >>>> > > >>>>> To resolve the conflict for multiple device driver, maybe OPP interface > > >>>>> have to support 'usage_count' such as clk_enable/disable(). > > >>>> > > >>>> This would require supporting negative usage count values, since a OPP > > >>>> should not be enabled if e.g. thermal enables it but the throttler > > >>>> disabled it or viceversa. > > >>>> > > >>>> Theoretically there could also be conflicts, like one driver disabling > > >>>> the higher OPPs and another the lower ones, with the outcome of all > > >>>> OPPs being disabled, which would be a more drastic conflict resolution > > >>>> than that of devfreq_verify_within_limits(). > > >>>> > > >>>> Viresh, what do you think about an OPP usage count? > > >>> > > >>> Ping, can we try to reach a conclusion on this or at least keep the > > >>> discussion going? > > >>> > > >>> Not that it matters, but my preferred solution continues to be > > >>> devfreq_verify_within_limits(). It solves conflicts in some way (which > > >>> could be adjusted if needed) and has proven to work in practice for > > >>> 10+ years in a very similar sub-system. > > >> > > >> It is not true. Current cpufreq subsystem doesn't support external OPP > > >> control to enable/disable the OPP entry. If some device driver > > >> controls the OPP entry of cpufreq driver with opp_disable/enable(), > > >> the operation is not working. Because cpufreq considers the limit > > >> through 'cpufreq_verify_with_limits()' only. > > > > > > Ok, we can probably agree that using cpufreq_verify_with_limits() > > > exclusively seems to have worked well for cpufreq, and that in their > > > overall purpose cpufreq and devfreq are similar subsystems. > > > > > > The current throttler series with devfreq_verify_within_limits() takes > > > the enabled OPPs into account, the lowest and highest OPP are used as > > > a starting point for the frequency adjustment and (in theory) the > > > frequency range should only be narrowed by > > > devfreq_verify_within_limits(). > > > > > >> As I already commented[1], there is different between cpufreq and devfreq. > > >> [1] https://lkml.org/lkml/2018/7/4/80 > > >> > > >> Already, subsystem already used OPP interface in order to control > > >> specific OPP entry. I don't want to provide two outside method > > >> to control the frequency of devfreq driver. It might make the confusion. > > > > > > I understand your point, it would indeed be preferable to have a > > > single method. However I'm not convinced that the OPP interface is > > > a suitable solution, as I exposed earlier in this thread (quoted > > > below). > > > > > > I would like you to at least consider the possibility of changing > > > drivers/thermal/devfreq_cooling.c to devfreq_verify_within_limits(). > > > Besides that it's not what is currently used, do you see any technical > > > concerns that would make devfreq_verify_within_limits() an unsuitable > > > or inferior solution? > > > > As we already discussed, devfreq_verify_within_limits() doesn't support > > the multiple outside controllers (e.g., devfreq-cooling.c). > > That's incorrect, its purpose is precisely that. > > Are you suggesting that cpufreq with its use of > cpufreq_verify_within_limits() (the inspiration for > devfreq_verify_within_limits()) is broken? It is used by cpu_cooling.c > and other drivers when receiving a CPUFREQ_ADJUST event, essentially > what I am proposing with DEVFREQ_ADJUST. > > Could you elaborate why this model wouldn't work for devfreq? "OPP > interface is mandatory for devfreq" isn't really a technical argument, > is it mandatory for any other reason than that it is the interface > that is currently used? > > > After you are suggesting the throttler core, there are at least two > > outside controllers (e.g., devfreq-cooling.c and throttler driver). > > As I knew the problem about conflict, I cannot agree the temporary > > method. OPP interface is mandatory for devfreq in order to control > > the OPP (frequency/voltage). In this situation, we have to try to > > find the method through OPP interface. > > What do you mean with "temporary method"? > > We can try to find a method through the OPP interface, but at this > point I'm not convinced that it is technically necessary or even > preferable. > > Another inconvenient of the OPP approach for both devfreq-cooling.c > and the throttler is that they have to bother with disabling all OPPs > above/below the max/min (they don't/shouldn't have to care), instead > of just telling devfreq the max/min. And a more important one: both drivers now have to keep track which OPPs they enabled/disabled previously, done are the days of a simple dev_pm_opp_enable/disable() in devfreq_cooling. Certainly it is possible and not very complex to implement, but is it really the best/a good solution?