From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-5.7 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,MENTIONS_GIT_HOSTING,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 01127C433E0 for ; Thu, 14 May 2020 10:26:29 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id DDB58206A5 for ; Thu, 14 May 2020 10:26:28 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1725999AbgENK02 (ORCPT ); Thu, 14 May 2020 06:26:28 -0400 Received: from cloudserver094114.home.pl ([79.96.170.134]:48080 "EHLO cloudserver094114.home.pl" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725955AbgENK02 (ORCPT ); Thu, 14 May 2020 06:26:28 -0400 Received: from 89-64-84-17.dynamic.chello.pl (89.64.84.17) (HELO kreacher.localnet) by serwer1319399.home.pl (79.96.170.134) with SMTP (IdeaSmtpServer 0.83.415) id ccccac26da084f9f; Thu, 14 May 2020 12:26:26 +0200 From: "Rafael J. Wysocki" To: Francisco Jerez Cc: Peter Zijlstra , "Pandruvada, Srinivas" , linux-pm@vger.kernel.org, intel-gfx@lists.freedesktop.org, chris.p.wilson@intel.com, "Vivi, Rodrigo" , rui.zhang@intel.com, daniel.lezcano@linaro.org, amit.kucheria@verdurent.com Subject: Re: [RFC] GPU-bound energy efficiency improvements for the intel_pstate driver (v2.99) Date: Thu, 14 May 2020 12:26:25 +0200 Message-ID: <10201092.tPqIMJ2V0o@kreacher> In-Reply-To: <874ksmuqx6.fsf@riseup.net> References: <20200428032258.2518-1-currojerez@riseup.net> <20200511105701.GA2940@hirez.programming.kicks-ass.net> <874ksmuqx6.fsf@riseup.net> MIME-Version: 1.0 Content-Transfer-Encoding: 7Bit Content-Type: text/plain; charset="us-ascii" Sender: linux-pm-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-pm@vger.kernel.org On Monday, May 11, 2020 11:01:41 PM CEST Francisco Jerez wrote: > > --==-=-= > Content-Type: multipart/mixed; boundary="=-=-=" > > --=-=-= > Content-Type: text/plain; charset=utf-8 > Content-Disposition: inline > Content-Transfer-Encoding: quoted-printable > > Peter Zijlstra writes: > > > On Mon, Apr 27, 2020 at 08:22:47PM -0700, Francisco Jerez wrote: > >> This addresses the technical concerns people brought up about my > >> previous v2 revision of this series. Other than a few bug fixes, the > >> only major change relative to v2 is that the controller is now exposed > >> as a new CPUFREQ generic governor as requested by Rafael (named > >> "adaptive" in this RFC though other naming suggestions are welcome). > >> Main reason for calling this v2.99 rather than v3 is that I haven't > >> yet addressed all the documentation requests from the v2 thread -- > >> Will spend some time doing that as soon as I have an ACK (ideally from > >> Rafael) that things are moving in the right direction. > >>=20 > >> You can also find this series along with the WIP code for non-HWP > >> platforms in this branch: > >>=20 > >> https://github.com/curro/linux/tree/intel_pstate-vlp-v2.99 > >>=20 > >> Thanks! > >>=20 > >> [PATCHv2.99 01/11] PM: QoS: Add CPU_SCALING_RESPONSE global PM QoS limit. > >> [PATCHv2.99 02/11] drm/i915: Adjust PM QoS scaling response frequency ba= > sed on GPU load. > >> [PATCHv2.99 03/11] OPTIONAL: drm/i915: Expose PM QoS control parameters = > via debugfs. > >> [PATCHv2.99 04/11] cpufreq: Define ADAPTIVE frequency governor policy. > >> [PATCHv2.99 05/11] cpufreq: intel_pstate: Reorder intel_pstate_clear_upd= > ate_util_hook() and intel_pstate_set_update_util_hook(). > >> [PATCHv2.99 06/11] cpufreq: intel_pstate: Call intel_pstate_set_update_u= > til_hook() once from the setpolicy hook. > >> [PATCHv2.99 07/11] cpufreq: intel_pstate: Implement VLP controller stati= > stics and target range calculation. > >> [PATCHv2.99 08/11] cpufreq: intel_pstate: Implement VLP controller for H= > WP parts. > >> [PATCHv2.99 09/11] cpufreq: intel_pstate: Enable VLP controller based on= > ACPI FADT profile and CPUID. > >> [PATCHv2.99 10/11] OPTIONAL: cpufreq: intel_pstate: Add tracing of VLP c= > ontroller status. > >> [PATCHv2.99 11/11] OPTIONAL: cpufreq: intel_pstate: Expose VLP controlle= > r parameters via debugfs. > > > > What I'm missing is an explanation for why this isn't using the > > infrastructure that was build for these kinds of things? The thermal > > framework, was AFAIU, supposed to help with these things, and the IPA > > thing in particular is used by ARM to do exactly this GPU/CPU power > > budget thing. > > > > If thermal/IPA is found wanting, why aren't we improving that? > > The GPU/CPU power budget "thing" is only a positive side effect of this > series on some TDP-bound systems. Its ultimate purpose is improving the > energy efficiency of workloads which have a bottleneck on a device other > than the CPU, by giving the bottlenecking device driver some influence > over the response latency of CPUFREQ governors via a PM QoS interface. > This seems to be completely outside the scope of the thermal framework > and IPA AFAIU. > > > > > How much of that ADAPTIVE crud is actually intel_pstate specific? On a > > (really) quick read it appears to me that much of the controller bits > > there can be applied more generic, and thus should not be part of any > > one governor. > > > > The implementation of that is intel_pstate-specific right now, but the > basic algorithm could be made to work on any other governor in > principle, which is why it is exposed as a generic CPUFREQ governor. I > don't care about taking out the generic CPUFREQ governor changes if you > don't like them, and going back to some driver-specific means of turning > it on and off (though Rafael might disagree with that). > > > Specifically, I want to use sched_util as cpufreq governor and use the > > intel_pstate as a passive driver. > > Yeah, getting a similar optimization into the schedutil governor has > been on my wish list for a while, but I haven't had the time to get very > far on that except for a handful of hacks. The intel_pstate handling is > going to be necessary anyway in order to handle HWP systems gracefully, > at least in the near future until schedutil becomes a viable alternative > to intel_pstate in active mode on HWP systems. FWIW, work is under way to make intel_pstate in the passive mode work on HWP systems. I have a prototype patch for that, but it can be improved. I'll post a new version of it for review, possibly next week. Cheers! From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-5.7 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,MENTIONS_GIT_HOSTING,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=unavailable autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 20889C433E0 for ; Thu, 14 May 2020 10:26:33 +0000 (UTC) Received: from gabe.freedesktop.org (gabe.freedesktop.org [131.252.210.177]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPS id F3462206A5 for ; Thu, 14 May 2020 10:26:32 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org F3462206A5 Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=rjwysocki.net Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=intel-gfx-bounces@lists.freedesktop.org Received: from gabe.freedesktop.org (localhost [127.0.0.1]) by gabe.freedesktop.org (Postfix) with ESMTP id 7CE2B6E30C; Thu, 14 May 2020 10:26:32 +0000 (UTC) Received: from cloudserver094114.home.pl (cloudserver094114.home.pl [79.96.170.134]) by gabe.freedesktop.org (Postfix) with ESMTPS id 64DC16E30C for ; Thu, 14 May 2020 10:26:30 +0000 (UTC) Received: from 89-64-84-17.dynamic.chello.pl (89.64.84.17) (HELO kreacher.localnet) by serwer1319399.home.pl (79.96.170.134) with SMTP (IdeaSmtpServer 0.83.415) id ccccac26da084f9f; Thu, 14 May 2020 12:26:26 +0200 From: "Rafael J. Wysocki" To: Francisco Jerez Date: Thu, 14 May 2020 12:26:25 +0200 Message-ID: <10201092.tPqIMJ2V0o@kreacher> In-Reply-To: <874ksmuqx6.fsf@riseup.net> References: <20200428032258.2518-1-currojerez@riseup.net> <20200511105701.GA2940@hirez.programming.kicks-ass.net> <874ksmuqx6.fsf@riseup.net> MIME-Version: 1.0 Subject: Re: [Intel-gfx] [RFC] GPU-bound energy efficiency improvements for the intel_pstate driver (v2.99) X-BeenThere: intel-gfx@lists.freedesktop.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Intel graphics driver community testing & development List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: amit.kucheria@verdurent.com, linux-pm@vger.kernel.org, Peter Zijlstra , intel-gfx@lists.freedesktop.org, daniel.lezcano@linaro.org, chris.p.wilson@intel.com, "Pandruvada, Srinivas" , rui.zhang@intel.com Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Errors-To: intel-gfx-bounces@lists.freedesktop.org Sender: "Intel-gfx" On Monday, May 11, 2020 11:01:41 PM CEST Francisco Jerez wrote: > > --==-=-= > Content-Type: multipart/mixed; boundary="=-=-=" > > --=-=-= > Content-Type: text/plain; charset=utf-8 > Content-Disposition: inline > Content-Transfer-Encoding: quoted-printable > > Peter Zijlstra writes: > > > On Mon, Apr 27, 2020 at 08:22:47PM -0700, Francisco Jerez wrote: > >> This addresses the technical concerns people brought up about my > >> previous v2 revision of this series. Other than a few bug fixes, the > >> only major change relative to v2 is that the controller is now exposed > >> as a new CPUFREQ generic governor as requested by Rafael (named > >> "adaptive" in this RFC though other naming suggestions are welcome). > >> Main reason for calling this v2.99 rather than v3 is that I haven't > >> yet addressed all the documentation requests from the v2 thread -- > >> Will spend some time doing that as soon as I have an ACK (ideally from > >> Rafael) that things are moving in the right direction. > >>=20 > >> You can also find this series along with the WIP code for non-HWP > >> platforms in this branch: > >>=20 > >> https://github.com/curro/linux/tree/intel_pstate-vlp-v2.99 > >>=20 > >> Thanks! > >>=20 > >> [PATCHv2.99 01/11] PM: QoS: Add CPU_SCALING_RESPONSE global PM QoS limit. > >> [PATCHv2.99 02/11] drm/i915: Adjust PM QoS scaling response frequency ba= > sed on GPU load. > >> [PATCHv2.99 03/11] OPTIONAL: drm/i915: Expose PM QoS control parameters = > via debugfs. > >> [PATCHv2.99 04/11] cpufreq: Define ADAPTIVE frequency governor policy. > >> [PATCHv2.99 05/11] cpufreq: intel_pstate: Reorder intel_pstate_clear_upd= > ate_util_hook() and intel_pstate_set_update_util_hook(). > >> [PATCHv2.99 06/11] cpufreq: intel_pstate: Call intel_pstate_set_update_u= > til_hook() once from the setpolicy hook. > >> [PATCHv2.99 07/11] cpufreq: intel_pstate: Implement VLP controller stati= > stics and target range calculation. > >> [PATCHv2.99 08/11] cpufreq: intel_pstate: Implement VLP controller for H= > WP parts. > >> [PATCHv2.99 09/11] cpufreq: intel_pstate: Enable VLP controller based on= > ACPI FADT profile and CPUID. > >> [PATCHv2.99 10/11] OPTIONAL: cpufreq: intel_pstate: Add tracing of VLP c= > ontroller status. > >> [PATCHv2.99 11/11] OPTIONAL: cpufreq: intel_pstate: Expose VLP controlle= > r parameters via debugfs. > > > > What I'm missing is an explanation for why this isn't using the > > infrastructure that was build for these kinds of things? The thermal > > framework, was AFAIU, supposed to help with these things, and the IPA > > thing in particular is used by ARM to do exactly this GPU/CPU power > > budget thing. > > > > If thermal/IPA is found wanting, why aren't we improving that? > > The GPU/CPU power budget "thing" is only a positive side effect of this > series on some TDP-bound systems. Its ultimate purpose is improving the > energy efficiency of workloads which have a bottleneck on a device other > than the CPU, by giving the bottlenecking device driver some influence > over the response latency of CPUFREQ governors via a PM QoS interface. > This seems to be completely outside the scope of the thermal framework > and IPA AFAIU. > > > > > How much of that ADAPTIVE crud is actually intel_pstate specific? On a > > (really) quick read it appears to me that much of the controller bits > > there can be applied more generic, and thus should not be part of any > > one governor. > > > > The implementation of that is intel_pstate-specific right now, but the > basic algorithm could be made to work on any other governor in > principle, which is why it is exposed as a generic CPUFREQ governor. I > don't care about taking out the generic CPUFREQ governor changes if you > don't like them, and going back to some driver-specific means of turning > it on and off (though Rafael might disagree with that). > > > Specifically, I want to use sched_util as cpufreq governor and use the > > intel_pstate as a passive driver. > > Yeah, getting a similar optimization into the schedutil governor has > been on my wish list for a while, but I haven't had the time to get very > far on that except for a handful of hacks. The intel_pstate handling is > going to be necessary anyway in order to handle HWP systems gracefully, > at least in the near future until schedutil becomes a viable alternative > to intel_pstate in active mode on HWP systems. FWIW, work is under way to make intel_pstate in the passive mode work on HWP systems. I have a prototype patch for that, but it can be improved. I'll post a new version of it for review, possibly next week. Cheers! _______________________________________________ Intel-gfx mailing list Intel-gfx@lists.freedesktop.org https://lists.freedesktop.org/mailman/listinfo/intel-gfx