All of lore.kernel.org
 help / color / mirror / Atom feed
From: Robert Bragg <robert@sixbynine.org>
To: Chris Wilson <chris@chris-wilson.co.uk>,
	Robert Bragg <robert@sixbynine.org>,
	Intel Graphics Development <intel-gfx@lists.freedesktop.org>,
	Daniel Vetter <daniel.vetter@intel.com>,
	Jani Nikula <jani.nikula@linux.intel.com>,
	David Airlie <airlied@linux.ie>,
	Zhenyu Wang <zhenyuw@linux.intel.com>,
	Sourab Gupta <sourab.gupta@intel.com>,
	Matthew Auld <matthew.william.auld@gmail.com>,
	ML dri-devel <dri-devel@lists.freedesktop.org>
Subject: Re: [PATCH v8 11/12] drm/i915: Add more Haswell OA metric sets
Date: Tue, 1 Nov 2016 16:53:29 +0000	[thread overview]
Message-ID: <CAMou1-28zxdxtMZ_i+NYn8ApLyQCD9bgGRhB3OtwcVJu1aBCnQ@mail.gmail.com> (raw)
In-Reply-To: <20161101145705.GD26576@nuc-i3427.alporthouse.com>


[-- Attachment #1.1: Type: text/plain, Size: 7343 bytes --]

On Tue, Nov 1, 2016 at 2:57 PM, Chris Wilson <chris@chris-wilson.co.uk>
wrote:

> On Fri, Oct 28, 2016 at 03:14:29AM +0100, Robert Bragg wrote:
> > This adds 'compute', 'compute extended', 'memory reads', 'memory writes'
> > and 'sampler balance' metric sets for Haswell.
> >
> > The code is auto generated from an XML description of metric sets,
> > currently maintained in gputop, ref:
> >
> >  https://github.com/rib/gputop
> >  > gputop-data/oa-*.xml
> >  > scripts/i915-perf-kernelgen.py
> >
> >  $ make -C gputop-data -f Makefile.xml
> >
> > Signed-off-by: Robert Bragg <robert@sixbynine.org>
> > Reviewed-by: Matthew Auld <matthew.auld@intel.com>
> > ---
> >  drivers/gpu/drm/i915/i915_oa_hsw.c | 559 ++++++++++++++++++++++++++++++
> ++++++-
> >  1 file changed, 558 insertions(+), 1 deletion(-)
> >
> > diff --git a/drivers/gpu/drm/i915/i915_oa_hsw.c
> b/drivers/gpu/drm/i915/i915_oa_hsw.c
> > index 6af25cf..4ddf756 100644
> > --- a/drivers/gpu/drm/i915/i915_oa_hsw.c
> > +++ b/drivers/gpu/drm/i915/i915_oa_hsw.c
> > @@ -31,9 +31,14 @@
> >
> >  enum metric_set_id {
> >       METRIC_SET_ID_RENDER_BASIC = 1,
> > +     METRIC_SET_ID_COMPUTE_BASIC,
> > +     METRIC_SET_ID_COMPUTE_EXTENDED,
> > +     METRIC_SET_ID_MEMORY_READS,
> > +     METRIC_SET_ID_MEMORY_WRITES,
> > +     METRIC_SET_ID_SAMPLER_BALANCE,
> >  };
> >
> > -int i915_oa_n_builtin_metric_sets_hsw = 1;
> > +int i915_oa_n_builtin_metric_sets_hsw = 6;
> >
> >  static const struct i915_oa_reg b_counter_config_render_basic[] = {
> >       { _MMIO(0x2724), 0x00800000 },
> > @@ -112,6 +117,298 @@ get_render_basic_mux_config(struct
> drm_i915_private *dev_priv,
> >       return mux_config_render_basic;
> >  }
> >
> > +static const struct i915_oa_reg b_counter_config_compute_basic[] = {
> > +     { _MMIO(0x2710), 0x00000000 },
> > +     { _MMIO(0x2714), 0x00800000 },
> > +     { _MMIO(0x2718), 0xaaaaaaaa },
> > +     { _MMIO(0x271c), 0xaaaaaaaa },
> > +     { _MMIO(0x2720), 0x00000000 },
> > +     { _MMIO(0x2724), 0x00800000 },
> > +     { _MMIO(0x2728), 0xaaaaaaaa },
> > +     { _MMIO(0x272c), 0xaaaaaaaa },
> > +     { _MMIO(0x2740), 0x00000000 },
> > +     { _MMIO(0x2744), 0x00000000 },
> > +     { _MMIO(0x2748), 0x00000000 },
> > +     { _MMIO(0x274c), 0x00000000 },
> > +     { _MMIO(0x2750), 0x00000000 },
> > +     { _MMIO(0x2754), 0x00000000 },
> > +     { _MMIO(0x2758), 0x00000000 },
> > +     { _MMIO(0x275c), 0x00000000 },
> > +     { _MMIO(0x236c), 0x00000000 },
> > +};
> > +
> > +static const struct i915_oa_reg mux_config_compute_basic[] = {
> > +     { _MMIO(0x253a4), 0x00000000 },
> > +     { _MMIO(0x2681c), 0x01f00800 },
> > +     { _MMIO(0x26820), 0x00001000 },
> > +     { _MMIO(0x2781c), 0x01f00800 },
> > +     { _MMIO(0x26520), 0x00000007 },
> > +     { _MMIO(0x265a0), 0x00000007 },
> > +     { _MMIO(0x25380), 0x00000010 },
> > +     { _MMIO(0x2538c), 0x00300000 },
> > +     { _MMIO(0x25384), 0xaa8aaaaa },
> > +     { _MMIO(0x25404), 0xffffffff },
> > +     { _MMIO(0x26800), 0x00004202 },
> > +     { _MMIO(0x26808), 0x00605817 },
> > +     { _MMIO(0x2680c), 0x10001005 },
> > +     { _MMIO(0x26804), 0x00000000 },
> > +     { _MMIO(0x27800), 0x00000102 },
> > +     { _MMIO(0x27808), 0x0c0701e0 },
> > +     { _MMIO(0x2780c), 0x000200a0 },
> > +     { _MMIO(0x27804), 0x00000000 },
> > +     { _MMIO(0x26484), 0x44000000 },
> > +     { _MMIO(0x26704), 0x44000000 },
> > +     { _MMIO(0x26500), 0x00000006 },
> > +     { _MMIO(0x26510), 0x00000001 },
> > +     { _MMIO(0x26504), 0x88000000 },
> > +     { _MMIO(0x26580), 0x00000006 },
> > +     { _MMIO(0x26590), 0x00000020 },
> > +     { _MMIO(0x26584), 0x00000000 },
> > +     { _MMIO(0x26104), 0x55822222 },
> > +     { _MMIO(0x26184), 0xaa866666 },
> > +     { _MMIO(0x25420), 0x08320c83 },
> > +     { _MMIO(0x25424), 0x06820c83 },
> > +     { _MMIO(0x2541c), 0x00000000 },
> > +     { _MMIO(0x25428), 0x00000c03 },
> > +};
> > +
> > +static const struct i915_oa_reg *
> > +get_compute_basic_mux_config(struct drm_i915_private *dev_priv,
> > +                          int *len)
> > +{
> > +     *len = ARRAY_SIZE(mux_config_compute_basic);
> > +     return mux_config_compute_basic;
> > +}
>
> > @@ -140,6 +437,106 @@ int i915_oa_select_metric_set_hsw(struct
> drm_i915_private *dev_priv)
> >                       ARRAY_SIZE(b_counter_config_render_basic);
> >
> >               return 0;
> > +     case METRIC_SET_ID_COMPUTE_BASIC:
> > +             dev_priv->perf.oa.mux_regs =
> > +                     get_compute_basic_mux_config(dev_priv,
> > +
> &dev_priv->perf.oa.mux_regs_len);
> > +             if (!dev_priv->perf.oa.mux_regs) {
> > +                     DRM_DEBUG_DRIVER("No suitable MUX config for
> \"COMPUTE_BASIC\" metric set");
> > +
> > +                     /* EINVAL because *_register_sysfs already checked
> this
> > +                      * and so it wouldn't have been advertised so
> userspace and
> > +                      * so shouldn't have been requested
> > +                      */
> > +                     return -EINVAL;
> > +             }
> > +
> > +             dev_priv->perf.oa.b_counter_regs =
> > +                     b_counter_config_compute_basic;
> > +             dev_priv->perf.oa.b_counter_regs_len =
> > +                     ARRAY_SIZE(b_counter_config_compute_basic);
> > +
> > +             return 0;
>
> >  int
> >  i915_perf_register_sysfs_hsw(struct drm_i915_private *dev_priv)
> >  {
> > @@ -178,9 +685,49 @@ i915_perf_register_sysfs_hsw(struct
> drm_i915_private *dev_priv)
> >               if (ret)
> >                       goto error_render_basic;
> >       }
> > +     if (get_compute_basic_mux_config(dev_priv, &mux_len)) {
>
> Why not use the derived state in dev_priv->perf.oa.mux_regs? Then we
> only expose what is initialised.
>

Although for Haswell none of our metric sets have conditional MUX
configurations, the generated code should already be in shape to only
advertising metric sets applicable to the system (which becomes an issue
for gen8+). This was changed relatively recently in the gen8+ series after
Mark Janes was hitting issues on Skylake in some of his tooling due to Mesa
advertising one of the compute metric sets that wasn't really available on
the system he had, which was only discoverable as a GL error when
attempting to use it.

The perf.oa.mux_regs state only pertains to one current metric set that the
OA unit has been configured with, after calling the generated
i915_oa_select_metric_set_hsw() function in hsw_enable_metric_set(). Until
an OA stream is opened and enabled perf.oa.mux_regs won't be initialised.

Notably the recent change for gen8+ mentioned above was to have the
_select_metric_set_<gen>() code and the _register_sysfs_<gen>() code both
work in terms of the get_<metric_set>_mux_config() functions since it's
these functions that will check the fiddly sku specfic details on gen8+ to
select the right MUX config or potentially fail if the metric set isn't
available on the current system. So for gen8+ we can expect
get_compute_basic_mux_config() will fail if the config isn't available and
then won't be advertised via sysfs. On Haswell it looks a little redundant
having these get_ functions unconditionally return a pointer to a
corresponding array.

Hope that clarifies,
- Robert



> -Chris
>
> --
> Chris Wilson, Intel Open Source Technology Centre
>

[-- Attachment #1.2: Type: text/html, Size: 9799 bytes --]

[-- Attachment #2: Type: text/plain, Size: 160 bytes --]

_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/intel-gfx

  reply	other threads:[~2016-11-01 16:53 UTC|newest]

Thread overview: 31+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-10-28  2:14 [PATCH v8 00/12] Enable i915 perf stream for Haswell OA unit Robert Bragg
2016-10-28  2:14 ` [PATCH v8 01/12] ctx-pin placeholder from chris Robert Bragg
2016-10-28  2:14 ` [PATCH v8 02/12] drm/i915: Add i915 perf infrastructure Robert Bragg
2016-10-28 14:27   ` Matthew Auld
2016-10-31 16:27     ` Robert Bragg
2016-10-31 17:13       ` Matthew Auld
2016-10-31 18:54         ` Robert Bragg
2016-11-04  8:59   ` sourab gupta
2016-11-04 13:19     ` Robert Bragg
2016-11-07  8:40       ` sourab gupta
2016-10-28  2:14 ` [PATCH v8 03/12] drm/i915: rename OACONTROL GEN7_OACONTROL Robert Bragg
2016-11-02  6:35   ` sourab gupta
2016-10-28  2:14 ` [PATCH v8 04/12] drm/i915: return EACCES for check_cmd() failures Robert Bragg
2016-11-04  5:18   ` sourab gupta
2016-10-28  2:14 ` [PATCH v8 05/12] drm/i915: don't whitelist oacontrol in cmd parser Robert Bragg
2016-11-04  9:17   ` sourab gupta
2016-10-28  2:14 ` [PATCH v8 06/12] drm/i915: Add 'render basic' Haswell OA unit config Robert Bragg
2016-10-28  2:14 ` [PATCH v8 07/12] drm/i915: Enable i915 perf stream for Haswell OA unit Robert Bragg
2016-10-31 21:44   ` Matthew Auld
2016-10-28  2:14 ` [PATCH v8 08/12] drm/i915: advertise available metrics via sysfs Robert Bragg
2016-11-04  9:01   ` sourab gupta
2016-10-28  2:14 ` [PATCH v8 09/12] drm/i915: Add dev.i915.perf_stream_paranoid sysctl option Robert Bragg
2016-11-04  9:06   ` sourab gupta
2016-10-28  2:14 ` [PATCH v8 10/12] drm/i915: add oa_event_min_timer_exponent sysctl Robert Bragg
2016-11-02  6:29   ` sourab gupta
2016-11-04  0:58     ` Robert Bragg
2016-10-28  2:14 ` [PATCH v8 11/12] drm/i915: Add more Haswell OA metric sets Robert Bragg
2016-11-01 14:57   ` Chris Wilson
2016-11-01 16:53     ` Robert Bragg [this message]
2016-10-28  2:14 ` [PATCH v8 12/12] drm/i915: Add a kerneldoc summary for i915_perf.c Robert Bragg
2016-10-28  3:16 ` ✗ Fi.CI.BAT: failure for Enable i915 perf stream for Haswell OA unit Patchwork

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=CAMou1-28zxdxtMZ_i+NYn8ApLyQCD9bgGRhB3OtwcVJu1aBCnQ@mail.gmail.com \
    --to=robert@sixbynine.org \
    --cc=airlied@linux.ie \
    --cc=chris@chris-wilson.co.uk \
    --cc=daniel.vetter@intel.com \
    --cc=dri-devel@lists.freedesktop.org \
    --cc=intel-gfx@lists.freedesktop.org \
    --cc=jani.nikula@linux.intel.com \
    --cc=matthew.william.auld@gmail.com \
    --cc=sourab.gupta@intel.com \
    --cc=zhenyuw@linux.intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.