All of lore.kernel.org
 help / color / mirror / Atom feed
From: Vaibhav Jain <vaibhav@linux.ibm.com>
To: "Dan Williams" <dan.j.williams@intel.com>,
	"Michal Suchánek" <msuchanek@suse.de>
Cc: Alastair D'Silva <alastair@d-silva.org>,
	"Aneesh Kumar K.V" <aneesh.kumar@linux.ibm.com>,
	linux-nvdimm <linux-nvdimm@lists.01.org>
Subject: Re: Feedback requested: Exposing NVDIMM performance statistics in a generic way
Date: Mon, 23 Nov 2020 12:51:33 +0530	[thread overview]
Message-ID: <87k0ucfshu.fsf@vajain21.in.ibm.com> (raw)
In-Reply-To: <CAPcyv4hTYQ8upuDd0RqUzBtSqjBr4rJz0eaceUmr4b=XeXqs-A@mail.gmail.com>

Hi Dan and Michal,

I have posted an RFC patch to implment the kernel side interface for
this in libnvdimm with an implementation in papr-scm driver module at [1]. Can
you please take a look at the patch seried and provide your inputs.

[1] https://lore.kernel.org/linux-nvdimm/20201108211549.122018-1-vaibhav@linux.ibm.com/

Thanks,
~ Vaibhav

Dan Williams <dan.j.williams@intel.com> writes:

> On Fri, Oct 23, 2020 at 10:28 AM Michal Suchánek <msuchanek@suse.de> wrote:
>>
>> Hello,
>>
>> On Thu, May 28, 2020 at 11:59 AM Vaibhav Jain <vaibhav@linux.ibm.com> wrote:
>> >
>> > Thanks for this taking time to look into this Dan,
>> >
>> > Agree with the points you have made earlier that I am summarizing below:
>> >
>> > * This is better done in ndctl rather than ipmctl.
>> > * Should only expose general performance metrics and not performance
>> >   counters. Performance counter should be exposed via perf
>> > * Vendor specific metrics to be separated from generic performance
>> >   metrics.
>> >
>> > One way to split generic and vendor specific metrics might be to report
>> > generic performance metrics together with dimm health metrics such as
>> > "temprature_celsius" or "spares_percentage" that are already reported in
>> > by dimm health output.
>> >
>> > Vendor specific performance metrics can be reported as a seperate object
>> > in the json output. Something similar to output below:
>> >
>> > # ndctl list -DH --stats --vendor-stats
>> > [
>> >   {
>> >     "dev":"nmem0",
>> >     "health":{
>> >       "health_state":"ok",
>> >       "shutdown_state":"clean",
>> >       "temperature_celsius":48.00,
>> >       "spares_percentage":10,
>> >
>> >       /* Generic performance metrics/stats */
>> >       "TotalMediaReads": 18929,
>> >       "TotalMediaWrites": 0,
>> >       ....
>> >     }
>> >
>> >     /* Vendor specific stats for the dimm */
>> >     "vendor-stats": {
>> >     "Controller Reset Count":10
>> >     "Controller Reset Elapsed Time": 3600
>> >     "Power-on Seconds": 3600
>>
>> How do you tell generic from vendor-specific stats, though?
>>
>> Controller reset count and power-on time may not be reported by some
>> controllers but sound pretty generic.
>>
>> Even if you declare that the stats reported by all controllers
>> available at this moment are generic a later one may not report some of
>> these 'generic' statistics, or report them in different way/units, or
>> may simply not report anything at all for some technical reason.
>>
>> Kernels that do not have this feature will not report anything at all
>> either.
>
> My expectation is that for a given json attribute name any vendor
> backend that supports it must convey it in a compatible way. If a
> given attribute does not make sense for a given vendor, or is not yet
> implemented then leaving it unpopulated is indeed the expectation.
>
> The goal is to both minimize vendor specific logic in infrastructure
> that consumes the ndctl json while at the same time balance vendor
> needs. In other words avoid "needless" differentiation as much as
> possible with small amount of compat work across vendors.
_______________________________________________
Linux-nvdimm mailing list -- linux-nvdimm@lists.01.org
To unsubscribe send an email to linux-nvdimm-leave@lists.01.org

      reply	other threads:[~2020-11-23  7:21 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2020-05-25  9:00 Feedback requested: Exposing NVDIMM performance statistics in a generic way Vaibhav Jain
2020-05-27 19:24 ` Dan Williams
2020-05-28  0:55   ` Dan Williams
2020-05-28 18:59     ` Vaibhav Jain
2020-05-28 22:40       ` Dan Williams
2020-10-23 17:28       ` Michal Suchánek
2020-10-23 19:03         ` Dan Williams
2020-11-23  7:21           ` Vaibhav Jain [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87k0ucfshu.fsf@vajain21.in.ibm.com \
    --to=vaibhav@linux.ibm.com \
    --cc=alastair@d-silva.org \
    --cc=aneesh.kumar@linux.ibm.com \
    --cc=dan.j.williams@intel.com \
    --cc=linux-nvdimm@lists.01.org \
    --cc=msuchanek@suse.de \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.