From: Jiri Olsa <jolsa@redhat.com>
To: Alexey Budankov <alexey.budankov@linux.intel.com>
Cc: Arnaldo Carvalho de Melo <acme@kernel.org>,
Namhyung Kim <namhyung@kernel.org>,
Alexander Shishkin <alexander.shishkin@linux.intel.com>,
Peter Zijlstra <peterz@infradead.org>,
Ingo Molnar <mingo@redhat.com>, Andi Kleen <ak@linux.intel.com>,
linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: [PATCH v5 07/10] perf record: implement -z,--compression_level=n option and compression
Date: Fri, 8 Mar 2019 11:46:57 +0100 [thread overview]
Message-ID: <20190308104657.GA21500@krava> (raw)
In-Reply-To: <002e7e10-b0ef-df2a-261c-88fd9c00364d@linux.intel.com>
On Thu, Mar 07, 2019 at 06:26:47PM +0300, Alexey Budankov wrote:
>
> On 07.03.2019 15:14, Jiri Olsa wrote:
> > On Thu, Mar 07, 2019 at 11:39:46AM +0300, Alexey Budankov wrote:
> >>
> >> On 05.03.2019 15:25, Jiri Olsa wrote:
> >>> On Fri, Mar 01, 2019 at 06:58:32PM +0300, Alexey Budankov wrote:
> >>>
> >>> SNIP
> >>>
> >>>>
> >>>> /*
> >>>> * Increment md->refcount to guard md->data[idx] buffer
> >>>> @@ -350,7 +357,7 @@ int perf_mmap__aio_push(struct perf_mmap *md, void *to, int idx,
> >>>> md->prev = head;
> >>>> perf_mmap__consume(md);
> >>>>
> >>>> - rc = push(to, &md->aio.cblocks[idx], md->aio.data[idx], size0 + size, *off);
> >>>> + rc = push(to, md->aio.data[idx], size0 + size, *off, &md->aio.cblocks[idx]);
> >>>> if (!rc) {
> >>>> *off += size0 + size;
> >>>> } else {
> >>>> @@ -556,13 +563,15 @@ int perf_mmap__read_init(struct perf_mmap *map)
> >>>> }
> >>>>
> >>>> int perf_mmap__push(struct perf_mmap *md, void *to,
> >>>> - int push(struct perf_mmap *map, void *to, void *buf, size_t size))
> >>>> + int push(struct perf_mmap *map, void *to, void *buf, size_t size),
> >>>> + perf_mmap__compress_fn_t compress, void *comp_data)
> >>>> {
> >>>> u64 head = perf_mmap__read_head(md);
> >>>> unsigned char *data = md->base + page_size;
> >>>> unsigned long size;
> >>>> void *buf;
> >>>> int rc = 0;
> >>>> + size_t mmap_len = perf_mmap__mmap_len(md);
> >>>>
> >>>> rc = perf_mmap__read_init(md);
> >>>> if (rc < 0)
> >>>> @@ -574,7 +583,10 @@ int perf_mmap__push(struct perf_mmap *md, void *to,
> >>>> buf = &data[md->start & md->mask];
> >>>> size = md->mask + 1 - (md->start & md->mask);
> >>>> md->start += size;
> >>>> -
> >>>> + if (compress) {
> >>>> + size = compress(comp_data, md->data, mmap_len, buf, size);
> >>>> + buf = md->data;
> >>>> + }
> >>>> if (push(md, to, buf, size) < 0) {
> >>>> rc = -1;
> >>>> goto out;
> >>>
> >>> when we discussed the compress callback should be another layer
> >>> in perf_mmap__push I was thinking more of the layered/fifo design,
> >>> like:
> >>>
> >>> normaly we call:
> >>>
> >>> perf_mmap__push(... push = record__pushfn ...)
> >>> -> reads mmap data and calls push(data), which translates as:
> >>>
> >>> record__pushfn(data);
> >>> - which stores the data
> >>>
> >>>
> >>> for compressed it'd be:
> >>>
> >>> perf_mmap__push(... push = compressed_push ...)
> >>>
> >>> -> reads mmap data and calls push(data), which translates as:
> >>>
> >>> compressed_push(data)
> >>> -> reads data, compresses them and calls, next push callback in line:
> >>>
> >>> record__pushfn(data)
> >>> - which stores the data
> >>>
> >>>
> >>> there'd need to be the logic for compressed_push to
> >>> remember the 'next push' function
> >>
> >> That is suboptimal for AIO. Also compression is an independent operation that
> >> could be applied on any of push stages you mean.
> >
> > not sure what you mean by suboptimal, but I think
> > that it can still happen in subsequent push callback
> >
> >>
> >>>
> >>> but I think this was the original idea behind the
> >>> perf_mmap__push -> it gets the data and pushes them for
> >>> the next processing.. it should stay as simple as that
> >>
> >> Agree on keeping simplicity and, at the moment, there is no any push to the next
> >> processing in the code so provided implementation fits as for serial as for AIO
> >> at the same time sticking to simplicity as much as possibly. If you see something
> >> that would fit better please speak up and share.
> >
> > I have to insist that perf_mmap__push stays untouched
> > and we do other processing in the push callbacks
>
> What is about perf_mmap__aio_push()?
>
> Without compression it does
> memcpy(), memcpy(), aio_push()
>
> With compression its does
> memcpy_with_compression(), memcpy_with_compression(), aio_push()
so to be on the same page.. normal processing without compression is:
perf_mmap__push does:
push(mmap buf)
record__pushfn
record__write
write(buf)
perf_mmap__aio_push does:
memcpy(aio buf, mmap buf)
push(aio buf)
record__aio_pushfn
record__aio_write
aio_write(aio buf)
and for compression it would be:
perf_mmap__push does:
push(mmap buf)
compress_push
memcpy(compress buffer, mmapbuf) EXTRA copy
record__pushfn
record__write
write(buf)
perf_mmap__aio_push does:
memcpy(aio buf, mmap buf)
memcpy(compress buffer, mmapbuf) EXTRA copy
push(aio buf)
record__aio_pushfn
record__aio_write
aio_write(aio buf)
side note: that actualy makes me think why do we even have perf_mmap__aio_push,
it looks like we could copy the buf in the callback push function with no harm?
so.. there's one extra memcpy for compression, is it right?
I might miss some part which makes this scheme unusable..
thanks,
jirka
next prev parent reply other threads:[~2019-03-08 10:47 UTC|newest]
Thread overview: 52+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-03-01 15:27 [PATCH v5 00/10] perf: enable compression of record mode trace to save storage space Alexey Budankov
2019-03-01 15:41 ` [PATCH v5 02/10] perf record: implement -f,--mmap-flush=<threshold> option Alexey Budankov
2019-03-05 12:25 ` Jiri Olsa
2019-03-07 8:28 ` Alexey Budankov
2019-03-05 12:25 ` Jiri Olsa
2019-03-07 8:28 ` Alexey Budankov
2019-03-05 12:26 ` Jiri Olsa
2019-03-07 8:42 ` Alexey Budankov
2019-03-07 8:54 ` Alexey Budankov
2019-03-01 15:43 ` [PATCH v5 03/10] perf session: define bytes_transferred and bytes_compressed metrics Alexey Budankov
2019-03-05 12:26 ` Jiri Olsa
2019-03-07 8:29 ` Alexey Budankov
2019-03-11 8:19 ` Alexey Budankov
2019-03-11 12:33 ` Jiri Olsa
2019-03-11 13:41 ` Alexey Budankov
2019-03-01 15:46 ` [PATCH v5 04/10] perf record: implement COMPRESSED event record and its attributes Alexey Budankov
2019-03-01 15:52 ` [PATCH v5 06/10] perf util: introduce Zstd based streaming compression API Alexey Budankov
2019-03-05 12:26 ` Jiri Olsa
2019-03-07 8:29 ` Alexey Budankov
2019-03-01 15:58 ` [PATCH v5 07/10] perf record: implement -z,--compression_level=n option and compression Alexey Budankov
2019-03-05 0:01 ` Andi Kleen
2019-03-05 9:19 ` Alexey Budankov
2019-03-05 12:25 ` Jiri Olsa
2019-03-07 8:39 ` Alexey Budankov
2019-03-07 12:14 ` Jiri Olsa
2019-03-07 15:26 ` Alexey Budankov
2019-03-07 15:56 ` Alexey Budankov
2019-03-08 10:46 ` Jiri Olsa [this message]
2019-03-10 15:55 ` Alexey Budankov
2019-03-10 16:17 ` Alexey Budankov
2019-03-11 10:56 ` Jiri Olsa
2019-03-05 12:25 ` Jiri Olsa
2019-03-07 8:26 ` Alexey Budankov
2019-03-05 12:26 ` Jiri Olsa
2019-03-07 8:26 ` Alexey Budankov
2019-03-07 11:59 ` Jiri Olsa
2019-03-07 14:51 ` Alexey Budankov
2019-03-05 12:26 ` Jiri Olsa
2019-03-07 8:26 ` Alexey Budankov
2019-03-05 12:26 ` Jiri Olsa
2019-03-07 8:26 ` Alexey Budankov
2019-03-01 16:06 ` [PATCH v5 08/10] perf report: implement record trace decompression Alexey Budankov
2019-03-05 12:25 ` Jiri Olsa
2019-03-07 8:27 ` Alexey Budankov
2019-03-01 16:07 ` [PATCH v5 09/10] perf inject: enable COMPRESSED records decompression Alexey Budankov
2019-03-05 12:26 ` Jiri Olsa
2019-03-07 8:28 ` Alexey Budankov
2019-03-01 16:09 ` [PATCH v5 10/10] perf tests: implement Zstd comp/decomp integration test Alexey Budankov
2019-03-05 12:25 ` Jiri Olsa
2019-03-07 8:29 ` Alexey Budankov
2019-03-01 16:37 ` [PATCH v5 05/10] perf mmap: implement dedicated memory buffer for data compression Alexey Budankov
2019-03-01 16:38 ` [PATCH v5 01/10] feature: implement libzstd check, LIBZSTD_DIR and NO_LIBZSTD defines Alexey Budankov
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20190308104657.GA21500@krava \
--to=jolsa@redhat.com \
--cc=acme@kernel.org \
--cc=ak@linux.intel.com \
--cc=alexander.shishkin@linux.intel.com \
--cc=alexey.budankov@linux.intel.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@redhat.com \
--cc=namhyung@kernel.org \
--cc=peterz@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).