From: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
To: Max Reitz <mreitz@redhat.com>, qemu-block@nongnu.org
Cc: qemu-devel@nongnu.org, crosa@redhat.com, ehabkost@redhat.com,
kwolf@redhat.com, jsnow@redhat.com
Subject: Re: [PATCH v3 6/6] block/qcow2: use seqcache for compressed writes
Date: Mon, 29 Mar 2021 23:18:46 +0300 [thread overview]
Message-ID: <8c702bb9-416f-ada9-4691-22f37d82b2b8@virtuozzo.com> (raw)
In-Reply-To: <e85d05f3-5500-9a55-0bd5-ceb581c27ef7@redhat.com>
12.03.2021 21:15, Max Reitz wrote:
> On 05.03.21 18:35, Vladimir Sementsov-Ogievskiy wrote:
>> Compressed writes are unaligned to 512, which works very slow in
>> O_DIRECT mode. Let's use the cache.
>>
>> Signed-off-by: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
>> ---
>> block/coroutines.h | 3 +
>> block/qcow2.h | 4 ++
>> block/qcow2-refcount.c | 10 +++
>> block/qcow2.c | 158 ++++++++++++++++++++++++++++++++++++++---
>> 4 files changed, 164 insertions(+), 11 deletions(-)
>>
[..]
>> +static int coroutine_fn
>> +qcow2_co_compressed_flush_one(BlockDriverState *bs, bool unfinished)
>> +{
>> + BDRVQcow2State *s = bs->opaque;
>> + int ret;
>> + int64_t align = 1;
>> + int64_t offset, bytes;
>> + uint8_t *buf;
>> +
>> + if (!s->compressed_cache) {
>> + return 0;
>> + }
>> +
>> + if (!seqcache_get_next_flush(s->compressed_cache, &offset, &bytes, &buf,
>> + &unfinished))
>> + {
>> + return 0;
>> + }
>> +
>> + qcow2_inflight_writes_inc(bs, offset, bytes);
>> +
>> + /*
>> + * Don't try to align-up unfinished cached cluster, parallel write to it is
>> + * possible! For finished host clusters data in the tail of the cluster will
>> + * be never used. So, take some good alignment for speed.
>> + *
>> + * Note also, that seqcache guarantees that allocated size of @buf is enough
>> + * to fill the cluster up to the end, so we are safe to align up with
>> + * align <= cluster_size.
>> + */
>> + if (!unfinished) {
>> + align = MIN(s->cluster_size,
>> + MAX(s->data_file->bs->bl.request_alignment,
>> + bdrv_opt_mem_align(bs)));
>> + }
>> +
>
> I’d move the pre-write overlap check here, because its purpose is to prevent writes to metadata structures as they are happening, not as they are queued into a cache. (I’d say.)
Hmm. pre-write overlap check usually done under mutex.. Should I add an additional critical section to do overlap check? I'm not sure.
>
>> + BLKDBG_EVENT(s->data_file, BLKDBG_WRITE_COMPRESSED);
>> + ret = bdrv_co_pwrite(s->data_file, offset,
>> + QEMU_ALIGN_UP(offset + bytes, align) - offset,
>
> I remember you said you wanted to describe more of the properties of the buffer returned by seqcache_get_next_flush(). That would be nice here, because intuitively one would assume that the buffer is @bytes bytes long, which aligning here will exceed. (It’s fine, but it would be nicer if there was a comment that assured that it’s fine.)
>
It's here, read several lines above: "Note also, that..."
>> + buf, 0);
>> + if (ret < 0 && s->compressed_flush_ret == 0) {
>> + /*
>> + * The data that was "written" earlier is lost. We don't want to
>> + * care with storing it somehow to retry flushing later.
--
Best regards,
Vladimir
prev parent reply other threads:[~2021-03-29 20:19 UTC|newest]
Thread overview: 35+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-03-05 17:35 [PATCH v3 0/6] qcow2: compressed write cache Vladimir Sementsov-Ogievskiy
2021-03-05 17:35 ` [PATCH v3 1/6] block-jobs: flush target at the end of .run() Vladimir Sementsov-Ogievskiy
2021-03-11 16:57 ` Max Reitz
2021-03-05 17:35 ` [PATCH v3 2/6] iotests: add qcow2-discard-during-rewrite Vladimir Sementsov-Ogievskiy
2021-03-05 17:35 ` [PATCH v3 3/6] block/qcow2: introduce inflight writes counters: fix discard Vladimir Sementsov-Ogievskiy
2021-03-11 19:58 ` Max Reitz
2021-03-12 9:09 ` Vladimir Sementsov-Ogievskiy
2021-03-12 11:17 ` Max Reitz
2021-03-12 12:32 ` Vladimir Sementsov-Ogievskiy
2021-03-12 12:42 ` Vladimir Sementsov-Ogievskiy
2021-03-12 15:01 ` Max Reitz
2021-03-12 12:46 ` Vladimir Sementsov-Ogievskiy
2021-03-12 15:10 ` Max Reitz
2021-03-12 15:24 ` Vladimir Sementsov-Ogievskiy
2021-03-12 15:52 ` Max Reitz
2021-03-12 16:03 ` Vladimir Sementsov-Ogievskiy
2021-03-12 14:58 ` Max Reitz
2021-03-12 15:39 ` Vladimir Sementsov-Ogievskiy
2021-03-05 17:35 ` [PATCH v3 4/6] util: implement seqcache Vladimir Sementsov-Ogievskiy
2021-03-12 13:41 ` Max Reitz
2021-03-12 14:37 ` Vladimir Sementsov-Ogievskiy
2021-03-12 15:13 ` Max Reitz
2021-06-04 14:31 ` Vladimir Sementsov-Ogievskiy
2021-03-05 17:35 ` [PATCH v3 5/6] block-coroutine-wrapper: allow non bdrv_ prefix Vladimir Sementsov-Ogievskiy
2021-03-12 16:53 ` Max Reitz
2021-03-05 17:35 ` [PATCH v3 6/6] block/qcow2: use seqcache for compressed writes Vladimir Sementsov-Ogievskiy
2021-03-12 18:15 ` Max Reitz
2021-03-12 18:43 ` Vladimir Sementsov-Ogievskiy
2021-03-15 9:58 ` Max Reitz
2021-03-15 14:40 ` Vladimir Sementsov-Ogievskiy
2021-03-16 12:25 ` Max Reitz
2021-03-16 17:48 ` Vladimir Sementsov-Ogievskiy
2021-03-17 8:09 ` Max Reitz
2021-03-12 18:45 ` Vladimir Sementsov-Ogievskiy
2021-03-29 20:18 ` Vladimir Sementsov-Ogievskiy [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=8c702bb9-416f-ada9-4691-22f37d82b2b8@virtuozzo.com \
--to=vsementsov@virtuozzo.com \
--cc=crosa@redhat.com \
--cc=ehabkost@redhat.com \
--cc=jsnow@redhat.com \
--cc=kwolf@redhat.com \
--cc=mreitz@redhat.com \
--cc=qemu-block@nongnu.org \
--cc=qemu-devel@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).