All of lore.kernel.org
 help / color / mirror / Atom feed
From: Vladimir Sementsov-Ogievskiy <vsementsov@virtuozzo.com>
To: Kevin Wolf <kwolf@redhat.com>
Cc: qemu block <qemu-block@nongnu.org>,
	qemu-devel <qemu-devel@nongnu.org>, Max Reitz <mreitz@redhat.com>,
	"Denis V. Lunev" <den@openvz.org>, Eric Blake <eblake@redhat.com>
Subject: Re: What prevents discarding a cluster during rewrite?
Date: Tue, 23 Feb 2021 18:23:07 +0300	[thread overview]
Message-ID: <e7480b84-bda0-c280-603e-38ba176c44c9@virtuozzo.com> (raw)
In-Reply-To: <20210223103534.GD5083@merkur.fritz.box>



On 23.02.2021 13:35, Kevin Wolf wrote:
> Am 22.02.2021 um 22:42 hat Vladimir Sementsov-Ogievskiy geschrieben:
>> 23.02.2021 00:30, Vladimir Sementsov-Ogievskiy wrote:
>>> Thinking of how to prevent dereferencing to zero (and discard) of
>>> host cluster during flush of compressed cache (which I'm working on
>>> now), I have a question.. What prevents it for normal writes?
>>
>> I have no idea about why didn't it fail for years.. May be, I'm
>> missing something?
> 
> Ouch. I suppose the reason why we never ran into it is that afaik Linux
> drains the queue before sending discard requests.
> 
> Of course, we still need to protect against this in QEMU. We can't just
> unref a cluster that is still in use.
> 
>> I have idea of fixing: increase the refcount of host cluster before
>> write to data_file (it's important to increase refacount in same
>> s->lock critical section where we get host_offset) and dereference it
>> after write.. It should help. Any thoughts?
> 
> It would cause metadata updates for rewrites. I don't know whether the
> intermediate value would ever be written to disk, but at least we'd
> rewrite the same refcounts as before. I don't think we want that.
> 

Hmm, if that can provoke extra refcount cache flush that's bad..

May be we need something like of additional "dynamic" refcounts, so
that total_refcount = normal_refcount + dynamic_refcount.. And for
in-flight clusters dynamic_refcount is > 0. We can store dynamic
refcounts in GHashTable(cluster -> refcount).



> Discard requests might be rare enough that not considering the cluster
> at all could work. We could then take a reader CoRwlock during most
> operations, and a writer lock for discard.
> 
> Actually, maybe s->lock should be this CoRwlock, and instead of dropping
> it temporarily like we do now we would just upgrade and downgrade it as
> needed. Maybe this would allow finer grained locking in other places,
> too.

In this case many operations will be blocked during data writing, like 
allocation of another cluster.. That doesn't seem good. Separate 
CoRwLock looks more feasible.

--
Best regards,
Vladimir


  reply	other threads:[~2021-02-23 15:24 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-02-22 21:30 What prevents discarding a cluster during rewrite? Vladimir Sementsov-Ogievskiy
2021-02-22 21:42 ` Vladimir Sementsov-Ogievskiy
2021-02-23 10:35   ` Kevin Wolf
2021-02-23 15:23     ` Vladimir Sementsov-Ogievskiy [this message]
2021-02-23 16:29       ` Kevin Wolf
2021-02-24  6:04         ` Vladimir Sementsov-Ogievskiy

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=e7480b84-bda0-c280-603e-38ba176c44c9@virtuozzo.com \
    --to=vsementsov@virtuozzo.com \
    --cc=den@openvz.org \
    --cc=eblake@redhat.com \
    --cc=kwolf@redhat.com \
    --cc=mreitz@redhat.com \
    --cc=qemu-block@nongnu.org \
    --cc=qemu-devel@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.