All of lore.kernel.org
 help / color / mirror / Atom feed
From: Eric Wheeler <bcache@lists.ewheeler.net>
To: Coly Li <colyli@suse.de>
Cc: linux-bcache@vger.kernel.org, mlyle@lyle.org,
	linux-block@vger.kernel.org,
	Kent Overstreet <kent.overstreet@gmail.com>,
	Nix <nix@esperi.org.uk>, Kai Krakow <hurikhan77@gmail.com>,
	Eric Wheeler <bcache@lists.ewheeler.net>,
	Junhui Tang <tang.junhui@zte.com.cn>,
	stable@vger.kernel.org
Subject: Re: [PATCHv4] bcache: only permit to recovery read error when cache device is clean
Date: Fri, 27 Oct 2017 19:57:05 +0000 (UTC)	[thread overview]
Message-ID: <alpine.LRH.2.11.1710271955340.24585@mail.ewheeler.net> (raw)
In-Reply-To: <20171016180428.123657-1-colyli@suse.de>

On Tue, 17 Oct 2017, Coly Li wrote:

> When bcache does read I/Os, for example in writeback or writethrough mode,
> if a read request on cache device is failed, bcache will try to recovery
> the request by reading from cached device. If the data on cached device is
> not synced with cache device, then requester will get a stale data.
> 
> For critical storage system like database, providing stale data from
> recovery may result an application level data corruption, which is
> unacceptible.
> 
> With this patch, for a failed read request in writeback or writethrough
> mode, recovery a recoverable read request only happens when cache device
> is clean. That is to say, all data on cached device is up to update.

Can this be relaxed to only error when the key failing to read is dirty?  
The liklihood of a 100% clean cache in writeback on a busy system seems 
unlikely.

Can KEY_DIRTY facilitate this?


--
Eric Wheeler



> 
> For other cache modes in bcache, read request will never hit
> cached_dev_read_error(), they don't need this patch.
> 
> Please note, because cache mode can be switched arbitrarily in run time, a
> writethrough mode might be switched from a writeback mode. Therefore
> checking dc->has_data in writethrough mode still makes sense.
> 
> Changelog:
> V4: Fix parens error pointed by Michael Lyle.
> v3: By response from Kent Oversteet, he thinks recovering stale data is a
>     bug to fix, and option to permit it is unneccessary. So this version
>     the sysfs file is removed.
> v2: rename sysfs entry from allow_stale_data_on_failure  to
>     allow_stale_data_on_failure, and fix the confusing commit log.
> v1: initial patch posted.
> 
> Signed-off-by: Coly Li <colyli@suse.de>
> Reported-by: Arne Wolf <awolf@lenovo.com>
> Acked-by: Michael Lyle <mlyle@lyle.org>
> Cc: Kent Overstreet <kent.overstreet@gmail.com>
> Cc: Nix <nix@esperi.org.uk>
> Cc: Kai Krakow <hurikhan77@gmail.com>
> Cc: Eric Wheeler <bcache@lists.ewheeler.net>
> Cc: Junhui Tang <tang.junhui@zte.com.cn>
> Cc: stable@vger.kernel.org
> ---
>  drivers/md/bcache/request.c | 10 +++++++++-
>  1 file changed, 9 insertions(+), 1 deletion(-)
> 
> diff --git a/drivers/md/bcache/request.c b/drivers/md/bcache/request.c
> index 681b4f12b05a..e7f769ff7234 100644
> --- a/drivers/md/bcache/request.c
> +++ b/drivers/md/bcache/request.c
> @@ -697,8 +697,16 @@ static void cached_dev_read_error(struct closure *cl)
>  {
>  	struct search *s = container_of(cl, struct search, cl);
>  	struct bio *bio = &s->bio.bio;
> +	struct cached_dev *dc = container_of(s->d, struct cached_dev, disk);
>  
> -	if (s->recoverable) {
> +	/*
> +	 * If cache device is dirty (dc->has_dirty is non-zero), then
> +	 * recovery a failed read request from cached device may get a
> +	 * stale data back. So read failure recovery is only permitted
> +	 * when cache device is clean.
> +	 */
> +	if (s->recoverable &&
> +	    (dc && !atomic_read(&dc->has_dirty))) {
>  		/* Retry from the backing device: */
>  		trace_bcache_read_retry(s->orig_bio);
>  
> -- 
> 2.13.6
> 
> --
> To unsubscribe from this list: send the line "unsubscribe linux-bcache" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 

  reply	other threads:[~2017-10-27 19:57 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-10-16 18:04 [PATCHv4] bcache: only permit to recovery read error when cache device is clean Coly Li
2017-10-27 19:57 ` Eric Wheeler [this message]
2017-10-27 20:00   ` Michael Lyle
2017-10-27 21:13     ` Eric Wheeler
2017-10-27 21:24       ` Michael Lyle
2017-10-27 23:31         ` Eric Wheeler
2017-10-27 23:31           ` Eric Wheeler
2017-10-27 23:36           ` Michael Lyle

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=alpine.LRH.2.11.1710271955340.24585@mail.ewheeler.net \
    --to=bcache@lists.ewheeler.net \
    --cc=colyli@suse.de \
    --cc=hurikhan77@gmail.com \
    --cc=kent.overstreet@gmail.com \
    --cc=linux-bcache@vger.kernel.org \
    --cc=linux-block@vger.kernel.org \
    --cc=mlyle@lyle.org \
    --cc=nix@esperi.org.uk \
    --cc=stable@vger.kernel.org \
    --cc=tang.junhui@zte.com.cn \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.