All of lore.kernel.org
 help / color / mirror / Atom feed
From: Richard Weinberger <richard.weinberger@gmail.com>
To: Ronak Desai <ronak.desai@rockwellcollins.com>
Cc: "linux-mtd @ lists . infradead . org" <linux-mtd@lists.infradead.org>
Subject: Re: Increased frequency of fastmap failure due to CRC mismatch
Date: Thu, 17 May 2018 18:46:46 +0200	[thread overview]
Message-ID: <CAFLxGvx4HVc3bAioJuwTsAxDo8ZqwvxC50_-R4Q=xocHyEqZNQ@mail.gmail.com> (raw)
In-Reply-To: <CADFuHZ787+cEaZ7ZT7YXZg4zBXUVpya2v29rUdq=FtJ7emv=cQ@mail.gmail.com>

On Thu, May 17, 2018 at 5:47 PM, Ronak Desai
<ronak.desai@rockwellcollins.com> wrote:
> On one of our units we noticed increase in fastmap failure due to
> fastmap CRC mismatch.  On this unit, on every power-up UBI observed
> fixable bit flips on a specific PEB. We are using SW ECC for ECC
> correction as the processor's NAND controller does not support the
> required ECC strength. We have also implemented read retry in the NAND
> controller driver.
>
> When UBI reads the fastmap data using NAND-MTD framework,  NAND-MTD
> subsystem returns EUCLEAN meaning there were corrections greater or
> equals to ECC strength. But the data should be corrected as the read
> call does not return any other error.
>
> In this failure scenarios, even though NAND-MTD subsystem has fixed
> the corruption with SW ECC, UBI still finds CRC mis-match on fastmap
> data. Successful data read with read retries has already been tested
> at temperature as well so there is no doubt about the reliability of
> read-retries. So, UBI should never receive corrupted data with fixable
> bit flip return code.
>
> So, would like to understand what is causing the fastamp data
> corruption which leads to CRC mis-match.  Interesting thing is we see
> fixable bit flip error message for that specific PEB on every power up
> but we don't see fastmap CRC failure on every power up. All the
> reboots are graceful (UBI partition is  detached and unmounted) and
> there are no abrupt power-cut.

Hmm, did you manually check the fastmap? I wonder what in the fastmap is wrong.
Is it just a bitlfip or are many bytes bad?

Is your mtd driver sane?

-- 
Thanks,
//richard

  reply	other threads:[~2018-05-17 16:49 UTC|newest]

Thread overview: 5+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2018-05-17 15:47 Increased frequency of fastmap failure due to CRC mismatch Ronak Desai
2018-05-17 16:46 ` Richard Weinberger [this message]
2018-05-17 16:49 ` Richard Weinberger
2018-05-18 13:50   ` Ronak Desai
2018-05-18 15:31     ` Richard Weinberger

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='CAFLxGvx4HVc3bAioJuwTsAxDo8ZqwvxC50_-R4Q=xocHyEqZNQ@mail.gmail.com' \
    --to=richard.weinberger@gmail.com \
    --cc=linux-mtd@lists.infradead.org \
    --cc=ronak.desai@rockwellcollins.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.