linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Does the discard bug introduced in 4.19 by 744889b7cbb56a6 and fixed by 1adfc5e4136f5967 have potential for fstrim-caused corruption?
@ 2018-11-10  8:06 Vito Caputo
  2018-11-12  1:22 ` Ming Lei
  0 siblings, 1 reply; 2+ messages in thread
From: Vito Caputo @ 2018-11-10  8:06 UTC (permalink / raw)
  To: linux-kernel; +Cc: ming.lei, snitzer, hch, xni, mariusz.dabrowski, axboe

I ask because I recently performed some fstrims on my 4.19-running
laptop after a good house cleaning, and things started going rather
haywire today at the filesystem level, on different filesystems of
differing types (ext2 and ext4) but sharing the same underlying lvm
pv+dmcrypt on an SSD.

At the time I needed to get work done and couldn't investigate.  So I
crossed my fingers and rebooted back into 4.18, let some ugly fscks
complete, and carried on like nothing happened.

This all seems to be a software bug, not a genuine SSD failure of any
sort.

My filesystems are now all in a questionable state, though the machine
seems usable enough for the moment while I prepare for rebuilding.

Should I assume this was all caused by the fstrims and 744889b7cbb56a6?

Or does it not explain how this happened and we may have a different
problem on our hands?

I had been running 4.19-rc[178], but am uncertain if I performed fstrims
in any of those, I definitely did in 4.18 though and had no trouble.

744889b7cbb56a6 is especially suspect to me as it was in none of the
4.19 RCs but for some reason landed in 4.19 to fix a22c4d7e34402ccdf3,
even though a22c4d7e34402ccdf3 has been there since 4.3, that's left a
bad taste in my mouth but maybe I'm missing something.

Thanks,
Vito Caputo

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: Does the discard bug introduced in 4.19 by 744889b7cbb56a6 and fixed by 1adfc5e4136f5967 have potential for fstrim-caused corruption?
  2018-11-10  8:06 Does the discard bug introduced in 4.19 by 744889b7cbb56a6 and fixed by 1adfc5e4136f5967 have potential for fstrim-caused corruption? Vito Caputo
@ 2018-11-12  1:22 ` Ming Lei
  0 siblings, 0 replies; 2+ messages in thread
From: Ming Lei @ 2018-11-12  1:22 UTC (permalink / raw)
  To: Vito Caputo; +Cc: linux-kernel, snitzer, hch, xni, mariusz.dabrowski, axboe

On Sat, Nov 10, 2018 at 12:06:35AM -0800, Vito Caputo wrote:
> I ask because I recently performed some fstrims on my 4.19-running
> laptop after a good house cleaning, and things started going rather
> haywire today at the filesystem level, on different filesystems of
> differing types (ext2 and ext4) but sharing the same underlying lvm
> pv+dmcrypt on an SSD.

Could you share us your test case? And what is the exact issue triggered
in your system?

> 
> At the time I needed to get work done and couldn't investigate.  So I
> crossed my fingers and rebooted back into 4.18, let some ugly fscks
> complete, and carried on like nothing happened.
> 
> This all seems to be a software bug, not a genuine SSD failure of any
> sort.
> 
> My filesystems are now all in a questionable state, though the machine
> seems usable enough for the moment while I prepare for rebuilding.
> 
> Should I assume this was all caused by the fstrims and 744889b7cbb56a6?

You may confirm that by revert 744889b7cbb56a6 and see if your issue is
gone.

However, 1adfc5e4136f5967d is supposed to fix 744889b7cbb56a6, so I
strongly suggest you to can verify 1adfc5e4136f5967d.

Thanks,
Ming

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2018-11-12  1:22 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-11-10  8:06 Does the discard bug introduced in 4.19 by 744889b7cbb56a6 and fixed by 1adfc5e4136f5967 have potential for fstrim-caused corruption? Vito Caputo
2018-11-12  1:22 ` Ming Lei

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).