From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752656Ab0LLNLv (ORCPT ); Sun, 12 Dec 2010 08:11:51 -0500 Received: from mail-bw0-f45.google.com ([209.85.214.45]:51226 "EHLO mail-bw0-f45.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751099Ab0LLNLu convert rfc822-to-8bit (ORCPT ); Sun, 12 Dec 2010 08:11:50 -0500 MIME-Version: 1.0 In-Reply-To: <20101212124354.GA4513@thunk.org> References: <1291945065-sup-1838@think> <20101210023852.GB3059@thunk.org> <20101212023415.GG3059@thunk.org> <20101212124354.GA4513@thunk.org> From: Jon Nelson Date: Sun, 12 Dec 2010 07:11:28 -0600 Message-ID: Subject: Re: hunt for 2.6.37 dm-crypt+ext4 corruption? (was: Re: dm-crypt barrier support is effective) To: "Ted Ts'o" , Jon Nelson , Matt , Chris Mason , Andi Kleen , Mike Snitzer , Milan Broz , linux-btrfs , dm-devel , Linux Kernel , htd , htejun , linux-ext4 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8BIT Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sun, Dec 12, 2010 at 6:43 AM, Ted Ts'o wrote: > On Sun, Dec 12, 2010 at 04:18:29AM -0600, Jon Nelson wrote: >> > I have one CPU configured in the environment, 512MB of memory. >> > I have not done any memory-constriction tests whatsoever. > > I've finally been able to reproduce it myself, on real hardware.  SMP > is not necessary to reproduce it, although having more than one CPU > doesn't hurt.  What I did need to do (on real hardware with 4 gigs of > memory) was to turn off swap and pin enough memory so that free memory > was around 200megs or so before the start of the test.  (This is the > natural amount of free memory that the system would try to reach, > since 200 megs is about 5% of 4 gigs.) > > Then, during the test, free memory would drop to 50-70 megabytes, > forcing writeback to run, and then I could trigger it about 1-2 times > out of three. > > I'm guessing that when you used 512mb of memory, that was in effect a > memory-constriction test, and if you were to push the memory down a > little further, it might reproduce even more quickly.  My next step is > to try to reproduce this in a VM, and then I can start probing to see > what might be going on. I'm glad you've been able to reproduce the problem! If you should need any further assistance, please do not hesitate to ask. -- Jon