From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1758607Ab2BJJDg (ORCPT ); Fri, 10 Feb 2012 04:03:36 -0500 Received: from cantor2.suse.de ([195.135.220.15]:36089 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753419Ab2BJJDc (ORCPT ); Fri, 10 Feb 2012 04:03:32 -0500 Date: Fri, 10 Feb 2012 10:03:28 +0100 From: Jan Kara To: Jamie Lokier Cc: Alan Stern , Pavel Machek , "Rafael J. Wysocki" , Linux PM list , LKML , Jan Kara , linux-fsdevel@vger.kernel.org, Dave Chinner , Nigel Cunningham , "Srivatsa S. Bhat" Subject: Re: [RFC][PATCH] PM / Sleep: Freeze filesystems during system suspend/hibernation Message-ID: <20120210090328.GA10509@quack.suse.cz> References: <20120201133642.GC3673@elf.ucw.cz> <20120210025217.GA9258@jl-vm1.vm.bytemark.co.uk> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20120210025217.GA9258@jl-vm1.vm.bytemark.co.uk> User-Agent: Mutt/1.5.20 (2009-06-14) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri 10-02-12 02:52:17, Jamie Lokier wrote: > Alan Stern wrote: > > On Wed, 1 Feb 2012, Pavel Machek wrote: > > > > > > particular, this should help to solve a long-standing issue that, in > > > > some cases, during resume from hibernation the boot loader causes the > > > > journal to be replied for the filesystem containing the kernel image > > > > and/or initrd causing it to become inconsistent with the information > > > > stored in the hibernation image. > > > > > > Ungood. Why is bootloader/initrd doing that? If it mounts filesystem > > > read/write, what is the guarantee that it will not change data on the > > > filesystem, breaking stuff? > > > > > > Bootloaders should just not replay journals. > > > > > > > The user-space-driven hibernation (s2disk) is not covered by this > > > > change, because the freezing of filesystems prevents s2disk from > > > > accessing device special files it needs to do its job. > > > > > > ...so bootloaders need to be fixed, anyway. > > > > I don't know about bootloaders, but from what I've heard, Linux fs > > drivers (including those inside initrds) always replay the journal, > > even if the filesystem is mounted read-only. This could be considered > > a bug in the filesystem code. > > Theoretically a filesystem might need replay for the bootloader to see > a non-corrupt image, even for just the files it uses. > > For example if the last state was in the middle of updating the root > directory, the /boot entry in the root directory might not be reliably > found without replaying the journal. > > However replaying a journal when mounted read-only should probably > track journalled blocks in memory only, not commit back to the storage. Yes, that would be nice. Although memory constraints of the bootloader might make it tricky. The reason why we don't have the functionality in kernel is not so much about memory demands but more about the complexities of implementing the caching - we'd have to create an equivalent of dm-snapshot of the device from the mount code to trick generic code in pagecache to loading proper replayed data). Honza -- Jan Kara SUSE Labs, CR