From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752758Ab2DWPXi (ORCPT ); Mon, 23 Apr 2012 11:23:38 -0400 Received: from mail-wg0-f44.google.com ([74.125.82.44]:50477 "EHLO mail-wg0-f44.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752311Ab2DWPXg (ORCPT ); Mon, 23 Apr 2012 11:23:36 -0400 From: Miklos Szeredi To: Chuck Lever Cc: "J. Bruce Fields" , Jeff Layton , Malahal Naineni , Steve Dickson , linux-fsdevel@vger.kernel.org, linux-nfs@vger.kernel.org, linux-kernel@vger.kernel.org, viro@zeniv.linux.org.uk, hch@infradead.org, michael.brantley@deshaw.com, sven.breuner@itwm.fraunhofer.de, pstaubach@exagrid.com, trond.myklebust@fys.uio.no, rees@umich.edu Subject: Re: [PATCH RFC v3] vfs: make fstatat retry once on ESTALE errors from getattr call References: <20120420104055.511e15bc@tlielax.poochiereds.net> <4F91C49D.8070908@RedHat.com> <20120420203725.GA3512@us.ibm.com> <20120420171314.73801874@corrin.poochiereds.net> <20120423080012.7c23ef24@tlielax.poochiereds.net> <20120423130009.GA13681@fieldses.org> <20120423091255.00f926c4@tlielax.poochiereds.net> <20120423133412.GB13681@fieldses.org> <20120423095021.1a91a23b@tlielax.poochiereds.net> <20120423135456.GC13681@fieldses.org> <87hawasdrb.fsf@tucsk.pomaz.szeredi.hu> <9991AAC7-5256-4AFE-9722-7AF1119EE7BE@oracle.com> Date: Mon, 23 Apr 2012 17:23:54 +0200 In-Reply-To: <9991AAC7-5256-4AFE-9722-7AF1119EE7BE@oracle.com> (Chuck Lever's message of "Mon, 23 Apr 2012 11:02:32 -0400") Message-ID: <874nsasc8l.fsf@tucsk.pomaz.szeredi.hu> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.3 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Chuck Lever writes: > On Apr 23, 2012, at 10:51 AM, Miklos Szeredi wrote: > >> "J. Bruce Fields" writes: >> >>> >>> I also wonder whether it would be making too many assumptions about the >>> server or filesystem: just because ordinary posix interfaces don't allow >>> atomic replacement of a whole directory tree doesn't mean the server >>> might not have some way to do it. >> >> Exactly because posix limits the atomic replacement to empty directories >> is that this feature is not useful and is why linux can get away with >> the dead directory behavior in this case. And thinking about fixing >> this in NFS is completely pointless since no one will rely on the atomic >> replacement behavior. Fixing local filesystems is also pointless for >> the same reason. >> >> Atomic replacement of whole directory trees would indeed be more useful, >> but it's highly unlikely to be used anywhere since applications relying >> on this feature would be limited to special filesystems that allow this. > > The cases I can think of have to do with file system restore, file > system and block device snapshots, and so on. This type of use case > may not practical on today's Linux server, but they are a reality for > anyone using high-end NFS storage. Problem with this is: if some directory file handles are stale (e.g. due to directories being removed, recreated, moved around) and on the client there are cwd-s referring to these handles, then they are going to become stale no matter what you do, even if the same path does exist after the restore on the server. Thanks, Miklos