From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-0.8 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_PASS,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 1366DECDE46 for ; Mon, 5 Nov 2018 05:09:25 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id D960B20831 for ; Mon, 5 Nov 2018 05:09:24 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org D960B20831 Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=suse.com Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=linux-nfs-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729037AbeKEO1K (ORCPT ); Mon, 5 Nov 2018 09:27:10 -0500 Received: from mx2.suse.de ([195.135.220.15]:52524 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1728918AbeKEO1J (ORCPT ); Mon, 5 Nov 2018 09:27:09 -0500 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id 320DDAE8F; Mon, 5 Nov 2018 05:09:20 +0000 (UTC) From: NeilBrown To: Malahal Naineni Date: Mon, 05 Nov 2018 16:09:10 +1100 Cc: eshel@us.ibm.com, bcodding@redhat.com, linux-nfs@vger.kernel.org, linux-nfs-owner@vger.kernel.org, mbenjami@redhat.com, trondmy@hammerspace.com Subject: Re: "(deleted)" directories In-Reply-To: References: <24BEBD2F-BCC2-4AE0-81D7-185D6CAB8CD7@redhat.com> <435a5a0fcdbefc30201c91b0a36b6159f6df32eb.camel@hammerspace.com> <87a7mp1k9a.fsf@notabene.neil.brown.name> <87sh0gz7mt.fsf@notabene.neil.brown.name> Message-ID: <87muqoyutl.fsf@notabene.neil.brown.name> MIME-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha256; protocol="application/pgp-signature" Sender: linux-nfs-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org --=-=-= Content-Type: text/plain On Mon, Nov 05 2018, Malahal Naineni wrote: >> Do we know exactly why the FH changed in this particular circumstance? > > In this instance, this is due to a code bug but obviously, there are > legitimate cases where this occur with Ganesha. Good to know that bug has been found, and presumably fixed. It is not obvious to me that there are any such legitimate cases for directories. > >> (I'm particularly thinking of volatile file handles). > > NFS4 RFC has "unique filehandles" concept as well. Linux NFS client > doesn't seem to use "unique filehandles" attribute as well. A client doesn't need to use that attribute. My reading of section 10.3.4 of RFC7530 suggests that the client should generally compare fsid and fileid to see if two different filehandles refer to the same object or not. If unique_handles is known to be set for a given fsid, then different filehandles imply different files, without bothering to check the fileid. So the use of unique_handles is an optimization. I haven't looked at the Linux/NFS code to see if it conforms to 10.3.4. NeilBrown > > On Mon, Nov 5, 2018 at 6:02 AM NeilBrown wrote: >> >> On Sun, Nov 04 2018, Marc Eshel wrote: >> >> > linux-nfs-owner@vger.kernel.org wrote on 11/03/2018 10:31:29 PM: >> > >> >> From: NeilBrown >> >> To: Marc Eshel , Trond Myklebust >> > >> >> Cc: "bcodding\@redhat.com" , "linux-nfs >> >> \@vger.kernel.org" , linux-nfs- >> >> owner@vger.kernel.org, "malahal\@gmail.com" , >> >> "mbenjami\@redhat.com" >> >> Date: 11/03/2018 10:41 PM >> >> Subject: Re: "(deleted)" directories >> >> Sent by: linux-nfs-owner@vger.kernel.org >> >> >> >> On Fri, Nov 02 2018, Marc Eshel wrote: >> >> >> >> > One reason to have different FHs for the same file is that a file can >> > be >> >> > linked from multiple directories. >> >> >> >> This has some based when considering filehandles for non-directories. >> >> However the original problem was with filehandles for directories..... >> > >> > This was just an example of why FH might be different, I don't think we >> > depend on it for the parent information anymore. Malahal listed some other >> > reasons for having different FH for the same file. I believe that Ganesha >> > split the FH to the key portion (the unique id of the file) and some other >> > information that is file system dependent. If the NFS client can not >> > handle the spec definition of FH maybe the spec should be updated to >> > something like Ganesha does. >> > Marc. >> >> Do we know exactly why the FH changed in this particular circumstance? >> Is there some way to find out? >> >> The NFSv3 spec has been updated - it is called "NFSv4" (now 4.2). It >> says a lot more things about filehandles, but even there, the spec is >> only as good as the what has been implemented and tested. I'm pretty >> sure that there are parts of the FH spec that have never been put into >> practice - so using them would not be wise (I'm particularly thinking of >> volatile file handles). >> >> For better or worse, Linux requires directories to have stable >> filehandles for NFSv3. This requirement is effectively imposed by the >> dcache. If there were some way to reliably check if two filehandles >> referred to the same directory, then we could relax that restriction, >> but I don't think there is. >> >> I think the other possible reason mentioned for changing the filehandle >> is to support migration. NFSv3 definitely doesn't support migration. >> NFSv4 explicitly tries to. >> >> NeilBrown >> >> >> > >> >> >> >> > Adding the parent inode to the FH help finding the the name of the >> > file by >> >> > looking for the file inode in >> >> > the parent directoy. >> >> > >> >> >> >> ....and directories have a ".." link, obviating the need to store parent >> >> information in the filehandle. >> >> >> >> NeilBrown >> >> >> >> >> >> > Marc. >> >> > >> >> > linux-nfs-owner@vger.kernel.org wrote on 11/02/2018 05:15:42 PM: >> >> > >> >> >> From: Trond Myklebust >> >> >> To: "mbenjami@redhat.com" >> >> >> Cc: "bcodding@redhat.com" , "malahal@gmail.com" >> >> >> , "linux-nfs@vger.kernel.org" >> >> > >> >> >> Date: 11/02/2018 05:15 PM >> >> >> Subject: Re: "(deleted)" directories >> >> >> Sent by: linux-nfs-owner@vger.kernel.org >> >> >> >> >> >> On Fri, 2018-11-02 at 18:07 -0400, Matt Benjamin wrote: >> >> >> > It sounds like a pretty good one, that goes to the heart of what a >> >> >> > specification is >> >> >> > >> >> >> >> >> >> While admittedly it is (still) Dia de los Muertos today, I would >> > think >> >> >> that someone who resurrected a part of the NFSv3 spec that has been >> >> >> unused for the full 23 years of its existence might have some >> >> >> explanation for why they did so? >> >> >> >> >> >> IOW: not being of a particularly religious persuasion, I usually want >> >> >> to understand why features are needed rather than having blind faith >> > in >> >> >> the person who wrote the spec. >> >> >> >> >> >> > Matt >> >> >> > >> >> >> > On Fri, Nov 2, 2018 at 4:26 PM, Trond Myklebust < >> >> >> > trondmy@hammerspace.com> wrote: >> >> >> > > On Fri, 2018-11-02 at 21:24 +0530, Malahal Naineni wrote: >> >> >> > > > Ben, NFSv3 RFC1813.txt states: "If two file handles from the >> > same >> >> >> > > > server are equal, they must refer to the same file, but if >> >> >> > > > they are >> >> >> > > > not equal, no conclusions can be drawn." Ganesha does return >> > same >> >> >> > > > fileid here (inode). >> >> >> > > > >> >> >> > > > In NFSv4, they have introduced "unique_handles" attribute. I >> >> >> > > > don't >> >> >> > > > see >> >> >> > > > Linux NFS client using this at all though. >> >> >> > > >> >> >> > > Why does your server need to have multiple filehandles refer to >> > the >> >> >> > > same file, and why do you expect clients to support this? >> >> >> > > >> >> >> > > Yes, the spec allows it, but that's not a sufficient reason. >> >> >> > > >> >> >> > > > Regards, Malahal. >> >> >> > > > On Fri, Nov 2, 2018 at 4:35 PM Benjamin Coddington < >> >> >> > > > bcodding@redhat.com> wrote: >> >> >> > > > > On 2 Nov 2018, at 1:26, Malahal Naineni wrote: >> >> >> > > > > >> >> >> > > > > > Hi All, we are using NFS-Ganesha with Linux NFS clients. >> > The >> >> >> > > > > > client's >> >> >> > > > > > shell reports the following. Based on lsof, the directory >> > is >> >> >> > > > > > marked >> >> >> > > > > > deleted. "cd to ROOT and cd to the same home directory >> > fixes >> >> >> > > > > > the >> >> >> > > > > > issue. The client behaves as though the directory is >> > deleted >> >> >> > > > > > and >> >> >> > > > > > recreated! Our NFS-Ganesha server implementation uses >> >> >> > > > > > multiple >> >> >> > > > > > file >> >> >> > > > > > handles that point to the same object. NFS spec says this >> >> >> > > > > > should >> >> >> > > > > > be >> >> >> > > > > > fine, but Linux NFS seems to be broken in this regard. >> >> >> > > > > > tcpdump >> >> >> > > > > > does >> >> >> > > > > > indicate file handle change (note that all file handles are >> >> >> > > > > > permanent, >> >> >> > > > > > meaning they are valid at the server any time) around this >> >> >> > > > > > issue >> >> >> > > > > > time. >> >> >> > > > > > >> >> >> > > > > > "shell-init: error retrieving current directory: getcwd: >> >> >> > > > > > cannot >> >> >> > > > > > access >> >> >> > > > > > parent directories: No such file or directory" >> >> >> > > > > > sh 112544 malahal cwd DIR >> >> >> > > > > > 0,67 >> >> >> > > > > > 65536 45605209 /home/malahal (deleted) >> >> >> > > > > > (10.120.154.42:/nfs/malahal-export/) >> >> >> > > > > > >> >> >> > > > > > Function nfs_prime_dcache() seems to invalidate the dcache >> >> >> > > > > > entry >> >> >> > > > > > if >> >> >> > > > > > nfs_same_file() returns false. nfs_same_file() does seem to >> >> >> > > > > > return >> >> >> > > > > > false with the following change, if I read it correctly, if >> >> >> > > > > > there >> >> >> > > > > > is a >> >> >> > > > > > file handle change. Can this be the source of my issue? It >> >> >> > > > > > seems >> >> >> > > > > > that >> >> >> > > > > > the client should do this only if the file handle is NOT >> >> >> > > > > > valid >> >> >> > > > > > (e.g. >> >> >> > > > > > if it gets ESTALE), right? >> >> >> > > > > > >> >> >> > > > > > The following commit seems to assume that the objects are >> >> >> > > > > > different if >> >> >> > > > > > they have different file handles! >> >> >> > > > > > commit 7dc72d5f7a0ec97a53e126c46e2cbd2560757955 >> >> >> > > > > > Author: Trond Myklebust >> >> >> > > > > > Date: Thu Sep 22 13:38:52 2016 -0400 >> >> >> > > > > > >> >> >> > > > > > NFS: Fix inode corruption in nfs_prime_dcache() >> >> >> > > > > >> >> >> > > > > My understanding is that for NFSv3 we have to assume that >> >> >> > > > > distinct >> >> >> > > > > filehandles are distinct objects, but maybe I'm wrong about >> >> >> > > > > this. >> >> >> > > > > >> >> >> > > > > For NFSv4.x, we can follow the guidance in RFCs 5661 or 7530 >> >> >> > > > > section 10.3.4 >> >> >> > > > > to determine if the differing filehandles are the same >> > object, >> >> >> > > > > specifically >> >> >> > > > > the fileid recommended attribute needs to be implemented. Is >> >> >> > > > > Ganesha >> >> >> > > > > returning the same fileid for both filehandles? >> >> >> > > > > >> >> >> > > > > Ben >> >> >> > > -- >> >> >> > > Trond Myklebust >> >> >> > > CTO, Hammerspace Inc >> >> >> > > 4300 El Camino Real, Suite 105 >> >> >> > > Los Altos, CA 94022 >> >> >> > > www.hammer.space >> >> >> > > >> >> >> > > >> >> >> > >> >> >> > >> >> >> -- >> >> >> Trond Myklebust >> >> >> CTO, Hammerspace Inc >> >> >> 4300 El Camino Real, Suite 105 >> >> >> Los Altos, CA 94022 >> >> >> www.hammer.space >> >> >> >> >> >> >> >> [attachment "signature.asc" deleted by Marc Eshel/Almaden/IBM] --=-=-= Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iQIzBAEBCAAdFiEEG8Yp69OQ2HB7X0l6Oeye3VZigbkFAlvf0HcACgkQOeye3VZi gbnYiA//TX/6e/UmHIYTSFXmbXBzZwawRiHKC/ys1E7YjjEKvlKh59nho+uAMyem zw4tL4b1DeaUciX1/HSEDvecf8fv7MKa18dbq77KTcfiJ0rq2e0gxrx2Pv4V+/Zm kc4fZtLYySlGL+/yXtAs5GZoxhkBCCmFn2VKHfjYzAtTb1LuMauSLiaf4y7+4iF3 o56sAfMzyQjtYm3kdt/VRBLrhZ2drvd7bvfeVcm8BuqO1opCV8lCv/hz2M3mRXoU RaOFmd84FZgGArWctStwqQ19CSQZP1PMpL/pn4w/bB3PeELIkoY6T+MKlyixILtr vPaNi/DTwA2QZa+4EK3/qYKM/ySD5d8qAjmSGu07VEkrdCwfhh4zuvSv+fvrOGi4 AAVJTfeRGqa35p+Rl6h9wMMPhTwjLVHR6+asovyl8qdSd7NS9coKcIlOjkOw0iNq GGnKVJrk6ZowRvvSIiwea8c52fH6JWL25NTzrhtnKhCM9j3yZ45jnEFGy4BWbFi2 NH8Swp0nWk+EiS9xNWzl1tfoQWvJvaxUXURP4YWqfKSp1A+7jHHJFl7hKIVZTgKk PeCxguOemQN/AvqyFCwPBSQoBQcFlD7wnnNILYrZ+DTyt41RV+9eeXQnHOUv9hLY sng0nxC5SM3+6NdXI8EdXpuXscwgxMQbOQhf4eNN9yGaA8gt+Mo= =peqc -----END PGP SIGNATURE----- --=-=-=--