* [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc @ 2009-09-08 22:05 Chuck Lever [not found] ` <20090908220230.7590.69833.stgit-RytpoXr2tKZ9HhUboXbp9zCvJB+x5qRC@public.gmane.org> 0 siblings, 1 reply; 8+ messages in thread From: Chuck Lever @ 2009-09-08 22:05 UTC (permalink / raw) To: trond.myklebust; +Cc: linux-nfs Don't dive into memory reclaim in the NFS direct I/O paths, otherwise we can deadlock. Reported by: Wengang Wang <wen.gang.wang@oracle.com> Fix-suggested-by: Zach Brown <zach.brown@oracle.com> Signed-off-by: Chuck Lever <chuck.lever@oracle.com> --- Trond- Thoughts? fs/nfs/direct.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/fs/nfs/direct.c b/fs/nfs/direct.c index e4e089a..00f06b8 100644 --- a/fs/nfs/direct.c +++ b/fs/nfs/direct.c @@ -149,7 +149,7 @@ static inline struct nfs_direct_req *nfs_direct_req_alloc(void) { struct nfs_direct_req *dreq; - dreq = kmem_cache_alloc(nfs_direct_cachep, GFP_KERNEL); + dreq = kmem_cache_alloc(nfs_direct_cachep, GFP_NOFS); if (!dreq) return NULL; ^ permalink raw reply related [flat|nested] 8+ messages in thread
[parent not found: <20090908220230.7590.69833.stgit-RytpoXr2tKZ9HhUboXbp9zCvJB+x5qRC@public.gmane.org>]
* Re: [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc [not found] ` <20090908220230.7590.69833.stgit-RytpoXr2tKZ9HhUboXbp9zCvJB+x5qRC@public.gmane.org> @ 2009-09-08 22:32 ` Trond Myklebust [not found] ` <1252449178.8099.64.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org> 0 siblings, 1 reply; 8+ messages in thread From: Trond Myklebust @ 2009-09-08 22:32 UTC (permalink / raw) To: Chuck Lever; +Cc: linux-nfs On Tue, 2009-09-08 at 18:05 -0400, Chuck Lever wrote: > Don't dive into memory reclaim in the NFS direct I/O paths, otherwise > we can deadlock. > > Reported by: Wengang Wang <wen.gang.wang@oracle.com> > Fix-suggested-by: Zach Brown <zach.brown@oracle.com> > Signed-off-by: Chuck Lever <chuck.lever@oracle.com> Wait... What??? How does an O_DIRECT read or write allocation deadlock with memory reclaim? Both the read and the write path call nfs_direct_req_alloc() before they pin any user pages in memory. Trond > --- > > Trond- > > Thoughts? > > fs/nfs/direct.c | 2 +- > 1 files changed, 1 insertions(+), 1 deletions(-) > > diff --git a/fs/nfs/direct.c b/fs/nfs/direct.c > index e4e089a..00f06b8 100644 > --- a/fs/nfs/direct.c > +++ b/fs/nfs/direct.c > @@ -149,7 +149,7 @@ static inline struct nfs_direct_req *nfs_direct_req_alloc(void) > { > struct nfs_direct_req *dreq; > > - dreq = kmem_cache_alloc(nfs_direct_cachep, GFP_KERNEL); > + dreq = kmem_cache_alloc(nfs_direct_cachep, GFP_NOFS); > if (!dreq) > return NULL; > > > -- > To unsubscribe from this list: send the line "unsubscribe linux-nfs" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html -- Trond Myklebust Linux NFS client maintainer NetApp Trond.Myklebust@netapp.com www.netapp.com ^ permalink raw reply [flat|nested] 8+ messages in thread
[parent not found: <1252449178.8099.64.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>]
* Re: [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc [not found] ` <1252449178.8099.64.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org> @ 2009-09-08 22:43 ` Chuck Lever 2009-09-08 23:05 ` Trond Myklebust 0 siblings, 1 reply; 8+ messages in thread From: Chuck Lever @ 2009-09-08 22:43 UTC (permalink / raw) To: Trond Myklebust; +Cc: linux-nfs On Sep 8, 2009, at 6:32 PM, Trond Myklebust wrote: > On Tue, 2009-09-08 at 18:05 -0400, Chuck Lever wrote: >> Don't dive into memory reclaim in the NFS direct I/O paths, otherwise >> we can deadlock. >> >> Reported by: Wengang Wang <wen.gang.wang@oracle.com> >> Fix-suggested-by: Zach Brown <zach.brown@oracle.com> >> Signed-off-by: Chuck Lever <chuck.lever@oracle.com> > > Wait... What??? How does an O_DIRECT read or write allocation deadlock > with memory reclaim? Both the read and the write path call > nfs_direct_req_alloc() before they pin any user pages in memory. This may be an issue only for loopback mounts where the backing device is an NFS O_DIRECT file. This type of deadlock may not be able to happen in upstream kernels at this point. Even so, it makes sense for this allocation to be consistent with similar allocations in the other NFS I/O paths. > Trond > >> --- >> >> Trond- >> >> Thoughts? >> >> fs/nfs/direct.c | 2 +- >> 1 files changed, 1 insertions(+), 1 deletions(-) >> >> diff --git a/fs/nfs/direct.c b/fs/nfs/direct.c >> index e4e089a..00f06b8 100644 >> --- a/fs/nfs/direct.c >> +++ b/fs/nfs/direct.c >> @@ -149,7 +149,7 @@ static inline struct nfs_direct_req >> *nfs_direct_req_alloc(void) >> { >> struct nfs_direct_req *dreq; >> >> - dreq = kmem_cache_alloc(nfs_direct_cachep, GFP_KERNEL); >> + dreq = kmem_cache_alloc(nfs_direct_cachep, GFP_NOFS); >> if (!dreq) >> return NULL; >> >> >> -- >> To unsubscribe from this list: send the line "unsubscribe linux- >> nfs" in >> the body of a message to majordomo@vger.kernel.org >> More majordomo info at http://vger.kernel.org/majordomo-info.html > > -- > Trond Myklebust > Linux NFS client maintainer > > NetApp > Trond.Myklebust@netapp.com > www.netapp.com > -- > To unsubscribe from this list: send the line "unsubscribe linux-nfs" > in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html -- Chuck Lever chuck[dot]lever[at]oracle[dot]com ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc 2009-09-08 22:43 ` Chuck Lever @ 2009-09-08 23:05 ` Trond Myklebust [not found] ` <1252451130.8099.81.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org> 0 siblings, 1 reply; 8+ messages in thread From: Trond Myklebust @ 2009-09-08 23:05 UTC (permalink / raw) To: Chuck Lever; +Cc: linux-nfs On Tue, 2009-09-08 at 18:43 -0400, Chuck Lever wrote: > On Sep 8, 2009, at 6:32 PM, Trond Myklebust wrote: > > On Tue, 2009-09-08 at 18:05 -0400, Chuck Lever wrote: > >> Don't dive into memory reclaim in the NFS direct I/O paths, otherwise > >> we can deadlock. > >> > >> Reported by: Wengang Wang <wen.gang.wang@oracle.com> > >> Fix-suggested-by: Zach Brown <zach.brown@oracle.com> > >> Signed-off-by: Chuck Lever <chuck.lever@oracle.com> > > > > Wait... What??? How does an O_DIRECT read or write allocation deadlock > > with memory reclaim? Both the read and the write path call > > nfs_direct_req_alloc() before they pin any user pages in memory. > > This may be an issue only for loopback mounts where the backing device > is an NFS O_DIRECT file. This type of deadlock may not be able to > happen in upstream kernels at this point. I don't see how that makes any difference whatsoever. If the backing device is a non-O_DIRECT file, then you have GFP_KERNEL allocation of the pages. Anything that calls down into a filesystem on a read() or write() path had better not assume that it won't block. > Even so, it makes sense for this allocation to be consistent with > similar allocations in the other NFS I/O paths. I don't buy the 'symmetry' argument. The reason for the GFP_NOFS in the nfs_writedata_alloc() is that you have a deadlock when the VM calls ->writepages() in order to reclaim memory. That is not the case here, and so this is not a symmetrical case. -- Trond Myklebust Linux NFS client maintainer NetApp Trond.Myklebust@netapp.com www.netapp.com ^ permalink raw reply [flat|nested] 8+ messages in thread
[parent not found: <1252451130.8099.81.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>]
* Re: [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc [not found] ` <1252451130.8099.81.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org> @ 2009-09-09 1:01 ` Chuck Lever 2009-09-09 1:37 ` Trond Myklebust 0 siblings, 1 reply; 8+ messages in thread From: Chuck Lever @ 2009-09-09 1:01 UTC (permalink / raw) To: Trond Myklebust; +Cc: linux-nfs On Sep 8, 2009, at 7:05 PM, Trond Myklebust wrote: > On Tue, 2009-09-08 at 18:43 -0400, Chuck Lever wrote: >> On Sep 8, 2009, at 6:32 PM, Trond Myklebust wrote: >>> On Tue, 2009-09-08 at 18:05 -0400, Chuck Lever wrote: >>>> Don't dive into memory reclaim in the NFS direct I/O paths, >>>> otherwise >>>> we can deadlock. >>>> >>>> Reported by: Wengang Wang <wen.gang.wang@oracle.com> >>>> Fix-suggested-by: Zach Brown <zach.brown@oracle.com> >>>> Signed-off-by: Chuck Lever <chuck.lever@oracle.com> >>> >>> Wait... What??? How does an O_DIRECT read or write allocation >>> deadlock >>> with memory reclaim? Both the read and the write path call >>> nfs_direct_req_alloc() before they pin any user pages in memory. >> >> This may be an issue only for loopback mounts where the backing >> device >> is an NFS O_DIRECT file. This type of deadlock may not be able to >> happen in upstream kernels at this point. > > I don't see how that makes any difference whatsoever. If the backing > device is a non-O_DIRECT file, then you have GFP_KERNEL allocation of > the pages. > > Anything that calls down into a filesystem on a read() or write() path > had better not assume that it won't block. Basically we're treating an O_DIRECT file just like a block device. If the block I/O path blocks when a kernel file system calls in to do a memory reclaim, we're in dutch. >> Even so, it makes sense for this allocation to be consistent with >> similar allocations in the other NFS I/O paths. > > I don't buy the 'symmetry' argument. The reason for the GFP_NOFS in > the > nfs_writedata_alloc() is that you have a deadlock when the VM calls > ->writepages() in order to reclaim memory. > That is not the case here, and so this is not a symmetrical case. That is precisely the case here, in fact. The upper file system is attempting to reclaim memory in the same kernel where the NFS client is trying to allocate with GFP_KERNEL. > > -- > Trond Myklebust > Linux NFS client maintainer > > NetApp > Trond.Myklebust@netapp.com > www.netapp.com -- Chuck Lever chuck[dot]lever[at]oracle[dot]com ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc 2009-09-09 1:01 ` Chuck Lever @ 2009-09-09 1:37 ` Trond Myklebust [not found] ` <1252460233.5092.12.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org> 0 siblings, 1 reply; 8+ messages in thread From: Trond Myklebust @ 2009-09-09 1:37 UTC (permalink / raw) To: Chuck Lever; +Cc: linux-nfs On Tue, 2009-09-08 at 21:01 -0400, Chuck Lever wrote: > On Sep 8, 2009, at 7:05 PM, Trond Myklebust wrote: > > On Tue, 2009-09-08 at 18:43 -0400, Chuck Lever wrote: > >> On Sep 8, 2009, at 6:32 PM, Trond Myklebust wrote: > >>> On Tue, 2009-09-08 at 18:05 -0400, Chuck Lever wrote: > >>>> Don't dive into memory reclaim in the NFS direct I/O paths, > >>>> otherwise > >>>> we can deadlock. > >>>> > >>>> Reported by: Wengang Wang <wen.gang.wang@oracle.com> > >>>> Fix-suggested-by: Zach Brown <zach.brown@oracle.com> > >>>> Signed-off-by: Chuck Lever <chuck.lever@oracle.com> > >>> > >>> Wait... What??? How does an O_DIRECT read or write allocation > >>> deadlock > >>> with memory reclaim? Both the read and the write path call > >>> nfs_direct_req_alloc() before they pin any user pages in memory. > >> > >> This may be an issue only for loopback mounts where the backing > >> device > >> is an NFS O_DIRECT file. This type of deadlock may not be able to > >> happen in upstream kernels at this point. > > > > I don't see how that makes any difference whatsoever. If the backing > > device is a non-O_DIRECT file, then you have GFP_KERNEL allocation of > > the pages. > > > > Anything that calls down into a filesystem on a read() or write() path > > had better not assume that it won't block. > > Basically we're treating an O_DIRECT file just like a block device. > If the block I/O path blocks when a kernel file system calls in to do > a memory reclaim, we're in dutch. Without a lot more changelog context that explains what you are wanting to do, why it is relevant to NFS (and O_DIRECT in particular), and why you can't work around it in other ways (PF_MEMALLOC comes to mind), I'm not at all interested in applying this patch. > >> Even so, it makes sense for this allocation to be consistent with > >> similar allocations in the other NFS I/O paths. > > > > I don't buy the 'symmetry' argument. The reason for the GFP_NOFS in > > the > > nfs_writedata_alloc() is that you have a deadlock when the VM calls > > ->writepages() in order to reclaim memory. > > That is not the case here, and so this is not a symmetrical case. > > That is precisely the case here, in fact. The upper file system is > attempting to reclaim memory in the same kernel where the NFS client > is trying to allocate with GFP_KERNEL. That's the "upper file system"'s problem, not ours... Stacking filesystems causes issues. Screwing over the existing users of the underlying filesystem is not a fix for those issues... -- Trond Myklebust Linux NFS client maintainer NetApp Trond.Myklebust@netapp.com www.netapp.com ^ permalink raw reply [flat|nested] 8+ messages in thread
[parent not found: <1252460233.5092.12.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>]
* Re: [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc [not found] ` <1252460233.5092.12.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org> @ 2009-09-09 2:16 ` Chuck Lever 2009-09-09 3:49 ` Trond Myklebust 0 siblings, 1 reply; 8+ messages in thread From: Chuck Lever @ 2009-09-09 2:16 UTC (permalink / raw) To: Trond Myklebust; +Cc: linux-nfs On Sep 8, 2009, at 9:37 PM, Trond Myklebust wrote: > On Tue, 2009-09-08 at 21:01 -0400, Chuck Lever wrote: >> On Sep 8, 2009, at 7:05 PM, Trond Myklebust wrote: >>> On Tue, 2009-09-08 at 18:43 -0400, Chuck Lever wrote: >>>> On Sep 8, 2009, at 6:32 PM, Trond Myklebust wrote: >>>>> On Tue, 2009-09-08 at 18:05 -0400, Chuck Lever wrote: >>>>>> Don't dive into memory reclaim in the NFS direct I/O paths, >>>>>> otherwise >>>>>> we can deadlock. >>>>>> >>>>>> Reported by: Wengang Wang <wen.gang.wang@oracle.com> >>>>>> Fix-suggested-by: Zach Brown <zach.brown@oracle.com> >>>>>> Signed-off-by: Chuck Lever <chuck.lever@oracle.com> >>>>> >>>>> Wait... What??? How does an O_DIRECT read or write allocation >>>>> deadlock >>>>> with memory reclaim? Both the read and the write path call >>>>> nfs_direct_req_alloc() before they pin any user pages in memory. >>>> >>>> This may be an issue only for loopback mounts where the backing >>>> device >>>> is an NFS O_DIRECT file. This type of deadlock may not be able to >>>> happen in upstream kernels at this point. >>> >>> I don't see how that makes any difference whatsoever. If the backing >>> device is a non-O_DIRECT file, then you have GFP_KERNEL allocation >>> of >>> the pages. >>> >>> Anything that calls down into a filesystem on a read() or write() >>> path >>> had better not assume that it won't block. >> >> Basically we're treating an O_DIRECT file just like a block device. >> If the block I/O path blocks when a kernel file system calls in to do >> a memory reclaim, we're in dutch. > > Without a lot more changelog context that explains what you are > wanting > to do, why it is relevant to NFS (and O_DIRECT in particular), and why > you can't work around it in other ways (PF_MEMALLOC comes to mind), > I'm > not at all interested in applying this patch. I didn't ask you to apply it, I just asked for your thoughts. We have a dm target that uses an NFS file as a backing device. It converts bios to NFS read and write requests using direct and async I/ O. It's a loopback block device with a local file system residing in it. (OK, so "loopback mount" was probably not a clear way to explain what is going on). >>>> Even so, it makes sense for this allocation to be consistent with >>>> similar allocations in the other NFS I/O paths. >>> >>> I don't buy the 'symmetry' argument. The reason for the GFP_NOFS in >>> the >>> nfs_writedata_alloc() is that you have a deadlock when the VM calls >>> ->writepages() in order to reclaim memory. >>> That is not the case here, and so this is not a symmetrical case. >> >> That is precisely the case here, in fact. The upper file system is >> attempting to reclaim memory in the same kernel where the NFS client >> is trying to allocate with GFP_KERNEL. > > That's the "upper file system"'s problem, not ours... Stacking > filesystems causes issues. Screwing over the existing users of the > underlying filesystem is not a fix for those issues... How does this change "screw over" the existing users of NFS O_DIRECT? -- Chuck Lever chuck[dot]lever[at]oracle[dot]com ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc 2009-09-09 2:16 ` Chuck Lever @ 2009-09-09 3:49 ` Trond Myklebust 0 siblings, 0 replies; 8+ messages in thread From: Trond Myklebust @ 2009-09-09 3:49 UTC (permalink / raw) To: Chuck Lever; +Cc: linux-nfs On Tue, 2009-09-08 at 22:16 -0400, Chuck Lever wrote: > On Sep 8, 2009, at 9:37 PM, Trond Myklebust wrote: > > On Tue, 2009-09-08 at 21:01 -0400, Chuck Lever wrote: > >> That is precisely the case here, in fact. The upper file system is > >> attempting to reclaim memory in the same kernel where the NFS client > >> is trying to allocate with GFP_KERNEL. > > > > That's the "upper file system"'s problem, not ours... Stacking > > filesystems causes issues. Screwing over the existing users of the > > underlying filesystem is not a fix for those issues... > > How does this change "screw over" the existing users of NFS O_DIRECT? If they are low on memory, and call read() or write() on an O_DIRECT file, the kernel will fail to start the necessary memory reclaim. Given that by far the most common users of NFS O_DIRECT these days tend to be large databases with rather heavy memory requirements our code changes should rather be moving in the opposite direction w.r.t. use of GFP_NOFS. -- Trond Myklebust Linux NFS client maintainer NetApp Trond.Myklebust@netapp.com www.netapp.com ^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2009-09-09 3:50 UTC | newest] Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2009-09-08 22:05 [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc Chuck Lever [not found] ` <20090908220230.7590.69833.stgit-RytpoXr2tKZ9HhUboXbp9zCvJB+x5qRC@public.gmane.org> 2009-09-08 22:32 ` Trond Myklebust [not found] ` <1252449178.8099.64.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org> 2009-09-08 22:43 ` Chuck Lever 2009-09-08 23:05 ` Trond Myklebust [not found] ` <1252451130.8099.81.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org> 2009-09-09 1:01 ` Chuck Lever 2009-09-09 1:37 ` Trond Myklebust [not found] ` <1252460233.5092.12.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org> 2009-09-09 2:16 ` Chuck Lever 2009-09-09 3:49 ` Trond Myklebust
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.