All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc
@ 2009-09-08 22:05 Chuck Lever
       [not found] ` <20090908220230.7590.69833.stgit-RytpoXr2tKZ9HhUboXbp9zCvJB+x5qRC@public.gmane.org>
  0 siblings, 1 reply; 8+ messages in thread
From: Chuck Lever @ 2009-09-08 22:05 UTC (permalink / raw)
  To: trond.myklebust; +Cc: linux-nfs

Don't dive into memory reclaim in the NFS direct I/O paths, otherwise
we can deadlock.

Reported by: Wengang Wang <wen.gang.wang@oracle.com>
Fix-suggested-by: Zach Brown <zach.brown@oracle.com>
Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
---

Trond-

Thoughts?

 fs/nfs/direct.c |    2 +-
 1 files changed, 1 insertions(+), 1 deletions(-)

diff --git a/fs/nfs/direct.c b/fs/nfs/direct.c
index e4e089a..00f06b8 100644
--- a/fs/nfs/direct.c
+++ b/fs/nfs/direct.c
@@ -149,7 +149,7 @@ static inline struct nfs_direct_req *nfs_direct_req_alloc(void)
 {
 	struct nfs_direct_req *dreq;
 
-	dreq = kmem_cache_alloc(nfs_direct_cachep, GFP_KERNEL);
+	dreq = kmem_cache_alloc(nfs_direct_cachep, GFP_NOFS);
 	if (!dreq)
 		return NULL;
 


^ permalink raw reply related	[flat|nested] 8+ messages in thread

* Re: [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc
       [not found] ` <20090908220230.7590.69833.stgit-RytpoXr2tKZ9HhUboXbp9zCvJB+x5qRC@public.gmane.org>
@ 2009-09-08 22:32   ` Trond Myklebust
       [not found]     ` <1252449178.8099.64.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
  0 siblings, 1 reply; 8+ messages in thread
From: Trond Myklebust @ 2009-09-08 22:32 UTC (permalink / raw)
  To: Chuck Lever; +Cc: linux-nfs

On Tue, 2009-09-08 at 18:05 -0400, Chuck Lever wrote:
> Don't dive into memory reclaim in the NFS direct I/O paths, otherwise
> we can deadlock.
> 
> Reported by: Wengang Wang <wen.gang.wang@oracle.com>
> Fix-suggested-by: Zach Brown <zach.brown@oracle.com>
> Signed-off-by: Chuck Lever <chuck.lever@oracle.com>

Wait... What??? How does an O_DIRECT read or write allocation deadlock
with memory reclaim? Both the read and the write path call
nfs_direct_req_alloc() before they pin any user pages in memory.

Trond

> ---
> 
> Trond-
> 
> Thoughts?
> 
>  fs/nfs/direct.c |    2 +-
>  1 files changed, 1 insertions(+), 1 deletions(-)
> 
> diff --git a/fs/nfs/direct.c b/fs/nfs/direct.c
> index e4e089a..00f06b8 100644
> --- a/fs/nfs/direct.c
> +++ b/fs/nfs/direct.c
> @@ -149,7 +149,7 @@ static inline struct nfs_direct_req *nfs_direct_req_alloc(void)
>  {
>  	struct nfs_direct_req *dreq;
>  
> -	dreq = kmem_cache_alloc(nfs_direct_cachep, GFP_KERNEL);
> +	dreq = kmem_cache_alloc(nfs_direct_cachep, GFP_NOFS);
>  	if (!dreq)
>  		return NULL;
>  
> 
> --
> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

-- 
Trond Myklebust
Linux NFS client maintainer

NetApp
Trond.Myklebust@netapp.com
www.netapp.com

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc
       [not found]     ` <1252449178.8099.64.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
@ 2009-09-08 22:43       ` Chuck Lever
  2009-09-08 23:05         ` Trond Myklebust
  0 siblings, 1 reply; 8+ messages in thread
From: Chuck Lever @ 2009-09-08 22:43 UTC (permalink / raw)
  To: Trond Myklebust; +Cc: linux-nfs

On Sep 8, 2009, at 6:32 PM, Trond Myklebust wrote:
> On Tue, 2009-09-08 at 18:05 -0400, Chuck Lever wrote:
>> Don't dive into memory reclaim in the NFS direct I/O paths, otherwise
>> we can deadlock.
>>
>> Reported by: Wengang Wang <wen.gang.wang@oracle.com>
>> Fix-suggested-by: Zach Brown <zach.brown@oracle.com>
>> Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
>
> Wait... What??? How does an O_DIRECT read or write allocation deadlock
> with memory reclaim? Both the read and the write path call
> nfs_direct_req_alloc() before they pin any user pages in memory.

This may be an issue only for loopback mounts where the backing device  
is an NFS O_DIRECT file.  This type of deadlock may not be able to  
happen in upstream kernels at this point.

Even so, it makes sense for this allocation to be consistent with  
similar allocations in the other NFS I/O paths.

> Trond
>
>> ---
>>
>> Trond-
>>
>> Thoughts?
>>
>> fs/nfs/direct.c |    2 +-
>> 1 files changed, 1 insertions(+), 1 deletions(-)
>>
>> diff --git a/fs/nfs/direct.c b/fs/nfs/direct.c
>> index e4e089a..00f06b8 100644
>> --- a/fs/nfs/direct.c
>> +++ b/fs/nfs/direct.c
>> @@ -149,7 +149,7 @@ static inline struct nfs_direct_req  
>> *nfs_direct_req_alloc(void)
>> {
>> 	struct nfs_direct_req *dreq;
>>
>> -	dreq = kmem_cache_alloc(nfs_direct_cachep, GFP_KERNEL);
>> +	dreq = kmem_cache_alloc(nfs_direct_cachep, GFP_NOFS);
>> 	if (!dreq)
>> 		return NULL;
>>
>>
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux- 
>> nfs" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>
> -- 
> Trond Myklebust
> Linux NFS client maintainer
>
> NetApp
> Trond.Myklebust@netapp.com
> www.netapp.com
> --
> To unsubscribe from this list: send the line "unsubscribe linux-nfs"  
> in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

--
Chuck Lever
chuck[dot]lever[at]oracle[dot]com




^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc
  2009-09-08 22:43       ` Chuck Lever
@ 2009-09-08 23:05         ` Trond Myklebust
       [not found]           ` <1252451130.8099.81.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
  0 siblings, 1 reply; 8+ messages in thread
From: Trond Myklebust @ 2009-09-08 23:05 UTC (permalink / raw)
  To: Chuck Lever; +Cc: linux-nfs

On Tue, 2009-09-08 at 18:43 -0400, Chuck Lever wrote:
> On Sep 8, 2009, at 6:32 PM, Trond Myklebust wrote:
> > On Tue, 2009-09-08 at 18:05 -0400, Chuck Lever wrote:
> >> Don't dive into memory reclaim in the NFS direct I/O paths, otherwise
> >> we can deadlock.
> >>
> >> Reported by: Wengang Wang <wen.gang.wang@oracle.com>
> >> Fix-suggested-by: Zach Brown <zach.brown@oracle.com>
> >> Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
> >
> > Wait... What??? How does an O_DIRECT read or write allocation deadlock
> > with memory reclaim? Both the read and the write path call
> > nfs_direct_req_alloc() before they pin any user pages in memory.
> 
> This may be an issue only for loopback mounts where the backing device  
> is an NFS O_DIRECT file.  This type of deadlock may not be able to  
> happen in upstream kernels at this point.

I don't see how that makes any difference whatsoever. If the backing
device is a non-O_DIRECT file, then you have GFP_KERNEL allocation of
the pages.

Anything that calls down into a filesystem on a read() or write() path
had better not assume that it won't block.

> Even so, it makes sense for this allocation to be consistent with  
> similar allocations in the other NFS I/O paths.

I don't buy the 'symmetry' argument. The reason for the GFP_NOFS in the
nfs_writedata_alloc() is that you have a deadlock when the VM calls
->writepages() in order to reclaim memory.
That is not the case here, and so this is not a symmetrical case.

-- 
Trond Myklebust
Linux NFS client maintainer

NetApp
Trond.Myklebust@netapp.com
www.netapp.com

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc
       [not found]           ` <1252451130.8099.81.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
@ 2009-09-09  1:01             ` Chuck Lever
  2009-09-09  1:37               ` Trond Myklebust
  0 siblings, 1 reply; 8+ messages in thread
From: Chuck Lever @ 2009-09-09  1:01 UTC (permalink / raw)
  To: Trond Myklebust; +Cc: linux-nfs

On Sep 8, 2009, at 7:05 PM, Trond Myklebust wrote:
> On Tue, 2009-09-08 at 18:43 -0400, Chuck Lever wrote:
>> On Sep 8, 2009, at 6:32 PM, Trond Myklebust wrote:
>>> On Tue, 2009-09-08 at 18:05 -0400, Chuck Lever wrote:
>>>> Don't dive into memory reclaim in the NFS direct I/O paths,  
>>>> otherwise
>>>> we can deadlock.
>>>>
>>>> Reported by: Wengang Wang <wen.gang.wang@oracle.com>
>>>> Fix-suggested-by: Zach Brown <zach.brown@oracle.com>
>>>> Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
>>>
>>> Wait... What??? How does an O_DIRECT read or write allocation  
>>> deadlock
>>> with memory reclaim? Both the read and the write path call
>>> nfs_direct_req_alloc() before they pin any user pages in memory.
>>
>> This may be an issue only for loopback mounts where the backing  
>> device
>> is an NFS O_DIRECT file.  This type of deadlock may not be able to
>> happen in upstream kernels at this point.
>
> I don't see how that makes any difference whatsoever. If the backing
> device is a non-O_DIRECT file, then you have GFP_KERNEL allocation of
> the pages.
>
> Anything that calls down into a filesystem on a read() or write() path
> had better not assume that it won't block.

Basically we're treating an O_DIRECT file just like a block device.   
If the block I/O path blocks when a kernel file system calls in to do  
a memory reclaim, we're in dutch.

>> Even so, it makes sense for this allocation to be consistent with
>> similar allocations in the other NFS I/O paths.
>
> I don't buy the 'symmetry' argument. The reason for the GFP_NOFS in  
> the
> nfs_writedata_alloc() is that you have a deadlock when the VM calls
> ->writepages() in order to reclaim memory.
> That is not the case here, and so this is not a symmetrical case.

That is precisely the case here, in fact.  The upper file system is  
attempting to reclaim memory in the same kernel where the NFS client  
is trying to allocate with GFP_KERNEL.

>
> -- 
> Trond Myklebust
> Linux NFS client maintainer
>
> NetApp
> Trond.Myklebust@netapp.com
> www.netapp.com

--
Chuck Lever
chuck[dot]lever[at]oracle[dot]com




^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc
  2009-09-09  1:01             ` Chuck Lever
@ 2009-09-09  1:37               ` Trond Myklebust
       [not found]                 ` <1252460233.5092.12.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
  0 siblings, 1 reply; 8+ messages in thread
From: Trond Myklebust @ 2009-09-09  1:37 UTC (permalink / raw)
  To: Chuck Lever; +Cc: linux-nfs

On Tue, 2009-09-08 at 21:01 -0400, Chuck Lever wrote:
> On Sep 8, 2009, at 7:05 PM, Trond Myklebust wrote:
> > On Tue, 2009-09-08 at 18:43 -0400, Chuck Lever wrote:
> >> On Sep 8, 2009, at 6:32 PM, Trond Myklebust wrote:
> >>> On Tue, 2009-09-08 at 18:05 -0400, Chuck Lever wrote:
> >>>> Don't dive into memory reclaim in the NFS direct I/O paths,  
> >>>> otherwise
> >>>> we can deadlock.
> >>>>
> >>>> Reported by: Wengang Wang <wen.gang.wang@oracle.com>
> >>>> Fix-suggested-by: Zach Brown <zach.brown@oracle.com>
> >>>> Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
> >>>
> >>> Wait... What??? How does an O_DIRECT read or write allocation  
> >>> deadlock
> >>> with memory reclaim? Both the read and the write path call
> >>> nfs_direct_req_alloc() before they pin any user pages in memory.
> >>
> >> This may be an issue only for loopback mounts where the backing  
> >> device
> >> is an NFS O_DIRECT file.  This type of deadlock may not be able to
> >> happen in upstream kernels at this point.
> >
> > I don't see how that makes any difference whatsoever. If the backing
> > device is a non-O_DIRECT file, then you have GFP_KERNEL allocation of
> > the pages.
> >
> > Anything that calls down into a filesystem on a read() or write() path
> > had better not assume that it won't block.
> 
> Basically we're treating an O_DIRECT file just like a block device.   
> If the block I/O path blocks when a kernel file system calls in to do  
> a memory reclaim, we're in dutch.

Without a lot more changelog context that explains what you are wanting
to do, why it is relevant to NFS (and O_DIRECT in particular), and why
you can't work around it in other ways (PF_MEMALLOC comes to mind), I'm
not at all interested in applying this patch.

> >> Even so, it makes sense for this allocation to be consistent with
> >> similar allocations in the other NFS I/O paths.
> >
> > I don't buy the 'symmetry' argument. The reason for the GFP_NOFS in  
> > the
> > nfs_writedata_alloc() is that you have a deadlock when the VM calls
> > ->writepages() in order to reclaim memory.
> > That is not the case here, and so this is not a symmetrical case.
> 
> That is precisely the case here, in fact.  The upper file system is  
> attempting to reclaim memory in the same kernel where the NFS client  
> is trying to allocate with GFP_KERNEL.

That's the "upper file system"'s problem, not ours... Stacking
filesystems causes issues. Screwing over the existing users of the
underlying filesystem is not a fix for those issues...

-- 
Trond Myklebust
Linux NFS client maintainer

NetApp
Trond.Myklebust@netapp.com
www.netapp.com

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc
       [not found]                 ` <1252460233.5092.12.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
@ 2009-09-09  2:16                   ` Chuck Lever
  2009-09-09  3:49                     ` Trond Myklebust
  0 siblings, 1 reply; 8+ messages in thread
From: Chuck Lever @ 2009-09-09  2:16 UTC (permalink / raw)
  To: Trond Myklebust; +Cc: linux-nfs

On Sep 8, 2009, at 9:37 PM, Trond Myklebust wrote:
> On Tue, 2009-09-08 at 21:01 -0400, Chuck Lever wrote:
>> On Sep 8, 2009, at 7:05 PM, Trond Myklebust wrote:
>>> On Tue, 2009-09-08 at 18:43 -0400, Chuck Lever wrote:
>>>> On Sep 8, 2009, at 6:32 PM, Trond Myklebust wrote:
>>>>> On Tue, 2009-09-08 at 18:05 -0400, Chuck Lever wrote:
>>>>>> Don't dive into memory reclaim in the NFS direct I/O paths,
>>>>>> otherwise
>>>>>> we can deadlock.
>>>>>>
>>>>>> Reported by: Wengang Wang <wen.gang.wang@oracle.com>
>>>>>> Fix-suggested-by: Zach Brown <zach.brown@oracle.com>
>>>>>> Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
>>>>>
>>>>> Wait... What??? How does an O_DIRECT read or write allocation
>>>>> deadlock
>>>>> with memory reclaim? Both the read and the write path call
>>>>> nfs_direct_req_alloc() before they pin any user pages in memory.
>>>>
>>>> This may be an issue only for loopback mounts where the backing
>>>> device
>>>> is an NFS O_DIRECT file.  This type of deadlock may not be able to
>>>> happen in upstream kernels at this point.
>>>
>>> I don't see how that makes any difference whatsoever. If the backing
>>> device is a non-O_DIRECT file, then you have GFP_KERNEL allocation  
>>> of
>>> the pages.
>>>
>>> Anything that calls down into a filesystem on a read() or write()  
>>> path
>>> had better not assume that it won't block.
>>
>> Basically we're treating an O_DIRECT file just like a block device.
>> If the block I/O path blocks when a kernel file system calls in to do
>> a memory reclaim, we're in dutch.
>
> Without a lot more changelog context that explains what you are  
> wanting
> to do, why it is relevant to NFS (and O_DIRECT in particular), and why
> you can't work around it in other ways (PF_MEMALLOC comes to mind),  
> I'm
> not at all interested in applying this patch.

I didn't ask you to apply it, I just asked for your thoughts.

We have a dm target that uses an NFS file as a backing device.  It  
converts bios to NFS read and write requests using direct and async I/ 
O.  It's a loopback block device with a local file system residing in  
it.

(OK, so "loopback mount" was probably not a clear way to explain what  
is going on).

>>>> Even so, it makes sense for this allocation to be consistent with
>>>> similar allocations in the other NFS I/O paths.
>>>
>>> I don't buy the 'symmetry' argument. The reason for the GFP_NOFS in
>>> the
>>> nfs_writedata_alloc() is that you have a deadlock when the VM calls
>>> ->writepages() in order to reclaim memory.
>>> That is not the case here, and so this is not a symmetrical case.
>>
>> That is precisely the case here, in fact.  The upper file system is
>> attempting to reclaim memory in the same kernel where the NFS client
>> is trying to allocate with GFP_KERNEL.
>
> That's the "upper file system"'s problem, not ours... Stacking
> filesystems causes issues. Screwing over the existing users of the
> underlying filesystem is not a fix for those issues...

How does this change "screw over" the existing users of NFS O_DIRECT?

--
Chuck Lever
chuck[dot]lever[at]oracle[dot]com




^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc
  2009-09-09  2:16                   ` Chuck Lever
@ 2009-09-09  3:49                     ` Trond Myklebust
  0 siblings, 0 replies; 8+ messages in thread
From: Trond Myklebust @ 2009-09-09  3:49 UTC (permalink / raw)
  To: Chuck Lever; +Cc: linux-nfs

On Tue, 2009-09-08 at 22:16 -0400, Chuck Lever wrote:
> On Sep 8, 2009, at 9:37 PM, Trond Myklebust wrote:
> > On Tue, 2009-09-08 at 21:01 -0400, Chuck Lever wrote:
> >> That is precisely the case here, in fact.  The upper file system is
> >> attempting to reclaim memory in the same kernel where the NFS client
> >> is trying to allocate with GFP_KERNEL.
> >
> > That's the "upper file system"'s problem, not ours... Stacking
> > filesystems causes issues. Screwing over the existing users of the
> > underlying filesystem is not a fix for those issues...
> 
> How does this change "screw over" the existing users of NFS O_DIRECT?

If they are low on memory, and call read() or write() on an O_DIRECT
file, the kernel will fail to start the necessary memory reclaim. Given
that by far the most common users of NFS O_DIRECT these days tend to be
large databases with rather heavy memory requirements our code changes
should rather be moving in the opposite direction w.r.t. use of
GFP_NOFS.

-- 
Trond Myklebust
Linux NFS client maintainer

NetApp
Trond.Myklebust@netapp.com
www.netapp.com

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2009-09-09  3:50 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2009-09-08 22:05 [PATCH] NFS: Use GFP_NOFS in nfs_direct_req_alloc Chuck Lever
     [not found] ` <20090908220230.7590.69833.stgit-RytpoXr2tKZ9HhUboXbp9zCvJB+x5qRC@public.gmane.org>
2009-09-08 22:32   ` Trond Myklebust
     [not found]     ` <1252449178.8099.64.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-09-08 22:43       ` Chuck Lever
2009-09-08 23:05         ` Trond Myklebust
     [not found]           ` <1252451130.8099.81.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-09-09  1:01             ` Chuck Lever
2009-09-09  1:37               ` Trond Myklebust
     [not found]                 ` <1252460233.5092.12.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2009-09-09  2:16                   ` Chuck Lever
2009-09-09  3:49                     ` Trond Myklebust

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.