[PATCH] mm, page_alloc: only use per-cpu allocator for irq-safe requests -fix

All of lore.kernel.org
 help / color / mirror / Atom feed

* [PATCH] mm, page_alloc: only use per-cpu allocator for irq-safe requests -fix
@ 2017-02-08 14:31 ` Mel Gorman
  0 siblings, 0 replies; 6+ messages in thread
From: Mel Gorman @ 2017-02-08 14:31 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Peter Zijlstra, Thomas Gleixner, Michal Hocko, Vlastimil Babka,
	linux-mm, linux-kernel, Ingo Molnar

preempt_enable_no_resched() was used based on review feedback that had no
strong objection at the time. It avoided introducing a preemption point
where one didn't exist before which was marginal at best.

However, it is hazardous to the RT tree according to Thomas Gleixner
and is a violation of its expected use according to Peter Zijlstra. In
Peter's own words "the only acceptable use of preempt_enable_no_resched()
is if the next statement is a schedule() variant".

The impact of using preempt_enable in this particular
fast path is negligible. This is a fix to the mmotm patch
mm-page_alloc-only-use-per-cpu-allocator-for-irq-safe-requests.patch

Signed-off-by: Mel Gorman <mgorman@techsingularity.net>
---
 mm/page_alloc.c | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index eaecb4b145e6..2a36dad03dac 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -2520,7 +2520,7 @@ void free_hot_cold_page(struct page *page, bool cold)
 	}

 out:
-	preempt_enable_no_resched();
+	preempt_enable();
 }

 /*
@@ -2686,7 +2686,7 @@ static struct page *rmqueue_pcplist(struct zone *preferred_zone,
 		__count_zid_vm_events(PGALLOC, page_zonenum(page), 1 << order);
 		zone_statistics(preferred_zone, zone);
 	}
-	preempt_enable_no_resched();
+	preempt_enable();
 	return page;
 }

^ permalink raw reply related	[flat|nested] 6+ messages in thread

* [PATCH] mm, page_alloc: only use per-cpu allocator for irq-safe requests -fix
@ 2017-02-08 14:31 ` Mel Gorman
  0 siblings, 0 replies; 6+ messages in thread
From: Mel Gorman @ 2017-02-08 14:31 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Peter Zijlstra, Thomas Gleixner, Michal Hocko, Vlastimil Babka,
	linux-mm, linux-kernel, Ingo Molnar

preempt_enable_no_resched() was used based on review feedback that had no
strong objection at the time. It avoided introducing a preemption point
where one didn't exist before which was marginal at best.

However, it is hazardous to the RT tree according to Thomas Gleixner
and is a violation of its expected use according to Peter Zijlstra. In
Peter's own words "the only acceptable use of preempt_enable_no_resched()
is if the next statement is a schedule() variant".

The impact of using preempt_enable in this particular
fast path is negligible. This is a fix to the mmotm patch
mm-page_alloc-only-use-per-cpu-allocator-for-irq-safe-requests.patch

Signed-off-by: Mel Gorman <mgorman@techsingularity.net>
---
 mm/page_alloc.c | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index eaecb4b145e6..2a36dad03dac 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -2520,7 +2520,7 @@ void free_hot_cold_page(struct page *page, bool cold)
 	}

 out:
-	preempt_enable_no_resched();
+	preempt_enable();
 }

 /*
@@ -2686,7 +2686,7 @@ static struct page *rmqueue_pcplist(struct zone *preferred_zone,
 		__count_zid_vm_events(PGALLOC, page_zonenum(page), 1 << order);
 		zone_statistics(preferred_zone, zone);
 	}
-	preempt_enable_no_resched();
+	preempt_enable();
 	return page;
 }

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply related	[flat|nested] 6+ messages in thread

* Re: [PATCH] mm, page_alloc: only use per-cpu allocator for irq-safe requests -fix
  2017-02-08 14:31 ` Mel Gorman
@ 2017-02-08 14:56   ` Thomas Gleixner
  -1 siblings, 0 replies; 6+ messages in thread
From: Thomas Gleixner @ 2017-02-08 14:56 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Andrew Morton, Peter Zijlstra, Michal Hocko, Vlastimil Babka,
	linux-mm, linux-kernel, Ingo Molnar

On Wed, 8 Feb 2017, Mel Gorman wrote:

> preempt_enable_no_resched() was used based on review feedback that had no
> strong objection at the time. It avoided introducing a preemption point
> where one didn't exist before which was marginal at best.

Actually local_irq_enable() _IS_ a preemption point, indirect but still:

   local_irq_disable()
   ....
--> HW interrupt is raised
   ....
   local_irq_enable()

   handle_irq()
	set_need_resched()
   ret_from_irq()
     preempt()

while with preempt_disable that looks like this:

   preempt_disable()
   ....
--> HW interrupt is raised
   handle_irq()
	set_need_resched()
   ret_from_irq()
   ....
   preempt_enable()
      preempt()

Now if you use preempt_enable_no_resched() then you miss the preemption and
depending on the actual code path you might run something which takes ages
without hitting a preemption point after that.

It's not only a problem for RT. It's also in mainline a violation of the
preemption mechanism.

Thanks,

	tglx

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] mm, page_alloc: only use per-cpu allocator for irq-safe requests -fix
@ 2017-02-08 14:56   ` Thomas Gleixner
  0 siblings, 0 replies; 6+ messages in thread
From: Thomas Gleixner @ 2017-02-08 14:56 UTC (permalink / raw)
  To: Mel Gorman
  Cc: Andrew Morton, Peter Zijlstra, Michal Hocko, Vlastimil Babka,
	linux-mm, linux-kernel, Ingo Molnar

On Wed, 8 Feb 2017, Mel Gorman wrote:

> preempt_enable_no_resched() was used based on review feedback that had no
> strong objection at the time. It avoided introducing a preemption point
> where one didn't exist before which was marginal at best.

Actually local_irq_enable() _IS_ a preemption point, indirect but still:

   local_irq_disable()
   ....
--> HW interrupt is raised
   ....
   local_irq_enable()

   handle_irq()
	set_need_resched()
   ret_from_irq()
     preempt()

while with preempt_disable that looks like this:

   preempt_disable()
   ....
--> HW interrupt is raised
   handle_irq()
	set_need_resched()
   ret_from_irq()
   ....
   preempt_enable()
      preempt()

Now if you use preempt_enable_no_resched() then you miss the preemption and
depending on the actual code path you might run something which takes ages
without hitting a preemption point after that.

It's not only a problem for RT. It's also in mainline a violation of the
preemption mechanism.

Thanks,

	tglx



--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] mm, page_alloc: only use per-cpu allocator for irq-safe requests -fix
  2017-02-08 14:56   ` Thomas Gleixner
@ 2017-02-08 15:12     ` Mel Gorman
  -1 siblings, 0 replies; 6+ messages in thread
From: Mel Gorman @ 2017-02-08 15:12 UTC (permalink / raw)
  To: Thomas Gleixner
  Cc: Andrew Morton, Peter Zijlstra, Michal Hocko, Vlastimil Babka,
	linux-mm, linux-kernel, Ingo Molnar

On Wed, Feb 08, 2017 at 03:56:22PM +0100, Thomas Gleixner wrote:
> On Wed, 8 Feb 2017, Mel Gorman wrote:
> 
> > preempt_enable_no_resched() was used based on review feedback that had no
> > strong objection at the time. It avoided introducing a preemption point
> > where one didn't exist before which was marginal at best.
> 
> Actually local_irq_enable() _IS_ a preemption point, indirect but still:
> 
>    local_irq_disable()
>    ....
> --> HW interrupt is raised
>    ....
>    local_irq_enable()
> 
>    handle_irq()
> 	set_need_resched()
>    ret_from_irq()
>      preempt()
> 
> while with preempt_disable that looks like this:
> 
>    preempt_disable()
>    ....
> --> HW interrupt is raised
>    handle_irq()
> 	set_need_resched()
>    ret_from_irq()
>    ....
>    preempt_enable()
>       preempt()
> 
> Now if you use preempt_enable_no_resched() then you miss the preemption and
> depending on the actual code path you might run something which takes ages
> without hitting a preemption point after that.
> 

Thanks for the education, I had missed it. The changelog should have been
"fix a dumb mistake and stick to preempt_enable".  Assuming Andrew picks
this patch up, it'll be folded into the patch that introduced the problem
in the first place and will the broken usage will never hit mainline.

> It's not only a problem for RT. It's also in mainline a violation of the
> preemption mechanism.
> 

Understood, thanks.

-- 
Mel Gorman
SUSE Labs

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] mm, page_alloc: only use per-cpu allocator for irq-safe requests -fix
@ 2017-02-08 15:12     ` Mel Gorman
  0 siblings, 0 replies; 6+ messages in thread
From: Mel Gorman @ 2017-02-08 15:12 UTC (permalink / raw)
  To: Thomas Gleixner
  Cc: Andrew Morton, Peter Zijlstra, Michal Hocko, Vlastimil Babka,
	linux-mm, linux-kernel, Ingo Molnar

On Wed, Feb 08, 2017 at 03:56:22PM +0100, Thomas Gleixner wrote:
> On Wed, 8 Feb 2017, Mel Gorman wrote:
> 
> > preempt_enable_no_resched() was used based on review feedback that had no
> > strong objection at the time. It avoided introducing a preemption point
> > where one didn't exist before which was marginal at best.
> 
> Actually local_irq_enable() _IS_ a preemption point, indirect but still:
> 
>    local_irq_disable()
>    ....
> --> HW interrupt is raised
>    ....
>    local_irq_enable()
> 
>    handle_irq()
> 	set_need_resched()
>    ret_from_irq()
>      preempt()
> 
> while with preempt_disable that looks like this:
> 
>    preempt_disable()
>    ....
> --> HW interrupt is raised
>    handle_irq()
> 	set_need_resched()
>    ret_from_irq()
>    ....
>    preempt_enable()
>       preempt()
> 
> Now if you use preempt_enable_no_resched() then you miss the preemption and
> depending on the actual code path you might run something which takes ages
> without hitting a preemption point after that.
> 

Thanks for the education, I had missed it. The changelog should have been
"fix a dumb mistake and stick to preempt_enable".  Assuming Andrew picks
this patch up, it'll be folded into the patch that introduced the problem
in the first place and will the broken usage will never hit mainline.

> It's not only a problem for RT. It's also in mainline a violation of the
> preemption mechanism.
> 

Understood, thanks.

-- 
Mel Gorman
SUSE Labs

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2017-02-08 16:35 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-02-08 14:31 [PATCH] mm, page_alloc: only use per-cpu allocator for irq-safe requests -fix Mel Gorman
2017-02-08 14:31 ` Mel Gorman
2017-02-08 14:56 ` Thomas Gleixner
2017-02-08 14:56   ` Thomas Gleixner
2017-02-08 15:12   ` Mel Gorman
2017-02-08 15:12     ` Mel Gorman

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.